:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Cangqing, Yang, Yutian, Li, Ruisi, Sun, Dan, Cai, Ruicong, Zhang, Yuzhu, Fu, Chengqian, Floyd, Lillian
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence Computation and Language
Online Access:	https://arxiv.org/abs/2404.04997
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Prompt Recursive Search: A Living Framework with Adaptive Growth in LLM Auto-Prompting
by: Zhao, Xiangyu, et al.
Published: (2024)

CA-BERT: Leveraging Context Awareness for Enhanced Multi-Turn Chat Interaction
by: Liu, Minghao, et al.
Published: (2024)

ATACompressor: Adaptive Task-Aware Compression for Efficient Long-Context Processing in LLMs
by: Li, Xuancheng, et al.
Published: (2026)

Modular Representation Compression: Adapting LLMs for Efficient and Effective Recommendations
by: Xi, Yunjia, et al.
Published: (2026)

Reinforcement Learning Approach for Integrating Compressed Contexts into Knowledge Graphs
by: Quach, Ngoc, et al.
Published: (2024)

The Power of Adaptation: Boosting In-Context Learning through Adaptive Prompting
by: Cai, Shuzhang, et al.
Published: (2024)

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression
by: Hong, Junyuan, et al.
Published: (2024)

Soft Begging: Modular and Efficient Shielding of LLMs against Prompt Injection and Jailbreaking based on Prompt Tuning
by: Ostermann, Simon, et al.
Published: (2024)

Efficient and Effective Prompt Tuning via Prompt Decomposition and Compressed Outer Product
by: Lan, Pengxiang, et al.
Published: (2025)

ZSMerge: Zero-Shot KV Cache Compression for Memory-Efficient Long-Context LLMs
by: Liu, Xin, et al.
Published: (2025)

Adapting Psycholinguistic Research for LLMs: Gender-inclusive Language in a Coreference Context
by: Bartl, Marion, et al.
Published: (2025)

EFPC: Towards Efficient and Flexible Prompt Compression
by: Cao, Yun-Hao, et al.
Published: (2025)

Adapting Whisper for Parameter-efficient Code-Switching Speech Recognition via Soft Prompt Tuning
by: Yang, Hongli, et al.
Published: (2025)

Recurrent Context Compression: Efficiently Expanding the Context Window of LLM
by: Huang, Chensen, et al.
Published: (2024)

Prompting Large Language Models for Supporting the Differential Diagnosis of Anemia
by: Castagnari, Elisa, et al.
Published: (2024)

Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation
by: Ning, Xuefei, et al.
Published: (2023)

PromptEmbedder:: Efficient and Transferable Text Embedding via Dual-LLM Soft Prompting
by: Tsai, Yu-Che, et al.
Published: (2026)

Sentinel: Decoding Context Utilization via Attention Probing for Efficient LLM Context Compression
by: Zhang, Yong, et al.
Published: (2025)

KV-Distill: Nearly Lossless Learnable Context Compression for LLMs
by: Chari, Vivek, et al.
Published: (2025)

QUITO: Accelerating Long-Context Reasoning through Query-Guided Context Compression
by: Wang, Wenshan, et al.
Published: (2024)

Accelerating Prefilling for Long-Context LLMs via Sparse Pattern Sharing
by: Peng, Dan, et al.
Published: (2025)

Dynamic Compressing Prompts for Efficient Inference of Large Language Models
by: Hu, Jinwu, et al.
Published: (2025)

C3: A Bilingual Benchmark for Spoken Dialogue Models Exploring Challenges in Complex Conversations
by: Ma, Chengqian, et al.
Published: (2025)

On the Transformations across Reward Model, Parameter Update, and In-Context Prompt
by: Cai, Deng, et al.
Published: (2024)

PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression
by: Chen, Lizhe, et al.
Published: (2025)

No Universal Prompt: Unifying Reasoning through Adaptive Prompting for Temporal Table Reasoning
by: Rajgaria, Abhishek, et al.
Published: (2025)

Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs
by: Han, Pengrui, et al.
Published: (2026)

IPS: In-Prompt Process Supervision for Short Video Content Moderation
by: Liu, Mingchao, et al.
Published: (2024)

Context Engineering 2.0: The Context of Context Engineering
by: Hua, Qishuo, et al.
Published: (2025)

Methodology of Adapting Large English Language Models for Specific Cultural Contexts
by: Zhang, Wenjing, et al.
Published: (2024)

InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory
by: Xiao, Chaojun, et al.
Published: (2024)

Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning
by: Yang, Juncheng, et al.
Published: (2024)

From Context to EDUs: Faithful and Structured Context Compression via Elementary Discourse Unit Decomposition
by: Zhou, Yiqing, et al.
Published: (2025)

Theoretical Analysis of Meta Reinforcement Learning: Generalization Bounds and Convergence Guarantees
by: Wang, Cangqing, et al.
Published: (2024)

Benchmarking and Adapting On-Device LLMs for Clinical Decision Support
by: Munim, Alif, et al.
Published: (2025)

Adapting LLMs for Minimal-edit Grammatical Error Correction
by: Staruch, Ryszard, et al.
Published: (2025)

SoftTiger: A Clinical Foundation Model for Healthcare Workflows
by: Chen, Ye, et al.
Published: (2024)

Domain-Adapted Retrieval for In-Context Annotation of Pedagogical Dialogue Acts
by: Lee, Jinsook, et al.
Published: (2026)

Discrete Prompt Compression with Reinforcement Learning
by: Jung, Hoyoun, et al.
Published: (2023)

PISanitizer: Preventing Prompt Injection to Long-Context LLMs via Prompt Sanitization
by: Geng, Runpeng, et al.
Published: (2025)