Saved in:
| Main Authors: | Wang, Cangqing, Yang, Yutian, Li, Ruisi, Sun, Dan, Cai, Ruicong, Zhang, Yuzhu, Fu, Chengqian, Floyd, Lillian |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.04997 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Prompt Recursive Search: A Living Framework with Adaptive Growth in LLM Auto-Prompting
by: Zhao, Xiangyu, et al.
Published: (2024)
by: Zhao, Xiangyu, et al.
Published: (2024)
CA-BERT: Leveraging Context Awareness for Enhanced Multi-Turn Chat Interaction
by: Liu, Minghao, et al.
Published: (2024)
by: Liu, Minghao, et al.
Published: (2024)
ATACompressor: Adaptive Task-Aware Compression for Efficient Long-Context Processing in LLMs
by: Li, Xuancheng, et al.
Published: (2026)
by: Li, Xuancheng, et al.
Published: (2026)
Modular Representation Compression: Adapting LLMs for Efficient and Effective Recommendations
by: Xi, Yunjia, et al.
Published: (2026)
by: Xi, Yunjia, et al.
Published: (2026)
Reinforcement Learning Approach for Integrating Compressed Contexts into Knowledge Graphs
by: Quach, Ngoc, et al.
Published: (2024)
by: Quach, Ngoc, et al.
Published: (2024)
The Power of Adaptation: Boosting In-Context Learning through Adaptive Prompting
by: Cai, Shuzhang, et al.
Published: (2024)
by: Cai, Shuzhang, et al.
Published: (2024)
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression
by: Hong, Junyuan, et al.
Published: (2024)
by: Hong, Junyuan, et al.
Published: (2024)
Soft Begging: Modular and Efficient Shielding of LLMs against Prompt Injection and Jailbreaking based on Prompt Tuning
by: Ostermann, Simon, et al.
Published: (2024)
by: Ostermann, Simon, et al.
Published: (2024)
Efficient and Effective Prompt Tuning via Prompt Decomposition and Compressed Outer Product
by: Lan, Pengxiang, et al.
Published: (2025)
by: Lan, Pengxiang, et al.
Published: (2025)
ZSMerge: Zero-Shot KV Cache Compression for Memory-Efficient Long-Context LLMs
by: Liu, Xin, et al.
Published: (2025)
by: Liu, Xin, et al.
Published: (2025)
Adapting Psycholinguistic Research for LLMs: Gender-inclusive Language in a Coreference Context
by: Bartl, Marion, et al.
Published: (2025)
by: Bartl, Marion, et al.
Published: (2025)
EFPC: Towards Efficient and Flexible Prompt Compression
by: Cao, Yun-Hao, et al.
Published: (2025)
by: Cao, Yun-Hao, et al.
Published: (2025)
Adapting Whisper for Parameter-efficient Code-Switching Speech Recognition via Soft Prompt Tuning
by: Yang, Hongli, et al.
Published: (2025)
by: Yang, Hongli, et al.
Published: (2025)
Recurrent Context Compression: Efficiently Expanding the Context Window of LLM
by: Huang, Chensen, et al.
Published: (2024)
by: Huang, Chensen, et al.
Published: (2024)
Prompting Large Language Models for Supporting the Differential Diagnosis of Anemia
by: Castagnari, Elisa, et al.
Published: (2024)
by: Castagnari, Elisa, et al.
Published: (2024)
Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation
by: Ning, Xuefei, et al.
Published: (2023)
by: Ning, Xuefei, et al.
Published: (2023)
PromptEmbedder:: Efficient and Transferable Text Embedding via Dual-LLM Soft Prompting
by: Tsai, Yu-Che, et al.
Published: (2026)
by: Tsai, Yu-Che, et al.
Published: (2026)
Sentinel: Decoding Context Utilization via Attention Probing for Efficient LLM Context Compression
by: Zhang, Yong, et al.
Published: (2025)
by: Zhang, Yong, et al.
Published: (2025)
KV-Distill: Nearly Lossless Learnable Context Compression for LLMs
by: Chari, Vivek, et al.
Published: (2025)
by: Chari, Vivek, et al.
Published: (2025)
QUITO: Accelerating Long-Context Reasoning through Query-Guided Context Compression
by: Wang, Wenshan, et al.
Published: (2024)
by: Wang, Wenshan, et al.
Published: (2024)
Accelerating Prefilling for Long-Context LLMs via Sparse Pattern Sharing
by: Peng, Dan, et al.
Published: (2025)
by: Peng, Dan, et al.
Published: (2025)
Dynamic Compressing Prompts for Efficient Inference of Large Language Models
by: Hu, Jinwu, et al.
Published: (2025)
by: Hu, Jinwu, et al.
Published: (2025)
C3: A Bilingual Benchmark for Spoken Dialogue Models Exploring Challenges in Complex Conversations
by: Ma, Chengqian, et al.
Published: (2025)
by: Ma, Chengqian, et al.
Published: (2025)
On the Transformations across Reward Model, Parameter Update, and In-Context Prompt
by: Cai, Deng, et al.
Published: (2024)
by: Cai, Deng, et al.
Published: (2024)
PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression
by: Chen, Lizhe, et al.
Published: (2025)
by: Chen, Lizhe, et al.
Published: (2025)
No Universal Prompt: Unifying Reasoning through Adaptive Prompting for Temporal Table Reasoning
by: Rajgaria, Abhishek, et al.
Published: (2025)
by: Rajgaria, Abhishek, et al.
Published: (2025)
Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs
by: Han, Pengrui, et al.
Published: (2026)
by: Han, Pengrui, et al.
Published: (2026)
IPS: In-Prompt Process Supervision for Short Video Content Moderation
by: Liu, Mingchao, et al.
Published: (2024)
by: Liu, Mingchao, et al.
Published: (2024)
Context Engineering 2.0: The Context of Context Engineering
by: Hua, Qishuo, et al.
Published: (2025)
by: Hua, Qishuo, et al.
Published: (2025)
Methodology of Adapting Large English Language Models for Specific Cultural Contexts
by: Zhang, Wenjing, et al.
Published: (2024)
by: Zhang, Wenjing, et al.
Published: (2024)
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory
by: Xiao, Chaojun, et al.
Published: (2024)
by: Xiao, Chaojun, et al.
Published: (2024)
Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning
by: Yang, Juncheng, et al.
Published: (2024)
by: Yang, Juncheng, et al.
Published: (2024)
From Context to EDUs: Faithful and Structured Context Compression via Elementary Discourse Unit Decomposition
by: Zhou, Yiqing, et al.
Published: (2025)
by: Zhou, Yiqing, et al.
Published: (2025)
Theoretical Analysis of Meta Reinforcement Learning: Generalization Bounds and Convergence Guarantees
by: Wang, Cangqing, et al.
Published: (2024)
by: Wang, Cangqing, et al.
Published: (2024)
Benchmarking and Adapting On-Device LLMs for Clinical Decision Support
by: Munim, Alif, et al.
Published: (2025)
by: Munim, Alif, et al.
Published: (2025)
Adapting LLMs for Minimal-edit Grammatical Error Correction
by: Staruch, Ryszard, et al.
Published: (2025)
by: Staruch, Ryszard, et al.
Published: (2025)
SoftTiger: A Clinical Foundation Model for Healthcare Workflows
by: Chen, Ye, et al.
Published: (2024)
by: Chen, Ye, et al.
Published: (2024)
Domain-Adapted Retrieval for In-Context Annotation of Pedagogical Dialogue Acts
by: Lee, Jinsook, et al.
Published: (2026)
by: Lee, Jinsook, et al.
Published: (2026)
Discrete Prompt Compression with Reinforcement Learning
by: Jung, Hoyoun, et al.
Published: (2023)
by: Jung, Hoyoun, et al.
Published: (2023)
PISanitizer: Preventing Prompt Injection to Long-Context LLMs via Prompt Sanitization
by: Geng, Runpeng, et al.
Published: (2025)
by: Geng, Runpeng, et al.
Published: (2025)
Similar Items
-
Prompt Recursive Search: A Living Framework with Adaptive Growth in LLM Auto-Prompting
by: Zhao, Xiangyu, et al.
Published: (2024) -
CA-BERT: Leveraging Context Awareness for Enhanced Multi-Turn Chat Interaction
by: Liu, Minghao, et al.
Published: (2024) -
ATACompressor: Adaptive Task-Aware Compression for Efficient Long-Context Processing in LLMs
by: Li, Xuancheng, et al.
Published: (2026) -
Modular Representation Compression: Adapting LLMs for Efficient and Effective Recommendations
by: Xi, Yunjia, et al.
Published: (2026) -
Reinforcement Learning Approach for Integrating Compressed Contexts into Knowledge Graphs
by: Quach, Ngoc, et al.
Published: (2024)