Saved in:
| Main Authors: | Yan, Binwei, Fu, Yifei, Zhu, Mingjian, Chen, Hanting, Yuan, Mingxuan, Wang, Yunhe, Hu, Hailin |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.10874 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Transferable text data distillation by trajectory matching
by: Yao, Rong, et al.
Published: (2025)
by: Yao, Rong, et al.
Published: (2025)
Saliency-driven Dynamic Token Pruning for Large Language Models
by: Tao, Yao, et al.
Published: (2025)
by: Tao, Yao, et al.
Published: (2025)
MoRAgent: Parameter Efficient Agent Tuning with Mixture-of-Roles
by: Han, Jing, et al.
Published: (2025)
by: Han, Jing, et al.
Published: (2025)
DiJiang: Efficient Large Language Models through Compact Kernelization
by: Chen, Hanting, et al.
Published: (2024)
by: Chen, Hanting, et al.
Published: (2024)
Deferred Commitment Decoding for Diffusion Language Models
by: Shu, Yingte, et al.
Published: (2026)
by: Shu, Yingte, et al.
Published: (2026)
Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition
by: Chen, Hanting, et al.
Published: (2025)
by: Chen, Hanting, et al.
Published: (2025)
Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression
by: Wang, Haoyu, et al.
Published: (2025)
by: Wang, Haoyu, et al.
Published: (2025)
Benchmarking Machine Translation with Cultural Awareness
by: Yao, Binwei, et al.
Published: (2023)
by: Yao, Binwei, et al.
Published: (2023)
MemDLM: Memory-Enhanced DLM Training
by: Pei, Zehua, et al.
Published: (2026)
by: Pei, Zehua, et al.
Published: (2026)
SCOPE: Prompt Evolution for Enhancing Agent Effectiveness
by: Pei, Zehua, et al.
Published: (2025)
by: Pei, Zehua, et al.
Published: (2025)
Rethinking 1-bit Optimization Leveraging Pre-trained Large Language Models
by: Tu, Zhijun, et al.
Published: (2025)
by: Tu, Zhijun, et al.
Published: (2025)
Nexus: Higher-Order Attention Mechanisms in Transformers
by: Chen, Hanting, et al.
Published: (2025)
by: Chen, Hanting, et al.
Published: (2025)
Multi-Perspective Attention Mechanism for Bias-Aware Sequential Recommendation
by: Fu, Mingjian, et al.
Published: (2025)
by: Fu, Mingjian, et al.
Published: (2025)
Omni-Dimensional Frequency Learner for General Time Series Analysis
by: Chen, Xianing, et al.
Published: (2024)
by: Chen, Xianing, et al.
Published: (2024)
DLLM Agent: See Farther, Run Faster
by: Zhen, Huiling, et al.
Published: (2026)
by: Zhen, Huiling, et al.
Published: (2026)
EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization
by: Fu, Zhongqian, et al.
Published: (2025)
by: Fu, Zhongqian, et al.
Published: (2025)
Multiscale Positive-Unlabeled Detection of AI-Generated Texts
by: Tian, Yuchuan, et al.
Published: (2023)
by: Tian, Yuchuan, et al.
Published: (2023)
Near-Policy: Accelerating On-Policy Distillation via Asynchronous Generation and Selective Packing
by: Rang, Miao, et al.
Published: (2026)
by: Rang, Miao, et al.
Published: (2026)
PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation
by: Wang, Yunhe, et al.
Published: (2023)
by: Wang, Yunhe, et al.
Published: (2023)
DP-GTR: Differentially Private Prompt Protection via Group Text Rewriting
by: Li, Mingchen, et al.
Published: (2025)
by: Li, Mingchen, et al.
Published: (2025)
Multi-Granularity Semantic Revision for Large Language Model Distillation
by: Liu, Xiaoyu, et al.
Published: (2024)
by: Liu, Xiaoyu, et al.
Published: (2024)
A Pluggable Multi-Task Learning Framework for Sentiment-Aware Financial Relation Extraction
by: Luo, Jinming, et al.
Published: (2025)
by: Luo, Jinming, et al.
Published: (2025)
Towards Efficient Agents: A Co-Design of Inference Architecture and System
by: Lin, Weizhe, et al.
Published: (2025)
by: Lin, Weizhe, et al.
Published: (2025)
A Survey on Transformer Compression
by: Tang, Yehui, et al.
Published: (2024)
by: Tang, Yehui, et al.
Published: (2024)
RDEx-MOP: Indicator-Guided Reconstructed Differential Evolution for Fixed-Budget Multiobjective Optimization
by: Tao, Sichen, et al.
Published: (2026)
by: Tao, Sichen, et al.
Published: (2026)
CFinBench: A Comprehensive Chinese Financial Benchmark for Large Language Models
by: Nie, Ying, et al.
Published: (2024)
by: Nie, Ying, et al.
Published: (2024)
C-Evolve: Consensus-based Evolution for Prompt Groups
by: Li, Tiancheng, et al.
Published: (2025)
by: Li, Tiancheng, et al.
Published: (2025)
Introducing MAPO: Momentum-Aided Gradient Descent Prompt Optimization
by: Cui, Anthony, et al.
Published: (2024)
by: Cui, Anthony, et al.
Published: (2024)
SITA: Learning Speaker-Invariant and Tone-Aware Speech Representations for Low-Resource Tonal Languages
by: Xu, Tianyi, et al.
Published: (2026)
by: Xu, Tianyi, et al.
Published: (2026)
CBQ: Cross-Block Quantization for Large Language Models
by: Ding, Xin, et al.
Published: (2023)
by: Ding, Xin, et al.
Published: (2023)
AI as a deliberative partner fosters intercultural empathy for Americans but fails for Latin American participants
by: Villanueva, Isabel, et al.
Published: (2025)
by: Villanueva, Isabel, et al.
Published: (2025)
Question-Aware Knowledge Graph Prompting for Enhancing Large Language Models
by: Liu, Haochen, et al.
Published: (2025)
by: Liu, Haochen, et al.
Published: (2025)
IAPT: Instruction-Aware Prompt Tuning for Large Language Models
by: Zhu, Wei, et al.
Published: (2024)
by: Zhu, Wei, et al.
Published: (2024)
Ace-Skill: Bootstrapping Multimodal Agents with Prioritized and Clustered Evolution
by: Xiong, Feng, et al.
Published: (2026)
by: Xiong, Feng, et al.
Published: (2026)
CAF-I: A Collaborative Multi-Agent Framework for Enhanced Irony Detection with Large Language Models
by: Liu, Ziqi., et al.
Published: (2025)
by: Liu, Ziqi., et al.
Published: (2025)
Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization
by: Liu, Yuanye, et al.
Published: (2025)
by: Liu, Yuanye, et al.
Published: (2025)
Mixture of In-Context Experts Enhance LLMs' Long Context Awareness
by: Lin, Hongzhan, et al.
Published: (2024)
by: Lin, Hongzhan, et al.
Published: (2024)
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
by: Huang, Tianjin, et al.
Published: (2025)
by: Huang, Tianjin, et al.
Published: (2025)
The Detection-Extraction Gap: Models Know the Answer Before They Can Say It
by: Wang, Hanyang, et al.
Published: (2026)
by: Wang, Hanyang, et al.
Published: (2026)
Towards Reliable and Empathetic Depression-Diagnosis-Oriented Chats
by: Lan, Kunyao, et al.
Published: (2024)
by: Lan, Kunyao, et al.
Published: (2024)
Similar Items
-
Transferable text data distillation by trajectory matching
by: Yao, Rong, et al.
Published: (2025) -
Saliency-driven Dynamic Token Pruning for Large Language Models
by: Tao, Yao, et al.
Published: (2025) -
MoRAgent: Parameter Efficient Agent Tuning with Mixture-of-Roles
by: Han, Jing, et al.
Published: (2025) -
DiJiang: Efficient Large Language Models through Compact Kernelization
by: Chen, Hanting, et al.
Published: (2024) -
Deferred Commitment Decoding for Diffusion Language Models
by: Shu, Yingte, et al.
Published: (2026)