Saved in:
| Main Authors: | Yao, Rong, Hu, Hailin, Fu, Yifei, Chen, Hanting, Fang, Wenyi, Du, Fanyi, Han, Kai, Wang, Yunhe |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.09818 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
C-MOP: Integrating Momentum and Boundary-Aware Clustering for Enhanced Prompt Evolution
by: Yan, Binwei, et al.
Published: (2026)
by: Yan, Binwei, et al.
Published: (2026)
DiJiang: Efficient Large Language Models through Compact Kernelization
by: Chen, Hanting, et al.
Published: (2024)
by: Chen, Hanting, et al.
Published: (2024)
Saliency-driven Dynamic Token Pruning for Large Language Models
by: Tao, Yao, et al.
Published: (2025)
by: Tao, Yao, et al.
Published: (2025)
A Survey on Transformer Compression
by: Tang, Yehui, et al.
Published: (2024)
by: Tang, Yehui, et al.
Published: (2024)
Deferred Commitment Decoding for Diffusion Language Models
by: Shu, Yingte, et al.
Published: (2026)
by: Shu, Yingte, et al.
Published: (2026)
Near-Policy: Accelerating On-Policy Distillation via Asynchronous Generation and Selective Packing
by: Rang, Miao, et al.
Published: (2026)
by: Rang, Miao, et al.
Published: (2026)
Nexus: Higher-Order Attention Mechanisms in Transformers
by: Chen, Hanting, et al.
Published: (2025)
by: Chen, Hanting, et al.
Published: (2025)
Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression
by: Wang, Haoyu, et al.
Published: (2025)
by: Wang, Haoyu, et al.
Published: (2025)
Rethinking 1-bit Optimization Leveraging Pre-trained Large Language Models
by: Tu, Zhijun, et al.
Published: (2025)
by: Tu, Zhijun, et al.
Published: (2025)
Progressive trajectory matching for medical dataset distillation
by: Yu, Zhen, et al.
Published: (2024)
by: Yu, Zhen, et al.
Published: (2024)
Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition
by: Chen, Hanting, et al.
Published: (2025)
by: Chen, Hanting, et al.
Published: (2025)
From Next-Token to Next-Block: A Principled Adaptation Path for Diffusion LLMs
by: Tian, Yuchuan, et al.
Published: (2025)
by: Tian, Yuchuan, et al.
Published: (2025)
PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation
by: Wang, Yunhe, et al.
Published: (2023)
by: Wang, Yunhe, et al.
Published: (2023)
Omni-Dimensional Frequency Learner for General Time Series Analysis
by: Chen, Xianing, et al.
Published: (2024)
by: Chen, Xianing, et al.
Published: (2024)
Multi-Granularity Semantic Revision for Large Language Model Distillation
by: Liu, Xiaoyu, et al.
Published: (2024)
by: Liu, Xiaoyu, et al.
Published: (2024)
Multiscale Positive-Unlabeled Detection of AI-Generated Texts
by: Tian, Yuchuan, et al.
Published: (2023)
by: Tian, Yuchuan, et al.
Published: (2023)
EMS-SD: Efficient Multi-sample Speculative Decoding for Accelerating Large Language Models
by: Ni, Yunsheng, et al.
Published: (2024)
by: Ni, Yunsheng, et al.
Published: (2024)
Long document summarization using page specific target text alignment and distilling page importance
by: Devi, Pushpa, et al.
Published: (2025)
by: Devi, Pushpa, et al.
Published: (2025)
CBQ: Cross-Block Quantization for Large Language Models
by: Ding, Xin, et al.
Published: (2023)
by: Ding, Xin, et al.
Published: (2023)
Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning
by: Bi, Zhenni, et al.
Published: (2024)
by: Bi, Zhenni, et al.
Published: (2024)
DimMem: Dimensional Structuring for Efficient Long-Term Agent Memory
by: Qiu, Wentao, et al.
Published: (2026)
by: Qiu, Wentao, et al.
Published: (2026)
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
by: Guo, Jianyuan, et al.
Published: (2024)
by: Guo, Jianyuan, et al.
Published: (2024)
DLLM Agent: See Farther, Run Faster
by: Zhen, Huiling, et al.
Published: (2026)
by: Zhen, Huiling, et al.
Published: (2026)
Domain Regeneration: How well do LLMs match syntactic properties of text domains?
by: Ju, Da, et al.
Published: (2025)
by: Ju, Da, et al.
Published: (2025)
Ungrammatical-syntax-based In-context Example Selection for Grammatical Error Correction
by: Tang, Chenming, et al.
Published: (2024)
by: Tang, Chenming, et al.
Published: (2024)
Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive Decoding
by: Qu, Fanyi, et al.
Published: (2024)
by: Qu, Fanyi, et al.
Published: (2024)
Evaluating the Capability of Large-scale Language Models on Chinese Grammatical Error Correction Task
by: Qu, Fanyi, et al.
Published: (2023)
by: Qu, Fanyi, et al.
Published: (2023)
$\text{R}^2\text{R}$: A Route-to-Rerank Post-Training Framework for Multi-Domain Decoder-Only Rerankers
by: Wang, Xinyu, et al.
Published: (2025)
by: Wang, Xinyu, et al.
Published: (2025)
VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse
by: Nie, Ying, et al.
Published: (2025)
by: Nie, Ying, et al.
Published: (2025)
Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting
by: Liu, Fangcheng, et al.
Published: (2024)
by: Liu, Fangcheng, et al.
Published: (2024)
BARD: budget-aware reasoning distillation
by: Niu, Lujie, et al.
Published: (2025)
by: Niu, Lujie, et al.
Published: (2025)
How do we measure privacy in text? A survey of text anonymization metrics
by: Ren, Yaxuan, et al.
Published: (2025)
by: Ren, Yaxuan, et al.
Published: (2025)
DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models
by: He, Wei, et al.
Published: (2024)
by: He, Wei, et al.
Published: (2024)
Top 10 Open Challenges Steering the Future of Diffusion Language Model and Its Variants
by: Wang, Yunhe, et al.
Published: (2026)
by: Wang, Yunhe, et al.
Published: (2026)
Knowledge Fusion By Evolving Weights of Language Models
by: Du, Guodong, et al.
Published: (2024)
by: Du, Guodong, et al.
Published: (2024)
Assessing News Thumbnail Representativeness: Counterfactual text can enhance the cross-modal matching ability
by: Yoon, Yejun, et al.
Published: (2024)
by: Yoon, Yejun, et al.
Published: (2024)
IPT-V2: Efficient Image Processing Transformer using Hierarchical Attentions
by: Tu, Zhijun, et al.
Published: (2024)
by: Tu, Zhijun, et al.
Published: (2024)
An efficient text augmentation approach for contextualized Mandarin speech recognition
by: Zheng, Naijun, et al.
Published: (2024)
by: Zheng, Naijun, et al.
Published: (2024)
A Pluggable Multi-Task Learning Framework for Sentiment-Aware Financial Relation Extraction
by: Luo, Jinming, et al.
Published: (2025)
by: Luo, Jinming, et al.
Published: (2025)
MoRAgent: Parameter Efficient Agent Tuning with Mixture-of-Roles
by: Han, Jing, et al.
Published: (2025)
by: Han, Jing, et al.
Published: (2025)
Similar Items
-
C-MOP: Integrating Momentum and Boundary-Aware Clustering for Enhanced Prompt Evolution
by: Yan, Binwei, et al.
Published: (2026) -
DiJiang: Efficient Large Language Models through Compact Kernelization
by: Chen, Hanting, et al.
Published: (2024) -
Saliency-driven Dynamic Token Pruning for Large Language Models
by: Tao, Yao, et al.
Published: (2025) -
A Survey on Transformer Compression
by: Tang, Yehui, et al.
Published: (2024) -
Deferred Commitment Decoding for Diffusion Language Models
by: Shu, Yingte, et al.
Published: (2026)