| Main Authors: | Jiang, Guochao, Quan, Guofeng, Ding, Zepeng, Luo, Ziqin, Wang, Dixuan, Hu, Zheng |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.13949 |
Similar Items
RLAP: A Reinforcement Learning Enhanced Adaptive Planning Framework for Multi-step NLP Task Solving
by: Ding, Zepeng, et al.
Published: (2025)
Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization
by: Wang, Dixuan, et al.
Published: (2024)
Mitigating Out-of-Entity Errors in Named Entity Recognition: A Sentence-Level Strategy
by: Jiang, Guochao, et al.
Published: (2024)
ToNER: Type-oriented Named Entity Recognition with Generative Language Model
by: Jiang, Guochao, et al.
Published: (2024)
Reason from Fallacy: Enhancing Large Language Models' Logical Reasoning through Logical Fallacy Understanding
by: Li, Yanda, et al.
Published: (2024)
Dynamic Early Exit in Reasoning Models
by: Yang, Chenxu, et al.
Published: (2025)
RASD: Retrieval-Augmented Speculative Decoding
by: Quan, Guofeng, et al.
Published: (2025)
The Zero-Step Thinking: An Empirical Study of Mode Selection as Harder Early Exit in Reasoning Models
by: Tan, Yuqiao, et al.
Published: (2025)
Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction
by: Ding, Zepeng, et al.
Published: (2024)
SED: Self-Evaluation Decoding Enhances Large Language Models for Better Generation
by: Luo, Ziqin, et al.
Published: (2024)
Efficient Reasoning with Balanced Thinking
by: Li, Yulin, et al.
Published: (2026)
The Diminishing Returns of Early-Exit Decoding in Modern LLMs
by: Wei, Rui, et al.
Published: (2026)
TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning
by: Nagle, Alliot, et al.
Published: (2026)
Do Thinking Tokens Help or Trap? Towards More Efficient Large Reasoning Model
by: Ding, Bowen, et al.
Published: (2025)
ADEPT: Adaptive Dynamic Early-Exit Process for Transformers
by: Yoo, Sangmin, et al.
Published: (2026)
SpecExit: Accelerating Large Reasoning Model via Speculative Exit
by: Yang, Rubing, et al.
Published: (2025)
Efficient Reasoning with Hidden Thinking
by: Shen, Xuan, et al.
Published: (2025)
DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs
by: Bajpai, Divya Jyoti, et al.
Published: (2024)
Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied Environments
by: Lu, Qingyu, et al.
Published: (2025)
When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning
by: Zhang, Xiaoyun, et al.
Published: (2025)
Dynamic Vocabulary Pruning in Early-Exit LLMs
by: Vincenti, Jort, et al.
Published: (2024)
InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning
by: Yan, Yuchen, et al.
Published: (2026)
Controlling Thinking Speed in Reasoning Models
by: Lin, Zhengkai, et al.
Published: (2025)
EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models
by: Pan, Xuchen, et al.
Published: (2024)
BEExformer: A Fast Inferencing Binarized Transformer with Early Exits
by: Ansar, Wazib, et al.
Published: (2024)
CAPEEN: Image Captioning with Early Exits and Knowledge Distillation
by: Bajpai, Divya Jyoti, et al.
Published: (2024)
LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning
by: Akgül, Ömer Faruk, et al.
Published: (2025)
MatryoshkaThinking: Recursive Test-Time Scaling Enables Efficient Reasoning
by: Chen, Hongwei, et al.
Published: (2025)
When Can Large Reasoning Models Save Thinking? Mechanistic Analysis of Behavioral Divergence in Reasoning
by: Zhu, Rongzhi, et al.
Published: (2025)
MeTHanol: Modularized Thinking Language Models with Intermediate Layer Thinking, Decoding and Bootstrapping Reasoning
by: Xi, Ningyuan, et al.
Published: (2024)
ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces
by: Xu, Xin, et al.
Published: (2026)
Pipeline Parallelism is All You Need for Optimized Early-Exit Based Self-Speculative Decoding
by: Li, Ruanjun, et al.
Published: (2025)
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
by: Elhoushi, Mostafa, et al.
Published: (2024)
One Jump Is All You Need: Short-Cutting Transformers for Early Exit Prediction with One Jump to Fit All Exit Levels
by: Seshadri, Amrit Diggavi
Published: (2025)
Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning
by: Cheng, Xiaoxue, et al.
Published: (2025)
Draft-Thinking: Learning Efficient Reasoning in Long Chain-of-Thought LLMs
by: Cao, Jie, et al.
Published: (2026)
MixReasoning: Switching Modes to Think
by: Lu, Haiquan, et al.
Published: (2025)
Dynamic Thinking-Token Selection for Efficient Reasoning in Large Reasoning Models
by: Guo, Zhenyuan, et al.
Published: (2026)
P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models
by: Jiang, Guochao, et al.
Published: (2024)
FlashEVA: Accelerating LLM inference via Efficient Attention
by: Kostelec, Juan Gabriel, et al.
Published: (2025)