Saved in:
| Main Authors: | Wang, Zijian, Huang, Tiancheng, Li, Hanqi, Ma, Da, Chen, Lu, Yu, Kai |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.12988 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Making Small Language Models Efficient Reasoners: Intervention, Supervision, Reinforcement
by: Zhang, Xuechen, et al.
Published: (2025)
by: Zhang, Xuechen, et al.
Published: (2025)
The Last Human-Written Paper: Agent-Native Research Artifacts
by: Liu, Jiachen, et al.
Published: (2026)
by: Liu, Jiachen, et al.
Published: (2026)
From Small to Large Language Models: Revisiting the Federalist Papers
by: Jeong, So Won, et al.
Published: (2025)
by: Jeong, So Won, et al.
Published: (2025)
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents
by: Miao, Jiacheng, et al.
Published: (2025)
by: Miao, Jiacheng, et al.
Published: (2025)
Evolving Subnetwork Training for Large Language Models
by: Li, Hanqi, et al.
Published: (2024)
by: Li, Hanqi, et al.
Published: (2024)
Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners
by: Xu, Xin, et al.
Published: (2025)
by: Xu, Xin, et al.
Published: (2025)
PaSa: An LLM Agent for Comprehensive Academic Paper Search
by: He, Yichen, et al.
Published: (2025)
by: He, Yichen, et al.
Published: (2025)
VeriThinker: Learning to Verify Makes Reasoning Model Efficient
by: Chen, Zigeng, et al.
Published: (2025)
by: Chen, Zigeng, et al.
Published: (2025)
Goal-Guided Efficient Exploration via Large Language Model in Reinforcement Learning
by: Qi, Yajie, et al.
Published: (2025)
by: Qi, Yajie, et al.
Published: (2025)
PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing
by: Song, Yiwen, et al.
Published: (2026)
by: Song, Yiwen, et al.
Published: (2026)
RAPID: An Efficient Reinforcement Learning Algorithm for Small Language Models
by: Huang, Lianghuan, et al.
Published: (2025)
by: Huang, Lianghuan, et al.
Published: (2025)
Constraint-Driven Small Language Models Based on Agent and OpenAlex Knowledge Graph: Mining Conceptual Pathways and Discovering Innovation Points in Academic Papers
by: Xia, Ziye, et al.
Published: (2025)
by: Xia, Ziye, et al.
Published: (2025)
Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning
by: Chen, Yangkun, et al.
Published: (2024)
by: Chen, Yangkun, et al.
Published: (2024)
QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search
by: Lin, Zongyu, et al.
Published: (2025)
by: Lin, Zongyu, et al.
Published: (2025)
Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models
by: Cao, Jinghan, et al.
Published: (2026)
by: Cao, Jinghan, et al.
Published: (2026)
ES-dLLM: Efficient Inference for Diffusion Large Language Models by Early-Skipping
by: Zhu, Zijian, et al.
Published: (2026)
by: Zhu, Zijian, et al.
Published: (2026)
Wonderful Matrices: More Efficient and Effective Architecture for Language Modeling Tasks
by: Shi, Jingze, et al.
Published: (2024)
by: Shi, Jingze, et al.
Published: (2024)
What Makes Value Learning Efficient in Residual Reinforcement Learning?
by: Ma, Guozheng, et al.
Published: (2026)
by: Ma, Guozheng, et al.
Published: (2026)
SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents
by: Kon, Patrick Tser Jern, et al.
Published: (2026)
by: Kon, Patrick Tser Jern, et al.
Published: (2026)
PACE: Two-Timescale Self-Evolution for Small Language Model Agents
by: Ling, Chen, et al.
Published: (2026)
by: Ling, Chen, et al.
Published: (2026)
Efficient Sequential Decision Making with Large Language Models
by: Chen, Dingyang, et al.
Published: (2024)
by: Chen, Dingyang, et al.
Published: (2024)
Chain-of-Factors Paper-Reviewer Matching
by: Zhang, Yu, et al.
Published: (2023)
by: Zhang, Yu, et al.
Published: (2023)
Paper Reconstruction Evaluation: Evaluating Presentation and Hallucination in AI-written Papers
by: Miyai, Atsuyuki, et al.
Published: (2026)
by: Miyai, Atsuyuki, et al.
Published: (2026)
Efficient Reasoning for Large Reasoning Language Models via Certainty-Guided Reflection Suppression
by: Huang, Jiameng, et al.
Published: (2025)
by: Huang, Jiameng, et al.
Published: (2025)
LLMBoost: Make Large Language Models Stronger with Boosting
by: Chen, Zehao, et al.
Published: (2025)
by: Chen, Zehao, et al.
Published: (2025)
STAR: Stage-Wise Attention-Guided Token Reduction for Efficient Large Vision-Language Models Inference
by: Guo, Yichen, et al.
Published: (2025)
by: Guo, Yichen, et al.
Published: (2025)
Walk Wisely on Graph: Knowledge Graph Reasoning with Dual Agents via Efficient Guidance-Exploration
by: Wang, Zijian, et al.
Published: (2024)
by: Wang, Zijian, et al.
Published: (2024)
EPIC: Efficient Position-Independent Caching for Serving Large Language Models
by: Hu, Junhao, et al.
Published: (2024)
by: Hu, Junhao, et al.
Published: (2024)
Position Paper: Assessing Robustness, Privacy, and Fairness in Federated Learning Integrated with Foundation Models
by: Wang, Jiaqi, et al.
Published: (2024)
by: Wang, Jiaqi, et al.
Published: (2024)
Hard Negative Sample-Augmented DPO Post-Training for Small Language Models
by: Lu, Haocheng, et al.
Published: (2025)
by: Lu, Haocheng, et al.
Published: (2025)
Less is More: Efficient Model Merging with Binary Task Switch
by: Qi, Biqing, et al.
Published: (2024)
by: Qi, Biqing, et al.
Published: (2024)
Visual CoT Makes VLMs Smarter but More Fragile
by: Xu, Chunxue, et al.
Published: (2025)
by: Xu, Chunxue, et al.
Published: (2025)
EASE: Practical and Efficient Safety Alignment for Small Language Models
by: Shi, Haonan, et al.
Published: (2025)
by: Shi, Haonan, et al.
Published: (2025)
Run-time Observation Interventions Make Vision-Language-Action Models More Visually Robust
by: Hancock, Asher J., et al.
Published: (2024)
by: Hancock, Asher J., et al.
Published: (2024)
Good Learners Think Their Thinking: Generative PRM Makes Large Reasoning Model More Efficient Math Learner
by: He, Tao, et al.
Published: (2025)
by: He, Tao, et al.
Published: (2025)
A Unified Industrial Large Knowledge Model Framework in Industry 4.0 and Smart Manufacturing
by: Lee, Jay, et al.
Published: (2023)
by: Lee, Jay, et al.
Published: (2023)
FedGuCci: Making Local Models More Connected in Landscape for Federated Learning
by: Li, Zexi, et al.
Published: (2024)
by: Li, Zexi, et al.
Published: (2024)
Guiding Through Complexity: What Makes Good Supervision for Hard Math Reasoning Tasks?
by: He, Xuan, et al.
Published: (2024)
by: He, Xuan, et al.
Published: (2024)
WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks
by: Tong, Jingwen, et al.
Published: (2024)
by: Tong, Jingwen, et al.
Published: (2024)
Training Language Model Agents to Find Vulnerabilities with CTF-Dojo
by: Zhuo, Terry Yue, et al.
Published: (2025)
by: Zhuo, Terry Yue, et al.
Published: (2025)
Similar Items
-
Making Small Language Models Efficient Reasoners: Intervention, Supervision, Reinforcement
by: Zhang, Xuechen, et al.
Published: (2025) -
The Last Human-Written Paper: Agent-Native Research Artifacts
by: Liu, Jiachen, et al.
Published: (2026) -
From Small to Large Language Models: Revisiting the Federalist Papers
by: Jeong, So Won, et al.
Published: (2025) -
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents
by: Miao, Jiacheng, et al.
Published: (2025) -
Evolving Subnetwork Training for Large Language Models
by: Li, Hanqi, et al.
Published: (2024)