Saved in:
| Main Authors: | Xu, Bowen, Wu, Shaoyu, Jiang, Hao, Liu, Kai, Chen, Xin, Hu, Lulu, Yang, Bin |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.02160 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Mixture-of-Instructions: Aligning Large Language Models via Mixture Prompting
by: Xu, Bowen, et al.
Published: (2024)
by: Xu, Bowen, et al.
Published: (2024)
La RoSA: Enhancing LLM Efficiency via Layerwise Rotated Sparse Activation
by: Liu, Kai, et al.
Published: (2025)
by: Liu, Kai, et al.
Published: (2025)
Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
by: Xu, Ran, et al.
Published: (2025)
by: Xu, Ran, et al.
Published: (2025)
CORE: A Conceptual Reasoning Layer for Large Language Models
by: Hegde, Vishwas, et al.
Published: (2025)
by: Hegde, Vishwas, et al.
Published: (2025)
Agentic Tool Use in Large Language Models
by: Hu, Jinchao, et al.
Published: (2026)
by: Hu, Jinchao, et al.
Published: (2026)
DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry
by: Cai, Zhenyang, et al.
Published: (2025)
by: Cai, Zhenyang, et al.
Published: (2025)
LogicReward: Incentivizing LLM Reasoning via Step-Wise Logical Supervision
by: Xu, Jundong, et al.
Published: (2025)
by: Xu, Jundong, et al.
Published: (2025)
OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning
by: Lu, Pan, et al.
Published: (2025)
by: Lu, Pan, et al.
Published: (2025)
Mitigating Prompt-Induced Hallucinations in Large Language Models via Structured Reasoning
by: Hao, Jinbo, et al.
Published: (2026)
by: Hao, Jinbo, et al.
Published: (2026)
Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning
by: Cheng, Xiaoxue, et al.
Published: (2025)
by: Cheng, Xiaoxue, et al.
Published: (2025)
Meta-Reasoning Improves Tool Use in Large Language Models
by: Alazraki, Lisa, et al.
Published: (2024)
by: Alazraki, Lisa, et al.
Published: (2024)
Learning How to Use Tools, Not Just When: Pattern-Aware Tool-Integrated Reasoning
by: Xu, Ningning, et al.
Published: (2025)
by: Xu, Ningning, et al.
Published: (2025)
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models
by: Huang, Wenxuan, et al.
Published: (2025)
by: Huang, Wenxuan, et al.
Published: (2025)
TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use
by: Ye, Junjie, et al.
Published: (2024)
by: Ye, Junjie, et al.
Published: (2024)
AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent
by: Luo, Haipeng, et al.
Published: (2025)
by: Luo, Haipeng, et al.
Published: (2025)
AWPO: Enhancing Tool-Use of Large Language Models through Adaptive Integration of Reasoning Rewards
by: Lin, Zihan, et al.
Published: (2025)
by: Lin, Zihan, et al.
Published: (2025)
LearNAT: Learning NL2SQL with AST-guided Task Decomposition for Large Language Models
by: Liao, Weibin, et al.
Published: (2025)
by: Liao, Weibin, et al.
Published: (2025)
A Stepwise-Enhanced Reasoning Framework for Large Language Models Based on External Subgraph Generation
by: Zhang, Xin, et al.
Published: (2025)
by: Zhang, Xin, et al.
Published: (2025)
Textualized Agent-Style Reasoning for Complex Tasks by Multiple Round LLM Generation
by: Liang, Chen, et al.
Published: (2024)
by: Liang, Chen, et al.
Published: (2024)
DRP: Distilled Reasoning Pruning with Skill-aware Step Decomposition for Efficient Large Reasoning Models
by: Jiang, Yuxuan, et al.
Published: (2025)
by: Jiang, Yuxuan, et al.
Published: (2025)
MMESGBench: Pioneering Multimodal Understanding and Complex Reasoning Benchmark for ESG Tasks
by: Zhang, Lei, et al.
Published: (2025)
by: Zhang, Lei, et al.
Published: (2025)
FinEval-KR: A Financial Domain Evaluation Framework for Large Language Models' Knowledge and Reasoning
by: Dou, Shaoyu, et al.
Published: (2025)
by: Dou, Shaoyu, et al.
Published: (2025)
Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models
by: Qin, Yulei, et al.
Published: (2025)
by: Qin, Yulei, et al.
Published: (2025)
Context Reasoner: Incentivizing Reasoning Capability for Contextualized Privacy and Safety Compliance via Reinforcement Learning
by: Hu, Wenbin, et al.
Published: (2025)
by: Hu, Wenbin, et al.
Published: (2025)
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs
by: Wen, Xumeng, et al.
Published: (2025)
by: Wen, Xumeng, et al.
Published: (2025)
MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
by: Huang, Yue, et al.
Published: (2023)
by: Huang, Yue, et al.
Published: (2023)
TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model
by: Yu, Bin, et al.
Published: (2025)
by: Yu, Bin, et al.
Published: (2025)
Alignment for Efficient Tool Calling of Large Language Models
by: Xu, Hongshen, et al.
Published: (2025)
by: Xu, Hongshen, et al.
Published: (2025)
Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning
by: Wu, Jinyang, et al.
Published: (2026)
by: Wu, Jinyang, et al.
Published: (2026)
Med-U1: Incentivizing Unified Medical Reasoning in LLMs via Large-scale Reinforcement Learning
by: Zhang, Xiaotian, et al.
Published: (2025)
by: Zhang, Xiaotian, et al.
Published: (2025)
MR-Align: Meta-Reasoning Informed Factuality Alignment for Large Reasoning Models
by: Wang, Xinming, et al.
Published: (2025)
by: Wang, Xinming, et al.
Published: (2025)
TInR: Exploring Tool-Internalized Reasoning in Large Language Models
by: Xu, Qiancheng, et al.
Published: (2026)
by: Xu, Qiancheng, et al.
Published: (2026)
Beyond Token-Level Policy Gradients for Complex Reasoning with Large Language Models
by: Xu, Mufan, et al.
Published: (2026)
by: Xu, Mufan, et al.
Published: (2026)
Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners
by: Xu, Xin, et al.
Published: (2025)
by: Xu, Xin, et al.
Published: (2025)
Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning
by: Cheng, Qianjia, et al.
Published: (2026)
by: Cheng, Qianjia, et al.
Published: (2026)
A Large Language Model Based Method for Complex Logical Reasoning over Knowledge Graphs
by: Zhang, Ziyan, et al.
Published: (2025)
by: Zhang, Ziyan, et al.
Published: (2025)
Probing Large Language Models in Reasoning and Translating Complex Linguistic Puzzles
by: Lin, Zheng-Lin, et al.
Published: (2025)
by: Lin, Zheng-Lin, et al.
Published: (2025)
ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution
by: Huang, Xu, et al.
Published: (2025)
by: Huang, Xu, et al.
Published: (2025)
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
by: Ye, Junjie, et al.
Published: (2025)
by: Ye, Junjie, et al.
Published: (2025)
How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data
by: Huang, Zixian, et al.
Published: (2026)
by: Huang, Zixian, et al.
Published: (2026)
Similar Items
-
Mixture-of-Instructions: Aligning Large Language Models via Mixture Prompting
by: Xu, Bowen, et al.
Published: (2024) -
La RoSA: Enhancing LLM Efficiency via Layerwise Rotated Sparse Activation
by: Liu, Kai, et al.
Published: (2025) -
Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
by: Xu, Ran, et al.
Published: (2025) -
CORE: A Conceptual Reasoning Layer for Large Language Models
by: Hegde, Vishwas, et al.
Published: (2025) -
Agentic Tool Use in Large Language Models
by: Hu, Jinchao, et al.
Published: (2026)