:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Xu, Bowen, Wu, Shaoyu, Jiang, Hao, Liu, Kai, Chen, Xin, Hu, Lulu, Yang, Bin
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2602.02160
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Mixture-of-Instructions: Aligning Large Language Models via Mixture Prompting
by: Xu, Bowen, et al.
Published: (2024)

La RoSA: Enhancing LLM Efficiency via Layerwise Rotated Sparse Activation
by: Liu, Kai, et al.
Published: (2025)

Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
by: Xu, Ran, et al.
Published: (2025)

CORE: A Conceptual Reasoning Layer for Large Language Models
by: Hegde, Vishwas, et al.
Published: (2025)

Agentic Tool Use in Large Language Models
by: Hu, Jinchao, et al.
Published: (2026)

DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry
by: Cai, Zhenyang, et al.
Published: (2025)

LogicReward: Incentivizing LLM Reasoning via Step-Wise Logical Supervision
by: Xu, Jundong, et al.
Published: (2025)

OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning
by: Lu, Pan, et al.
Published: (2025)

Mitigating Prompt-Induced Hallucinations in Large Language Models via Structured Reasoning
by: Hao, Jinbo, et al.
Published: (2026)

Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning
by: Cheng, Xiaoxue, et al.
Published: (2025)

Meta-Reasoning Improves Tool Use in Large Language Models
by: Alazraki, Lisa, et al.
Published: (2024)

Learning How to Use Tools, Not Just When: Pattern-Aware Tool-Integrated Reasoning
by: Xu, Ningning, et al.
Published: (2025)

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models
by: Huang, Wenxuan, et al.
Published: (2025)

TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use
by: Ye, Junjie, et al.
Published: (2024)

AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent
by: Luo, Haipeng, et al.
Published: (2025)

AWPO: Enhancing Tool-Use of Large Language Models through Adaptive Integration of Reasoning Rewards
by: Lin, Zihan, et al.
Published: (2025)

LearNAT: Learning NL2SQL with AST-guided Task Decomposition for Large Language Models
by: Liao, Weibin, et al.
Published: (2025)

A Stepwise-Enhanced Reasoning Framework for Large Language Models Based on External Subgraph Generation
by: Zhang, Xin, et al.
Published: (2025)

Textualized Agent-Style Reasoning for Complex Tasks by Multiple Round LLM Generation
by: Liang, Chen, et al.
Published: (2024)

DRP: Distilled Reasoning Pruning with Skill-aware Step Decomposition for Efficient Large Reasoning Models
by: Jiang, Yuxuan, et al.
Published: (2025)

MMESGBench: Pioneering Multimodal Understanding and Complex Reasoning Benchmark for ESG Tasks
by: Zhang, Lei, et al.
Published: (2025)

FinEval-KR: A Financial Domain Evaluation Framework for Large Language Models' Knowledge and Reasoning
by: Dou, Shaoyu, et al.
Published: (2025)

Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models
by: Qin, Yulei, et al.
Published: (2025)

Context Reasoner: Incentivizing Reasoning Capability for Contextualized Privacy and Safety Compliance via Reinforcement Learning
by: Hu, Wenbin, et al.
Published: (2025)

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs
by: Wen, Xumeng, et al.
Published: (2025)

MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
by: Huang, Yue, et al.
Published: (2023)

TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model
by: Yu, Bin, et al.
Published: (2025)

Alignment for Efficient Tool Calling of Large Language Models
by: Xu, Hongshen, et al.
Published: (2025)

Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning
by: Wu, Jinyang, et al.
Published: (2026)

Med-U1: Incentivizing Unified Medical Reasoning in LLMs via Large-scale Reinforcement Learning
by: Zhang, Xiaotian, et al.
Published: (2025)

MR-Align: Meta-Reasoning Informed Factuality Alignment for Large Reasoning Models
by: Wang, Xinming, et al.
Published: (2025)

TInR: Exploring Tool-Internalized Reasoning in Large Language Models
by: Xu, Qiancheng, et al.
Published: (2026)

Beyond Token-Level Policy Gradients for Complex Reasoning with Large Language Models
by: Xu, Mufan, et al.
Published: (2026)

Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners
by: Xu, Xin, et al.
Published: (2025)

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning
by: Cheng, Qianjia, et al.
Published: (2026)

A Large Language Model Based Method for Complex Logical Reasoning over Knowledge Graphs
by: Zhang, Ziyan, et al.
Published: (2025)

Probing Large Language Models in Reasoning and Translating Complex Linguistic Puzzles
by: Lin, Zheng-Lin, et al.
Published: (2025)

ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution
by: Huang, Xu, et al.
Published: (2025)

ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
by: Ye, Junjie, et al.
Published: (2025)

How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data
by: Huang, Zixian, et al.
Published: (2026)