Library Catalog

Bibliographic Details
Main Authors:	Zhang, Xuqin; He, Quan; Zheng, Zhenrui; Zhang, Zongzhang; He, Xu; Li, Dong
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2602.01204

Similar Items

Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference
by: Qiu, Wenjie, et al.
Published: (2025)

AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning
by: Zou, Jiaru, et al.
Published: (2025)

Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs
by: Lu, Meng, et al.
Published: (2025)

Agentic Reasoning: A Streamlined Framework for Enhancing LLM Reasoning with Agentic Tools
by: Wu, Junde, et al.
Published: (2025)

When to Trust Tools? Adaptive Tool Trust Calibration For Tool-Integrated Math Reasoning
by: Xu, Ruotao, et al.
Published: (2026)

Open-Vocabulary Federated Learning with Multimodal Prototyping
by: Zeng, Huimin, et al.
Published: (2024)

Scaling Agentic Verifier for Competitive Coding
by: Ma, Zeyao, et al.
Published: (2026)

MedOrch: Medical Diagnosis with Tool-Augmented Reasoning Agents for Flexible Extensibility
by: He, Yexiao, et al.
Published: (2025)

AgenticQwen: Training Small Agentic Language Models with Dual Data Flywheels for Industrial-Scale Tool Use
by: Lyu, Yuanjie, et al.
Published: (2026)

rStar2-Agent: Agentic Reasoning Technical Report
by: Shang, Ning, et al.
Published: (2025)

Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models
by: Zhang, Fuxiang, et al.
Published: (2024)

Hybrid Latent Reasoning via Reinforcement Learning
by: Yue, Zhenrui, et al.
Published: (2025)

Scaling Medical Reasoning Verification via Tool-Integrated Reinforcement Learning
by: Zhang, Hang, et al.
Published: (2026)

OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning
by: Lu, Pan, et al.
Published: (2025)

GeoBrowse: A Geolocation Benchmark for Agentic Tool Use with Expert-Annotated Reasoning Traces
by: Geng, Xinyu, et al.
Published: (2026)

PyVision: Agentic Vision with Dynamic Tooling
by: Zhao, Shitian, et al.
Published: (2025)

Agentic Tool Use in Large Language Models
by: Hu, Jinchao, et al.
Published: (2026)

Evidence-Driven Retrieval Augmented Response Generation for Online Misinformation
by: Yue, Zhenrui, et al.
Published: (2024)

Retrieval Augmented Fact Verification by Synthesizing Contrastive Arguments
by: Yue, Zhenrui, et al.
Published: (2024)

A Survey of Frontiers in LLM Reasoning: Inference Scaling, Learning to Reason, and Agentic Systems
by: Ke, Zixuan, et al.
Published: (2025)

ToolOmni: Enabling Open-World Tool Use via Agentic learning with Proactive Retrieval and Grounded Execution
by: Huang, Shouzheng, et al.
Published: (2026)

LRAS: Advanced Legal Reasoning with Agentic Search
by: Zhou, Yujin, et al.
Published: (2026)

Inference Scaling for Long-Context Retrieval Augmented Generation
by: Yue, Zhenrui, et al.
Published: (2024)

Agentic Reasoning for Large Language Models
by: Wei, Tianxin, et al.
Published: (2026)

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
by: GLM-4.5 Team, et al.
Published: (2025)

Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
by: Xu, Ran, et al.
Published: (2025)

AgentV-RL: Scaling Reward Modeling with Agentic Verifier
by: Zhang, Jiazheng, et al.
Published: (2026)

THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning
by: Chang, Qikai, et al.
Published: (2025)

Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning
by: Lin, Honglin, et al.
Published: (2025)

START: Self-taught Reasoner with Tools
by: Li, Chengpeng, et al.
Published: (2025)

Eliminating Agentic Workflow for Introduction Generation with Parametric Stage Tokens
by: Zhang, Meicong, et al.
Published: (2025)

Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification
by: Bai, Tianyi, et al.
Published: (2025)

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning
by: Wang, Jianing, et al.
Published: (2026)

Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
by: Wong, Jeffrey T. H., et al.
Published: (2026)

Search-o1: Agentic Search-Enhanced Large Reasoning Models
by: Li, Xiaoxi, et al.
Published: (2025)

An Agentic Approach to Generating XAI-Narratives
by: He, Yifan, et al.
Published: (2026)

Generalist Reward Models: Found Inside Large Language Models
by: Li, Yi-Chen, et al.
Published: (2025)

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling
by: Zheng, Tong, et al.
Published: (2026)

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
by: Jin, Bowen, et al.
Published: (2025)

AgenticRAGTracer: A Hop-Aware Benchmark for Diagnosing Multi-Step Retrieval Reasoning in Agentic RAG
by: You, Qijie, et al.
Published: (2026)