:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lin, Yuxiang, Wang, Zihan, Liu, Mengyang, Shan, Yuxuan, Bai, Longju, Zhang, Junyao, Jin, Xing, Chen, Boshan, Su, Jinyan, Wang, Xingyao, Pei, Jiaxin, Li, Manling
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence Computation and Language
Online Access:	https://arxiv.org/abs/2606.00198
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

How Do AI Agents Spend Your Money? Analyzing and Predicting Token Consumption in Agentic Coding Tasks
by: Bai, Longju, et al.
Published: (2026)

Graphical Reasoning: LLM-based Semi-Open Relation Extraction
by: Tao, Yicheng, et al.
Published: (2024)

Annotations on a Budget: Leveraging Geo-Data Similarity to Balance Model Performance and Annotation Cost
by: Ignat, Oana, et al.
Published: (2024)

SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering
by: Guo, Xuehang, et al.
Published: (2025)

The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning
by: Bai, Longju, et al.
Published: (2024)

Token-Budget-Aware LLM Reasoning
by: Han, Tingxu, et al.
Published: (2024)

Visually Descriptive Language Model for Vector Graphics Reasoning
by: Wang, Zhenhailong, et al.
Published: (2024)

RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning
by: Wang, Zihan, et al.
Published: (2025)

Executable Code Actions Elicit Better LLM Agents
by: Wang, Xingyao, et al.
Published: (2024)

On LLM-Based Scientific Inductive Reasoning Beyond Equations
by: Lin, Brian S., et al.
Published: (2025)

Training Proactive and Personalized LLM Agents
by: Sun, Weiwei, et al.
Published: (2025)

Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
by: Wang, Junlin, et al.
Published: (2024)

Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory
by: Zhang, Haozhen, et al.
Published: (2026)

Knowing but Not Showing: LLMs Recognize Ambiguity but Rarely Ask Clarifying Questions
by: Su, Jinyan, et al.
Published: (2026)

Thinking Fast and Right: Balancing Accuracy and Reasoning Length with Adaptive Rewards
by: Su, Jinyan, et al.
Published: (2025)

Clarification Is Not Enough: Post-Clarification Answering Remains the Bottleneck in Multi-Turn QA
by: Su, Jinyan, et al.
Published: (2026)

Trajectory2Task: Training Robust Tool-Calling Agents with Synthesized Yet Verifiable Data for Complex User Intents
by: Wang, Ziyi, et al.
Published: (2026)

LocAgent: Graph-Guided LLM Agents for Code Localization
by: Chen, Zhaoling, et al.
Published: (2025)

SkillCraft: Can LLM Agents Learn to Use Tools Skillfully?
by: Chen, Shiqi, et al.
Published: (2026)

Culture Affordance Atlas: Reconciling Object Diversity Through Functional Mapping
by: Nwatu, Joan, et al.
Published: (2025)

FinResearchBench: A Logic Tree based Agent-as-a-Judge Evaluation Framework for Financial Research Agents
by: Sun, Rui, et al.
Published: (2025)

Coding Agents with Multimodal Browsing are Generalist Problem Solvers
by: Soni, Aditya Bharat, et al.
Published: (2025)

Why Does New Knowledge Create Messy Ripple Effects in LLMs?
by: Qin, Jiaxin, et al.
Published: (2024)

Budget-Aware Anytime Reasoning with LLM-Synthesized Preference Data
by: Zhang, Xuanming, et al.
Published: (2026)

MTRouter: Cost-Aware Multi-Turn LLM Routing with History-Model Joint Embeddings
by: Zhang, Yiqun, et al.
Published: (2026)

Your Language Model Secretly Contains Personality Subnetworks
by: Ye, Ruimeng, et al.
Published: (2026)

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback
by: Wang, Xingyao, et al.
Published: (2023)

AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting
by: Huang, Shijue, et al.
Published: (2025)

Adapting Fake News Detection to the Era of Large Language Models
by: Su, Jinyan, et al.
Published: (2023)

Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping
by: Wang, Ziyi, et al.
Published: (2025)

If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
by: Yang, Ke, et al.
Published: (2024)

Training Software Engineering Agents and Verifiers with SWE-Gym
by: Pan, Jiayi, et al.
Published: (2024)

Probe and Skip: Self-Predictive Token Skipping for Efficient Long-Context LLM Inference
by: Wu, Zimeng, et al.
Published: (2026)

WINELL: Wikipedia Never-Ending Updating with LLM Agents
by: Reddy, Revanth Gangi, et al.
Published: (2025)

RADAR: Reasoning as Discrimination with Aligned Representations for LLM-based Knowledge Graph Reasoning
by: Xue, Bo, et al.
Published: (2026)

Chumor 1.0: A Truly Funny and Challenging Chinese Humor Understanding Dataset from Ruo Zhi Ba
by: He, Ruiqi, et al.
Published: (2024)

HumanLLM: Towards Personalized Understanding and Simulation of Human Nature
by: Lei, Yuxuan, et al.
Published: (2026)

CP-Router: An Uncertainty-Aware Router Between LLM and LRM
by: Su, Jiayuan, et al.
Published: (2025)

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents
by: Song, Yueqi, et al.
Published: (2025)

InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation
by: Zhao, Weilin, et al.
Published: (2025)