:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ni, Xinyi, Jian, Haonan, Wang, Qiuyang, Shah, Vedanshi Chetan, Hong, Pengyu
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2506.19998
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations
by: Ni, Xinyi, et al.
Published: (2025)

What is Visualization for Communication? Analyzing Four Years of VisComm Papers
by: Shah, Vedanshi Chetan, et al.
Published: (2025)

Multiple Abstraction Level Retrieve Augment Generation
by: Zheng, Zheng, et al.
Published: (2025)

DocLens : A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding
by: Zhu, Dawei, et al.
Published: (2025)

DocAgent: A Multi-Agent System for Automated Code Documentation Generation
by: Yang, Dayu, et al.
Published: (2025)

Demonstrating ViviDoc: Generating Interactive Documents through Human-Agent Collaboration
by: Tang, Yinghao, et al.
Published: (2026)

Tool-to-Agent Retrieval: Bridging Tools and Agents for Scalable LLM Multi-Agent Systems
by: Lumer, Elias, et al.
Published: (2025)

GTA: A Benchmark for General Tool Agents
by: Wang, Jize, et al.
Published: (2024)

Infant Agent: A Tool-Integrated, Logic-Driven Agent with Cost-Effective API Usage
by: Lei, Bin, et al.
Published: (2024)

AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls
by: Du, Yu, et al.
Published: (2024)

Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation
by: Guo, Jiaxin, et al.
Published: (2025)

GTA-2: Benchmarking General Tool Agents from Atomic Tool-Use to Open-Ended Workflows
by: Wang, Jize, et al.
Published: (2026)

In-N-Out: A Parameter-Level API Graph Dataset for Tool Agents
by: Lee, Seungkyu, et al.
Published: (2025)

SpeCrawler: Generating OpenAPI Specifications from API Documentation Using Large Language Models
by: Lazar, Koren, et al.
Published: (2024)

DocGraphLM: Documental Graph Language Model for Information Extraction
by: Wang, Dongsheng, et al.
Published: (2024)

TRACER: Verifiable Generative Provenance for Multimodal Tool-Using Agents
by: Yu, Bihui, et al.
Published: (2026)

DocTabQA: Answering Questions from Long Documents Using Tables
by: Wang, Haochen, et al.
Published: (2024)

From Failure to Mastery: Generating Hard Samples for Tool-use Agents
by: Hao, Bingguang, et al.
Published: (2026)

Beyond Browsing: API-Based Web Agents
by: Song, Yueqi, et al.
Published: (2024)

MM-Doc-R1: Training Agents for Long Document Visual Question Answering through Multi-turn Reinforcement Learning
by: Lin, Jiahang, et al.
Published: (2026)

Doc2Chart: Intent-Driven Zero-Shot Chart Generation from Documents
by: Jain, Akriti, et al.
Published: (2025)

SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
by: Zhang, Dong, et al.
Published: (2024)

AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents
by: Xie, Jingxu, et al.
Published: (2025)

PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks
by: Ni, Feng, et al.
Published: (2025)

DocCGen: Document-based Controlled Code Generation
by: Pimparkhede, Sameer, et al.
Published: (2024)

EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions
by: Hong, Spencer, et al.
Published: (2025)

Multi2: Multi-Agent Test-Time Scalable Framework for Multi-Document Processing
by: Cao, Juntai, et al.
Published: (2025)

DeepAgent: A General Reasoning Agent with Scalable Toolsets
by: Li, Xiaoxi, et al.
Published: (2025)

MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers
by: Wang, Zhenting, et al.
Published: (2025)

ToolScope: Enhancing LLM Agent Tool Use through Tool Merging and Context-Aware Filtering
by: Liu, Marianne Menglin, et al.
Published: (2025)

Doc-V*:Coarse-to-Fine Interactive Visual Reasoning for Multi-Page Document VQA
by: Zheng, Yuanlei, et al.
Published: (2026)

VeriAgent: A Tool-Integrated Multi-Agent System with Evolving Memory for PPA-Aware RTL Code Generation
by: Wang, Yaoxiang, et al.
Published: (2026)

DocFusion: A Unified Framework for Document Parsing Tasks
by: Chai, Mingxu, et al.
Published: (2024)

DocDancer: Towards Agentic Document-Grounded Information Seeking
by: Zhang, Qintong, et al.
Published: (2026)

PostDoc: Generating Poster from a Long Multimodal Document Using Deep Submodular Optimization
by: Jaisankar, Vijay, et al.
Published: (2024)

DocCHA: Towards LLM-Augmented Interactive Online diagnosis System
by: Liu, Xinyi, et al.
Published: (2025)

DocTER: Evaluating Document-based Knowledge Editing
by: Wu, Suhang, et al.
Published: (2023)

NaviAgent: Graph-Driven Bilevel Planning for Scalable Tool Orchestration
by: Jiang, Yan, et al.
Published: (2025)

ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical Agents
by: Liao, Yusheng, et al.
Published: (2024)

DocTalk: Scalable Graph-based Dialogue Synthesis for Enhancing LLM Conversational Capabilities
by: Lee, Jing Yang, et al.
Published: (2025)