| Main Authors: | Ni, Xinyi, Jian, Haonan, Wang, Qiuyang, Shah, Vedanshi Chetan, Hong, Pengyu |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2506.19998 |
Similar Items
ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations
by: Ni, Xinyi, et al.
Published: (2025)
What is Visualization for Communication? Analyzing Four Years of VisComm Papers
by: Shah, Vedanshi Chetan, et al.
Published: (2025)
Multiple Abstraction Level Retrieve Augment Generation
by: Zheng, Zheng, et al.
Published: (2025)
DocLens : A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding
by: Zhu, Dawei, et al.
Published: (2025)
DocAgent: A Multi-Agent System for Automated Code Documentation Generation
by: Yang, Dayu, et al.
Published: (2025)
Demonstrating ViviDoc: Generating Interactive Documents through Human-Agent Collaboration
by: Tang, Yinghao, et al.
Published: (2026)
Tool-to-Agent Retrieval: Bridging Tools and Agents for Scalable LLM Multi-Agent Systems
by: Lumer, Elias, et al.
Published: (2025)
GTA: A Benchmark for General Tool Agents
by: Wang, Jize, et al.
Published: (2024)
Infant Agent: A Tool-Integrated, Logic-Driven Agent with Cost-Effective API Usage
by: Lei, Bin, et al.
Published: (2024)
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls
by: Du, Yu, et al.
Published: (2024)
Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation
by: Guo, Jiaxin, et al.
Published: (2025)
GTA-2: Benchmarking General Tool Agents from Atomic Tool-Use to Open-Ended Workflows
by: Wang, Jize, et al.
Published: (2026)
In-N-Out: A Parameter-Level API Graph Dataset for Tool Agents
by: Lee, Seungkyu, et al.
Published: (2025)
SpeCrawler: Generating OpenAPI Specifications from API Documentation Using Large Language Models
by: Lazar, Koren, et al.
Published: (2024)
DocGraphLM: Documental Graph Language Model for Information Extraction
by: Wang, Dongsheng, et al.
Published: (2024)
TRACER: Verifiable Generative Provenance for Multimodal Tool-Using Agents
by: Yu, Bihui, et al.
Published: (2026)
DocTabQA: Answering Questions from Long Documents Using Tables
by: Wang, Haochen, et al.
Published: (2024)
From Failure to Mastery: Generating Hard Samples for Tool-use Agents
by: Hao, Bingguang, et al.
Published: (2026)
Beyond Browsing: API-Based Web Agents
by: Song, Yueqi, et al.
Published: (2024)
MM-Doc-R1: Training Agents for Long Document Visual Question Answering through Multi-turn Reinforcement Learning
by: Lin, Jiahang, et al.
Published: (2026)
Doc2Chart: Intent-Driven Zero-Shot Chart Generation from Documents
by: Jain, Akriti, et al.
Published: (2025)
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
by: Zhang, Dong, et al.
Published: (2024)
AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents
by: Xie, Jingxu, et al.
Published: (2025)
PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks
by: Ni, Feng, et al.
Published: (2025)
DocCGen: Document-based Controlled Code Generation
by: Pimparkhede, Sameer, et al.
Published: (2024)
EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions
by: Hong, Spencer, et al.
Published: (2025)
Multi2: Multi-Agent Test-Time Scalable Framework for Multi-Document Processing
by: Cao, Juntai, et al.
Published: (2025)
DeepAgent: A General Reasoning Agent with Scalable Toolsets
by: Li, Xiaoxi, et al.
Published: (2025)
MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers
by: Wang, Zhenting, et al.
Published: (2025)
ToolScope: Enhancing LLM Agent Tool Use through Tool Merging and Context-Aware Filtering
by: Liu, Marianne Menglin, et al.
Published: (2025)
Doc-V*:Coarse-to-Fine Interactive Visual Reasoning for Multi-Page Document VQA
by: Zheng, Yuanlei, et al.
Published: (2026)
VeriAgent: A Tool-Integrated Multi-Agent System with Evolving Memory for PPA-Aware RTL Code Generation
by: Wang, Yaoxiang, et al.
Published: (2026)
DocFusion: A Unified Framework for Document Parsing Tasks
by: Chai, Mingxu, et al.
Published: (2024)
DocDancer: Towards Agentic Document-Grounded Information Seeking
by: Zhang, Qintong, et al.
Published: (2026)
PostDoc: Generating Poster from a Long Multimodal Document Using Deep Submodular Optimization
by: Jaisankar, Vijay, et al.
Published: (2024)
DocCHA: Towards LLM-Augmented Interactive Online diagnosis System
by: Liu, Xinyi, et al.
Published: (2025)
DocTER: Evaluating Document-based Knowledge Editing
by: Wu, Suhang, et al.
Published: (2023)
NaviAgent: Graph-Driven Bilevel Planning for Scalable Tool Orchestration
by: Jiang, Yan, et al.
Published: (2025)
ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical Agents
by: Liao, Yusheng, et al.
Published: (2024)
DocTalk: Scalable Graph-based Dialogue Synthesis for Enhancing LLM Conversational Capabilities
by: Lee, Jing Yang, et al.
Published: (2025)