:: Library Catalog

Bibliographic Details
Main Authors:	Liu, Zhiwei; Yao, Weiran; Zhang, Jianguo; Yang, Liangwei; Liu, Zuxin; Tan, Juntao; Choubey, Prafulla K.; Lan, Tian; Wu, Jason; Wang, Huan; Heinecke, Shelby; Xiong, Caiming; Savarese, Silvio
Format:	Preprint
Published:	2024
Subjects:	Multiagent Systems; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2402.15538

Similar Items

PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
by: Liu, Zhiwei, et al.
Published: (2024)

Test-Time Adaptation for LLM Agents via Environment Interaction
by: Chen, Arthur, et al.
Published: (2025)

UserBench: An Interactive Gym Environment for User-Centric Agents
by: Qian, Cheng, et al.
Published: (2025)

MCPEval: Automatic MCP-based Deep Evaluation for AI Agent Models
by: Liu, Zhiwei, et al.
Published: (2025)

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
by: Zhang, Jianguo, et al.
Published: (2024)

ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning
by: Yue, Murong, et al.
Published: (2025)

PersonaBench: Evaluating AI Models on Understanding Personal Information through Accessing (Synthetic) Private User Data
by: Tan, Juntao, et al.
Published: (2025)

Building Enterprise Realtime Voice Agents from Scratch: A Technical Tutorial
by: Qiu, Jielin, et al.
Published: (2026)

Entropy-Based Block Pruning for Efficient Large Language Models
by: Yang, Liangwei, et al.
Published: (2025)

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning
by: Qian, Cheng, et al.
Published: (2025)

ActionStudio: A Lightweight Framework for Data and Training of Large Action Models
by: Zhang, Jianguo, et al.
Published: (2025)

LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
by: Qiu, Jielin, et al.
Published: (2025)

APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
by: Liu, Zuxin, et al.
Published: (2024)

APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay
by: Prabhakar, Akshara, et al.
Published: (2025)

LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
by: Qiu, Jielin, et al.
Published: (2025)

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
by: Zhang, Kexun, et al.
Published: (2024)

Promptomatix: An Automatic Prompt Optimization Framework for Large Language Models
by: Murthy, Rithesh, et al.
Published: (2025)

RealUserSim: Bridging the Reality Gap in Agent Benchmarking via Grounded User Simulation
by: Zhu, Ming, et al.
Published: (2026)

A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration
by: Liu, Zijun, et al.
Published: (2023)

VoiceAgentRAG: Solving the RAG Latency Bottleneck in Real-Time Voice Agents Using Dual-Agent Architectures
by: Qiu, Jielin, et al.
Published: (2026)

INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness
by: Le, Hung, et al.
Published: (2024)

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding
by: Chen, Haolin, et al.
Published: (2024)

Personalized Multi-task Training for Recommender System
by: Yang, Liangwei, et al.
Published: (2024)

ToolScan: A Benchmark for Characterizing Errors in Tool-Use LLMs
by: Kokane, Shirley, et al.
Published: (2024)

xRouter: Training Cost-Aware LLMs Orchestration System via Reinforcement Learning
by: Qian, Cheng, et al.
Published: (2025)

xLAM: A Family of Large Action Models to Empower AI Agent Systems
by: Zhang, Jianguo, et al.
Published: (2024)

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI
by: Zhang, Jianguo, et al.
Published: (2023)

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
by: Yao, Weiran, et al.
Published: (2023)

MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases
by: Murthy, Rithesh, et al.
Published: (2024)

LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications
by: Zhang, Danqing, et al.
Published: (2025)

Recursive Multi-Agent Trading System: Iterative Optimized Portfolio Strategy Under Geopolitical Uncertainty
by: Yang, Jing, et al.
Published: (2026)

Causal Layering via Conditional Entropy
by: Feigenbaum, Itai, et al.
Published: (2024)

Editing Arbitrary Propositions in LLMs without Subject Labels
by: Feigenbaum, Itai, et al.
Published: (2024)

EpochX: Building the Infrastructure for an Emergent Agent Civilization
by: Wang, Huacan, et al.
Published: (2026)

Position: Vector Prompt Interfaces Should Be Exposed to Enable Customization of Large Language Models
by: Yang, Liangwei, et al.
Published: (2026)

Empowering LLMs in Task-Oriented Dialogues: A Domain-Independent Multi-Agent Framework and Fine-Tuning Strategy
by: Feng, Zihao, et al.
Published: (2025)

ENGRAM: Effective, Lightweight Memory Orchestration for Conversational Agents
by: Patel, Daivik, et al.
Published: (2025)

Efficient Agents: Building Effective Agents While Reducing Cost
by: Wang, Ningning, et al.
Published: (2025)

LATTE: Learning to Think with Vision Specialists
by: Ma, Zixian, et al.
Published: (2024)

Agent-Oriented Planning in Multi-Agent Systems
by: Li, Ao, et al.
Published: (2024)