| Main Authors: | Liu, Zhiwei, Yao, Weiran, Zhang, Jianguo, Yang, Liangwei, Liu, Zuxin, Tan, Juntao, Choubey, Prafulla K., Lan, Tian, Wu, Jason, Wang, Huan, Heinecke, Shelby, Xiong, Caiming, Savarese, Silvio |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2402.15538 |
Similar Items
PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
by: Liu, Zhiwei, et al.
Published: (2024)
Test-Time Adaptation for LLM Agents via Environment Interaction
by: Chen, Arthur, et al.
Published: (2025)
UserBench: An Interactive Gym Environment for User-Centric Agents
by: Qian, Cheng, et al.
Published: (2025)
MCPEval: Automatic MCP-based Deep Evaluation for AI Agent Models
by: Liu, Zhiwei, et al.
Published: (2025)
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
by: Zhang, Jianguo, et al.
Published: (2024)
ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning
by: Yue, Murong, et al.
Published: (2025)
PersonaBench: Evaluating AI Models on Understanding Personal Information through Accessing (Synthetic) Private User Data
by: Tan, Juntao, et al.
Published: (2025)
Building Enterprise Realtime Voice Agents from Scratch: A Technical Tutorial
by: Qiu, Jielin, et al.
Published: (2026)
Entropy-Based Block Pruning for Efficient Large Language Models
by: Yang, Liangwei, et al.
Published: (2025)
UserRL: Training Interactive User-Centric Agent via Reinforcement Learning
by: Qian, Cheng, et al.
Published: (2025)
ActionStudio: A Lightweight Framework for Data and Training of Large Action Models
by: Zhang, Jianguo, et al.
Published: (2025)
LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
by: Qiu, Jielin, et al.
Published: (2025)
APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
by: Liu, Zuxin, et al.
Published: (2024)
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay
by: Prabhakar, Akshara, et al.
Published: (2025)
LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
by: Qiu, Jielin, et al.
Published: (2025)
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
by: Zhang, Kexun, et al.
Published: (2024)
Promptomatix: An Automatic Prompt Optimization Framework for Large Language Models
by: Murthy, Rithesh, et al.
Published: (2025)
RealUserSim: Bridging the Reality Gap in Agent Benchmarking via Grounded User Simulation
by: Zhu, Ming, et al.
Published: (2026)
A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration
by: Liu, Zijun, et al.
Published: (2023)
VoiceAgentRAG: Solving the RAG Latency Bottleneck in Real-Time Voice Agents Using Dual-Agent Architectures
by: Qiu, Jielin, et al.
Published: (2026)
INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness
by: Le, Hung, et al.
Published: (2024)
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding
by: Chen, Haolin, et al.
Published: (2024)
Personalized Multi-task Training for Recommender System
by: Yang, Liangwei, et al.
Published: (2024)
ToolScan: A Benchmark for Characterizing Errors in Tool-Use LLMs
by: Kokane, Shirley, et al.
Published: (2024)
xRouter: Training Cost-Aware LLMs Orchestration System via Reinforcement Learning
by: Qian, Cheng, et al.
Published: (2025)
xLAM: A Family of Large Action Models to Empower AI Agent Systems
by: Zhang, Jianguo, et al.
Published: (2024)
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI
by: Zhang, Jianguo, et al.
Published: (2023)
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
by: Yao, Weiran, et al.
Published: (2023)
MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases
by: Murthy, Rithesh, et al.
Published: (2024)
LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications
by: Zhang, Danqing, et al.
Published: (2025)
Recursive Multi-Agent Trading System: Iterative Optimized Portfolio Strategy Under Geopolitical Uncertainty
by: Yang, Jing, et al.
Published: (2026)
Causal Layering via Conditional Entropy
by: Feigenbaum, Itai, et al.
Published: (2024)
Editing Arbitrary Propositions in LLMs without Subject Labels
by: Feigenbaum, Itai, et al.
Published: (2024)
EpochX: Building the Infrastructure for an Emergent Agent Civilization
by: Wang, Huacan, et al.
Published: (2026)
Position: Vector Prompt Interfaces Should Be Exposed to Enable Customization of Large Language Models
by: Yang, Liangwei, et al.
Published: (2026)
Empowering LLMs in Task-Oriented Dialogues: A Domain-Independent Multi-Agent Framework and Fine-Tuning Strategy
by: Feng, Zihao, et al.
Published: (2025)
ENGRAM: Effective, Lightweight Memory Orchestration for Conversational Agents
by: Patel, Daivik, et al.
Published: (2025)
Efficient Agents: Building Effective Agents While Reducing Cost
by: Wang, Ningning, et al.
Published: (2025)
LATTE: Learning to Think with Vision Specialists
by: Ma, Zixian, et al.
Published: (2024)
Agent-Oriented Planning in Multi-Agent Systems
by: Li, Ao, et al.
Published: (2024)