Saved in:
| Main Author: | Zhang, Jack |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.05440 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Evolving Interactive Diagnostic Agents in a Virtual Clinical Environment
by: Qiu, Pengcheng, et al.
Published: (2025)
by: Qiu, Pengcheng, et al.
Published: (2025)
Improving Grounded Language Understanding in a Collaborative Environment by Interacting with Agents Through Help Feedback
by: Mehta, Nikhil, et al.
Published: (2023)
by: Mehta, Nikhil, et al.
Published: (2023)
DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments
by: Zhang, Chiyu, et al.
Published: (2025)
by: Zhang, Chiyu, et al.
Published: (2025)
Advancing Language Multi-Agent Learning with Credit Re-Assignment for Interactive Environment Generalization
by: He, Zhitao, et al.
Published: (2025)
by: He, Zhitao, et al.
Published: (2025)
MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models
by: Chen, Justin Chih-Yao, et al.
Published: (2024)
by: Chen, Justin Chih-Yao, et al.
Published: (2024)
MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments
by: Kong, Quyu, et al.
Published: (2025)
by: Kong, Quyu, et al.
Published: (2025)
STAMP: Training Explicit Memory for Mobile GUI Agents in Controllable and Scalable Virtual Environments
by: Wang, Junyang, et al.
Published: (2026)
by: Wang, Junyang, et al.
Published: (2026)
DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents
by: Jansen, Peter, et al.
Published: (2024)
by: Jansen, Peter, et al.
Published: (2024)
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
by: Xi, Zhiheng, et al.
Published: (2024)
by: Xi, Zhiheng, et al.
Published: (2024)
Teaching Language Models to Self-Improve through Interactive Demonstrations
by: Yu, Xiao, et al.
Published: (2023)
by: Yu, Xiao, et al.
Published: (2023)
SPICE: Self-Play In Corpus Environments Improves Reasoning
by: Liu, Bo, et al.
Published: (2025)
by: Liu, Bo, et al.
Published: (2025)
Improving Interactive Diagnostic Ability of a Large Language Model Agent Through Clinical Experience Learning
by: Sun, Zhoujian, et al.
Published: (2025)
by: Sun, Zhoujian, et al.
Published: (2025)
LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments
by: Chen, Junzhe, et al.
Published: (2024)
by: Chen, Junzhe, et al.
Published: (2024)
MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments
by: Cai, Yin, et al.
Published: (2025)
by: Cai, Yin, et al.
Published: (2025)
Language Guided Exploration for RL Agents in Text Environments
by: Golchha, Hitesh, et al.
Published: (2024)
by: Golchha, Hitesh, et al.
Published: (2024)
Can a Unimodal Language Agent Provide Preferences to Tune a Multimodal Vision-Language Model?
by: Mim, Sazia Tabasum, et al.
Published: (2026)
by: Mim, Sazia Tabasum, et al.
Published: (2026)
AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios
by: Mou, Xinyi, et al.
Published: (2024)
by: Mou, Xinyi, et al.
Published: (2024)
NNetNav: Unsupervised Learning of Browser Agents Through Environment Interaction in the Wild
by: Murty, Shikhar, et al.
Published: (2024)
by: Murty, Shikhar, et al.
Published: (2024)
UserBench: An Interactive Gym Environment for User-Centric Agents
by: Qian, Cheng, et al.
Published: (2025)
by: Qian, Cheng, et al.
Published: (2025)
SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents
by: Wang, Ruiyi, et al.
Published: (2024)
by: Wang, Ruiyi, et al.
Published: (2024)
Positive Experience Reflection for Agents in Interactive Text Environments
by: Lippmann, Philip, et al.
Published: (2024)
by: Lippmann, Philip, et al.
Published: (2024)
PSALM-V: Automating Symbolic Planning in Interactive Visual Environments with Large Language Models
by: Zhu, Wang Bill, et al.
Published: (2025)
by: Zhu, Wang Bill, et al.
Published: (2025)
SAUCE: Synchronous and Asynchronous User-Customizable Environment for Multi-Agent LLM Interaction
by: Neuberger, Shlomo, et al.
Published: (2024)
by: Neuberger, Shlomo, et al.
Published: (2024)
Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied Environments
by: Lu, Qingyu, et al.
Published: (2025)
by: Lu, Qingyu, et al.
Published: (2025)
SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions
by: Wagner, Dominik, et al.
Published: (2025)
by: Wagner, Dominik, et al.
Published: (2025)
Emergent Introspective Awareness in Large Language Models
by: Lindsey, Jack
Published: (2026)
by: Lindsey, Jack
Published: (2026)
WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment
by: Tang, Hao, et al.
Published: (2024)
by: Tang, Hao, et al.
Published: (2024)
Recursive Introspection: Teaching Language Model Agents How to Self-Improve
by: Qu, Yuxiao, et al.
Published: (2024)
by: Qu, Yuxiao, et al.
Published: (2024)
Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data
by: Chen, Shan, et al.
Published: (2024)
by: Chen, Shan, et al.
Published: (2024)
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
by: Lu, Christina, et al.
Published: (2026)
by: Lu, Christina, et al.
Published: (2026)
LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents
by: Peng, Xiaoxuan, et al.
Published: (2026)
by: Peng, Xiaoxuan, et al.
Published: (2026)
ChatShop: Interactive Information Seeking with Language Agents
by: Chen, Sanxing, et al.
Published: (2024)
by: Chen, Sanxing, et al.
Published: (2024)
Vision-Language Agents for Interactive Forest Change Analysis
by: Brock, James, et al.
Published: (2026)
by: Brock, James, et al.
Published: (2026)
Rapport-Driven Virtual Agent: Rapport Building Dialogue Strategy for Improving User Experience at First Meeting
by: Baihaqi, Muhammad Yeza, et al.
Published: (2024)
by: Baihaqi, Muhammad Yeza, et al.
Published: (2024)
Learning Virtual Machine Scheduling in Cloud Computing through Language Agents
by: Wu, JieHao, et al.
Published: (2025)
by: Wu, JieHao, et al.
Published: (2025)
Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation
by: Chen, Shan, et al.
Published: (2024)
by: Chen, Shan, et al.
Published: (2024)
Soft Self-Consistency Improves Language Model Agents
by: Wang, Han, et al.
Published: (2024)
by: Wang, Han, et al.
Published: (2024)
Multi-Agent Simulator Drives Language Models for Legal Intensive Interaction
by: Yue, Shengbin, et al.
Published: (2025)
by: Yue, Shengbin, et al.
Published: (2025)
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code
by: Chae, Hyungjoo, et al.
Published: (2024)
by: Chae, Hyungjoo, et al.
Published: (2024)
Multidimensional Consistency Improves Reasoning in Language Models
by: Lai, Huiyuan, et al.
Published: (2025)
by: Lai, Huiyuan, et al.
Published: (2025)
Similar Items
-
Evolving Interactive Diagnostic Agents in a Virtual Clinical Environment
by: Qiu, Pengcheng, et al.
Published: (2025) -
Improving Grounded Language Understanding in a Collaborative Environment by Interacting with Agents Through Help Feedback
by: Mehta, Nikhil, et al.
Published: (2023) -
DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments
by: Zhang, Chiyu, et al.
Published: (2025) -
Advancing Language Multi-Agent Learning with Credit Re-Assignment for Interactive Environment Generalization
by: He, Zhitao, et al.
Published: (2025) -
MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models
by: Chen, Justin Chih-Yao, et al.
Published: (2024)