:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Zhang, Jack
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2402.05440
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Evolving Interactive Diagnostic Agents in a Virtual Clinical Environment
by: Qiu, Pengcheng, et al.
Published: (2025)

Improving Grounded Language Understanding in a Collaborative Environment by Interacting with Agents Through Help Feedback
by: Mehta, Nikhil, et al.
Published: (2023)

DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments
by: Zhang, Chiyu, et al.
Published: (2025)

Advancing Language Multi-Agent Learning with Credit Re-Assignment for Interactive Environment Generalization
by: He, Zhitao, et al.
Published: (2025)

MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models
by: Chen, Justin Chih-Yao, et al.
Published: (2024)

MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments
by: Kong, Quyu, et al.
Published: (2025)

STAMP: Training Explicit Memory for Mobile GUI Agents in Controllable and Scalable Virtual Environments
by: Wang, Junyang, et al.
Published: (2026)

DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents
by: Jansen, Peter, et al.
Published: (2024)

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
by: Xi, Zhiheng, et al.
Published: (2024)

Teaching Language Models to Self-Improve through Interactive Demonstrations
by: Yu, Xiao, et al.
Published: (2023)

SPICE: Self-Play In Corpus Environments Improves Reasoning
by: Liu, Bo, et al.
Published: (2025)

Improving Interactive Diagnostic Ability of a Large Language Model Agent Through Clinical Experience Learning
by: Sun, Zhoujian, et al.
Published: (2025)

LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments
by: Chen, Junzhe, et al.
Published: (2024)

MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments
by: Cai, Yin, et al.
Published: (2025)

Language Guided Exploration for RL Agents in Text Environments
by: Golchha, Hitesh, et al.
Published: (2024)

Can a Unimodal Language Agent Provide Preferences to Tune a Multimodal Vision-Language Model?
by: Mim, Sazia Tabasum, et al.
Published: (2026)

AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios
by: Mou, Xinyi, et al.
Published: (2024)

NNetNav: Unsupervised Learning of Browser Agents Through Environment Interaction in the Wild
by: Murty, Shikhar, et al.
Published: (2024)

UserBench: An Interactive Gym Environment for User-Centric Agents
by: Qian, Cheng, et al.
Published: (2025)

SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents
by: Wang, Ruiyi, et al.
Published: (2024)

Positive Experience Reflection for Agents in Interactive Text Environments
by: Lippmann, Philip, et al.
Published: (2024)

PSALM-V: Automating Symbolic Planning in Interactive Visual Environments with Large Language Models
by: Zhu, Wang Bill, et al.
Published: (2025)

SAUCE: Synchronous and Asynchronous User-Customizable Environment for Multi-Agent LLM Interaction
by: Neuberger, Shlomo, et al.
Published: (2024)

Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied Environments
by: Lu, Qingyu, et al.
Published: (2025)

SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions
by: Wagner, Dominik, et al.
Published: (2025)

Emergent Introspective Awareness in Large Language Models
by: Lindsey, Jack
Published: (2026)

WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment
by: Tang, Hao, et al.
Published: (2024)

Recursive Introspection: Teaching Language Model Agents How to Self-Improve
by: Qu, Yuxiao, et al.
Published: (2024)

Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data
by: Chen, Shan, et al.
Published: (2024)

The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
by: Lu, Christina, et al.
Published: (2026)

LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents
by: Peng, Xiaoxuan, et al.
Published: (2026)

ChatShop: Interactive Information Seeking with Language Agents
by: Chen, Sanxing, et al.
Published: (2024)

Vision-Language Agents for Interactive Forest Change Analysis
by: Brock, James, et al.
Published: (2026)

Rapport-Driven Virtual Agent: Rapport Building Dialogue Strategy for Improving User Experience at First Meeting
by: Baihaqi, Muhammad Yeza, et al.
Published: (2024)

Learning Virtual Machine Scheduling in Cloud Computing through Language Agents
by: Wu, JieHao, et al.
Published: (2025)

Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation
by: Chen, Shan, et al.
Published: (2024)

Soft Self-Consistency Improves Language Model Agents
by: Wang, Han, et al.
Published: (2024)

Multi-Agent Simulator Drives Language Models for Legal Intensive Interaction
by: Yue, Shengbin, et al.
Published: (2025)

Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code
by: Chae, Hyungjoo, et al.
Published: (2024)

Multidimensional Consistency Improves Reasoning in Language Models
by: Lai, Huiyuan, et al.
Published: (2025)