:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Goel, Shivam, Wei, Yichen, Lymperopoulos, Panagiotis, Chura, Klara, Scheutz, Matthias, Sinapov, Jivko
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2401.03546
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

FLEX: A Framework for Learning Robot-Agnostic Force-based Skills Involving Sustained Contact Object Manipulation
by: Fang, Shijie, et al.
Published: (2025)

Novelty Adaptation Through Hybrid Large Language Model (LLM)-Symbolic Planning and LLM-guided Reinforcement Learning
by: Lu, Hong, et al.
Published: (2026)

Creative Problem Solving in Large Language and Vision Models -- What Would it Take?
by: Nair, Lakshmi, et al.
Published: (2024)

Curiosity-Driven Imagination: Discovering Plan Operators and Learning Associated Policies for Open-World Adaptation
by: Lorang, Pierrick, et al.
Published: (2025)

Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents
by: Shukla, Yash, et al.
Published: (2024)

Towards Reinforcement Learning from Neural Feedback: Mapping fNIRS Signals to Agent Performance
by: Santaniello, Julia, et al.
Published: (2025)

Graph Pruning for Enumeration of Minimal Unsatisfiable Subsets
by: Lymperopoulos, Panagiotis, et al.
Published: (2024)

Are AI Machines Making Humans Obsolete?
by: Scheutz, Matthias
Published: (2025)

Mapping Neural Signals to Agent Performance, A Step Towards Reinforcement Learning from Neural Feedback
by: Santaniello, Julia, et al.
Published: (2025)

The BrowserGym Ecosystem for Web Agent Research
by: De Chezelles, Thibault Le Sellier, et al.
Published: (2024)

Are you with me? A Framework for Detecting Mental Model Discrepancies in Task-Based Team Dialogues
by: Kowalyshyn, Katharine, et al.
Published: (2026)

MOSAIC: Learning Unified Multi-Sensory Object Property Representations for Robot Learning via Interactive Perception
by: Tatiya, Gyan, et al.
Published: (2023)

Tools in the Loop: Quantifying Uncertainty of LLM Question Answering Systems That Use Tools
by: Lymperopoulos, Panagiotis, et al.
Published: (2025)

Build on Priors: Vision--Language--Guided Neuro-Symbolic Imitation Learning for Data-Efficient Real-World Robot Manipulation
by: Lorang, Pierrick, et al.
Published: (2026)

ResearchGym: Evaluating Language Model Agents on Real-World AI Research
by: Garikaparthi, Aniketh, et al.
Published: (2026)

WorldGym: World Model as An Environment for Policy Evaluation
by: Quevedo, Julian, et al.
Published: (2025)

Noise Injection Systemically Degrades Large Language Model Safety Guardrails
by: Shahani, Prithviraj Singh, et al.
Published: (2025)

FormGym: Doing Paperwork with Agents
by: Toles, Matthew, et al.
Published: (2025)

CASSANDRA: Programmatic and Probabilistic Learning and Inference for Stochastic World Modeling
by: Lymperopoulos, Panagiotis, et al.
Published: (2026)

CyberGym: Evaluating AI Agents' Real-World Cybersecurity Capabilities at Scale
by: Wang, Zhun, et al.
Published: (2025)

ViroGym: Realistic Large-Scale Benchmarks for Evaluating Viral Proteins
by: Zhou, Yichen, et al.
Published: (2026)

Smart Language Agents in Real-World Planning
by: Miin, Annabelle, et al.
Published: (2024)

Probing a Vision-Language-Action Model for Symbolic States and Integration into a Cognitive Architecture
by: Lu, Hong, et al.
Published: (2025)

PersonaGym: Evaluating Persona Agents and LLMs
by: Samuel, Vinay, et al.
Published: (2024)

Mini Amusement Parks (MAPs): A Testbed for Modelling Business Decisions
by: Aroca-Ouellette, Stéphane, et al.
Published: (2025)

Gym-Anything: Turn any Software into an Agent Environment
by: Aggarwal, Pranjal, et al.
Published: (2026)

Gym4ReaL: A Suite for Benchmarking Real-World Reinforcement Learning
by: Salaorni, Davide, et al.
Published: (2025)

HDDLGym: A Tool for Studying Multi-Agent Hierarchical Problems Defined in HDDL with OpenAI Gym
by: La, Ngoc, et al.
Published: (2025)

EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies
by: Hu, Xavier, et al.
Published: (2026)

Where Norms and References Collide: Evaluating LLMs on Normative Reasoning
by: Abrams, Mitchell, et al.
Published: (2026)

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
by: Xi, Zhiheng, et al.
Published: (2024)

SearchGym: Bootstrapping Real-World Search Agents via Cost-Effective and High-Fidelity Environment Simulation
by: Zhang, Xichen, et al.
Published: (2026)

InnoGym: Benchmarking the Innovation Potential of AI Agents
by: Zhang, Jintian, et al.
Published: (2025)

Novelty Accommodating Multi-Agent Planning in High Fidelity Simulated Open World
by: Chao, James, et al.
Published: (2023)

CybORG++: An Enhanced Gym for the Development of Autonomous Cyber Agents
by: Emerson, Harry, et al.
Published: (2024)

ClawGym: A Scalable Framework for Building Effective Claw Agents
by: Bai, Fei, et al.
Published: (2026)

EduGym: An Environment and Notebook Suite for Reinforcement Learning Education
by: Moerland, Thomas M., et al.
Published: (2023)

EO-Gym: A Multimodal, Interactive Environment for Earth Observation Agents
by: Ma, Sai, et al.
Published: (2026)

HyPlan: Hybrid Learning-Assisted Planning Under Uncertainty for Safe Autonomous Driving
by: Pfaffmann, Donald, et al.
Published: (2025)

LLMs and their Limited Theory of Mind: Evaluating Mental State Annotations in Situated Dialogue
by: Kowalyshyn, Katharine, et al.
Published: (2025)