Saved in:
| Main Authors: | Goel, Shivam, Wei, Yichen, Lymperopoulos, Panagiotis, Chura, Klara, Scheutz, Matthias, Sinapov, Jivko |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.03546 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FLEX: A Framework for Learning Robot-Agnostic Force-based Skills Involving Sustained Contact Object Manipulation
by: Fang, Shijie, et al.
Published: (2025)
by: Fang, Shijie, et al.
Published: (2025)
Novelty Adaptation Through Hybrid Large Language Model (LLM)-Symbolic Planning and LLM-guided Reinforcement Learning
by: Lu, Hong, et al.
Published: (2026)
by: Lu, Hong, et al.
Published: (2026)
Creative Problem Solving in Large Language and Vision Models -- What Would it Take?
by: Nair, Lakshmi, et al.
Published: (2024)
by: Nair, Lakshmi, et al.
Published: (2024)
Curiosity-Driven Imagination: Discovering Plan Operators and Learning Associated Policies for Open-World Adaptation
by: Lorang, Pierrick, et al.
Published: (2025)
by: Lorang, Pierrick, et al.
Published: (2025)
Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents
by: Shukla, Yash, et al.
Published: (2024)
by: Shukla, Yash, et al.
Published: (2024)
Towards Reinforcement Learning from Neural Feedback: Mapping fNIRS Signals to Agent Performance
by: Santaniello, Julia, et al.
Published: (2025)
by: Santaniello, Julia, et al.
Published: (2025)
Graph Pruning for Enumeration of Minimal Unsatisfiable Subsets
by: Lymperopoulos, Panagiotis, et al.
Published: (2024)
by: Lymperopoulos, Panagiotis, et al.
Published: (2024)
Are AI Machines Making Humans Obsolete?
by: Scheutz, Matthias
Published: (2025)
by: Scheutz, Matthias
Published: (2025)
Mapping Neural Signals to Agent Performance, A Step Towards Reinforcement Learning from Neural Feedback
by: Santaniello, Julia, et al.
Published: (2025)
by: Santaniello, Julia, et al.
Published: (2025)
The BrowserGym Ecosystem for Web Agent Research
by: De Chezelles, Thibault Le Sellier, et al.
Published: (2024)
by: De Chezelles, Thibault Le Sellier, et al.
Published: (2024)
Are you with me? A Framework for Detecting Mental Model Discrepancies in Task-Based Team Dialogues
by: Kowalyshyn, Katharine, et al.
Published: (2026)
by: Kowalyshyn, Katharine, et al.
Published: (2026)
MOSAIC: Learning Unified Multi-Sensory Object Property Representations for Robot Learning via Interactive Perception
by: Tatiya, Gyan, et al.
Published: (2023)
by: Tatiya, Gyan, et al.
Published: (2023)
Tools in the Loop: Quantifying Uncertainty of LLM Question Answering Systems That Use Tools
by: Lymperopoulos, Panagiotis, et al.
Published: (2025)
by: Lymperopoulos, Panagiotis, et al.
Published: (2025)
Build on Priors: Vision--Language--Guided Neuro-Symbolic Imitation Learning for Data-Efficient Real-World Robot Manipulation
by: Lorang, Pierrick, et al.
Published: (2026)
by: Lorang, Pierrick, et al.
Published: (2026)
ResearchGym: Evaluating Language Model Agents on Real-World AI Research
by: Garikaparthi, Aniketh, et al.
Published: (2026)
by: Garikaparthi, Aniketh, et al.
Published: (2026)
WorldGym: World Model as An Environment for Policy Evaluation
by: Quevedo, Julian, et al.
Published: (2025)
by: Quevedo, Julian, et al.
Published: (2025)
Noise Injection Systemically Degrades Large Language Model Safety Guardrails
by: Shahani, Prithviraj Singh, et al.
Published: (2025)
by: Shahani, Prithviraj Singh, et al.
Published: (2025)
FormGym: Doing Paperwork with Agents
by: Toles, Matthew, et al.
Published: (2025)
by: Toles, Matthew, et al.
Published: (2025)
CASSANDRA: Programmatic and Probabilistic Learning and Inference for Stochastic World Modeling
by: Lymperopoulos, Panagiotis, et al.
Published: (2026)
by: Lymperopoulos, Panagiotis, et al.
Published: (2026)
CyberGym: Evaluating AI Agents' Real-World Cybersecurity Capabilities at Scale
by: Wang, Zhun, et al.
Published: (2025)
by: Wang, Zhun, et al.
Published: (2025)
ViroGym: Realistic Large-Scale Benchmarks for Evaluating Viral Proteins
by: Zhou, Yichen, et al.
Published: (2026)
by: Zhou, Yichen, et al.
Published: (2026)
Smart Language Agents in Real-World Planning
by: Miin, Annabelle, et al.
Published: (2024)
by: Miin, Annabelle, et al.
Published: (2024)
Probing a Vision-Language-Action Model for Symbolic States and Integration into a Cognitive Architecture
by: Lu, Hong, et al.
Published: (2025)
by: Lu, Hong, et al.
Published: (2025)
PersonaGym: Evaluating Persona Agents and LLMs
by: Samuel, Vinay, et al.
Published: (2024)
by: Samuel, Vinay, et al.
Published: (2024)
Mini Amusement Parks (MAPs): A Testbed for Modelling Business Decisions
by: Aroca-Ouellette, Stéphane, et al.
Published: (2025)
by: Aroca-Ouellette, Stéphane, et al.
Published: (2025)
Gym-Anything: Turn any Software into an Agent Environment
by: Aggarwal, Pranjal, et al.
Published: (2026)
by: Aggarwal, Pranjal, et al.
Published: (2026)
Gym4ReaL: A Suite for Benchmarking Real-World Reinforcement Learning
by: Salaorni, Davide, et al.
Published: (2025)
by: Salaorni, Davide, et al.
Published: (2025)
HDDLGym: A Tool for Studying Multi-Agent Hierarchical Problems Defined in HDDL with OpenAI Gym
by: La, Ngoc, et al.
Published: (2025)
by: La, Ngoc, et al.
Published: (2025)
EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies
by: Hu, Xavier, et al.
Published: (2026)
by: Hu, Xavier, et al.
Published: (2026)
Where Norms and References Collide: Evaluating LLMs on Normative Reasoning
by: Abrams, Mitchell, et al.
Published: (2026)
by: Abrams, Mitchell, et al.
Published: (2026)
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
by: Xi, Zhiheng, et al.
Published: (2024)
by: Xi, Zhiheng, et al.
Published: (2024)
SearchGym: Bootstrapping Real-World Search Agents via Cost-Effective and High-Fidelity Environment Simulation
by: Zhang, Xichen, et al.
Published: (2026)
by: Zhang, Xichen, et al.
Published: (2026)
InnoGym: Benchmarking the Innovation Potential of AI Agents
by: Zhang, Jintian, et al.
Published: (2025)
by: Zhang, Jintian, et al.
Published: (2025)
Novelty Accommodating Multi-Agent Planning in High Fidelity Simulated Open World
by: Chao, James, et al.
Published: (2023)
by: Chao, James, et al.
Published: (2023)
CybORG++: An Enhanced Gym for the Development of Autonomous Cyber Agents
by: Emerson, Harry, et al.
Published: (2024)
by: Emerson, Harry, et al.
Published: (2024)
ClawGym: A Scalable Framework for Building Effective Claw Agents
by: Bai, Fei, et al.
Published: (2026)
by: Bai, Fei, et al.
Published: (2026)
EduGym: An Environment and Notebook Suite for Reinforcement Learning Education
by: Moerland, Thomas M., et al.
Published: (2023)
by: Moerland, Thomas M., et al.
Published: (2023)
EO-Gym: A Multimodal, Interactive Environment for Earth Observation Agents
by: Ma, Sai, et al.
Published: (2026)
by: Ma, Sai, et al.
Published: (2026)
HyPlan: Hybrid Learning-Assisted Planning Under Uncertainty for Safe Autonomous Driving
by: Pfaffmann, Donald, et al.
Published: (2025)
by: Pfaffmann, Donald, et al.
Published: (2025)
LLMs and their Limited Theory of Mind: Evaluating Mental State Annotations in Situated Dialogue
by: Kowalyshyn, Katharine, et al.
Published: (2025)
by: Kowalyshyn, Katharine, et al.
Published: (2025)
Similar Items
-
FLEX: A Framework for Learning Robot-Agnostic Force-based Skills Involving Sustained Contact Object Manipulation
by: Fang, Shijie, et al.
Published: (2025) -
Novelty Adaptation Through Hybrid Large Language Model (LLM)-Symbolic Planning and LLM-guided Reinforcement Learning
by: Lu, Hong, et al.
Published: (2026) -
Creative Problem Solving in Large Language and Vision Models -- What Would it Take?
by: Nair, Lakshmi, et al.
Published: (2024) -
Curiosity-Driven Imagination: Discovering Plan Operators and Learning Associated Policies for Open-World Adaptation
by: Lorang, Pierrick, et al.
Published: (2025) -
Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents
by: Shukla, Yash, et al.
Published: (2024)