Library Catalog

Bibliographic Details
Main Authors:	Zhang, Jenny; Zhao, Bingchen; Yang, Wannan; Foerster, Jakob; Clune, Jeff; Jiang, Minqi; Devlin, Sam; Shavrina, Tatiana
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2603.19461

Similar Items

APRES: An Agentic Paper Revision and Evaluation System
by: Zhao, Bingchen, et al.
Published: (2026)

OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code
by: Faldor, Maxence, et al.
Published: (2024)

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
by: Lu, Chris, et al.
Published: (2024)

OMNI: Open-endedness via Models of human Notions of Interestingness
by: Zhang, Jenny, et al.
Published: (2023)

Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents
by: Zhang, Jenny, et al.
Published: (2025)

Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
by: Hu, Shengran, et al.
Published: (2023)

First-Explore, then Exploit: Meta-Learning to Solve Hard Exploration-Exploitation Trade-Offs
by: Norman, Ben, et al.
Published: (2023)

Asking the Right Questions: Improving Reasoning with Generated Stepping Stones
by: Hu, Hengyuan, et al.
Published: (2026)

Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization
by: Ding, Li, et al.
Published: (2023)

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
by: Yamada, Yutaro, et al.
Published: (2025)

Refining Minimax Regret for Unsupervised Environment Design
by: Beukman, Michael, et al.
Published: (2024)

Learning to Continually Learn via Meta-learning Agentic Memory Designs
by: Xiong, Yiming, et al.
Published: (2026)

Automated Design of Agentic Systems
by: Hu, Shengran, et al.
Published: (2024)

Foundation Model Self-Play: Open-Ended Strategy Innovation via Foundation Models
by: Dharna, Aaron, et al.
Published: (2025)

Automated Capability Discovery via Foundation Model Self-Exploration
by: Lu, Cong, et al.
Published: (2025)

Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
by: Lu, Cong, et al.
Published: (2024)

The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements
by: Zhao, Bingchen, et al.
Published: (2025)

AI & Human Co-Improvement for Safer Co-Superintelligence
by: Weston, Jason, et al.
Published: (2025)

Continual learning under domain transfer with sparse synaptic bursting
by: Beaulieu, Shawn L., et al.
Published: (2021)

Learning to Act without Actions
by: Schmidt, Dominik, et al.
Published: (2023)

PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition
by: Zhang, Ziyang, et al.
Published: (2024)

JaxUED: A simple and useable UED library in Jax
by: Coward, Samuel, et al.
Published: (2024)

QuanForge: A Mutation Testing Framework for Quantum Neural Networks
by: Shao, Minqi, et al.
Published: (2026)

A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem
by: Barde, Paul, et al.
Published: (2023)

AgentBreeder: Mitigating the AI Safety Risks of Multi-Agent Scaffolds via Self-Improvement
by: Rosser, J, et al.
Published: (2025)

MLGym: A New Framework and Benchmark for Advancing AI Research Agents
by: Nathani, Deepak, et al.
Published: (2025)

AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench
by: Toledo, Edan, et al.
Published: (2025)

Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
by: Matthews, Michael, et al.
Published: (2024)

AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability
by: Yang, Siwei, et al.
Published: (2024)

The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind
by: Lupu, Andrei, et al.
Published: (2025)

JaxLife: An Open-Ended Agentic Simulator
by: Lu, Chris, et al.
Published: (2024)

Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis
by: Zhang, Letian, et al.
Published: (2025)

Hallucination reduction with CASAL: Contrastive Activation Steering For Amortized Learning
by: Yang, Wannan, et al.
Published: (2025)

The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
by: Sims, Anya, et al.
Published: (2024)

Mirror Learning: A Unifying Framework of Policy Optimisation
by: Kuba, Jakub Grudzien, et al.
Published: (2022)

Learning Multi-Agent Communication with Contrastive Learning
by: Lo, Yat Long, et al.
Published: (2023)

The Generalization Gap in Offline Reinforcement Learning
by: Mediratta, Ishita, et al.
Published: (2023)

minimax: Efficient Baselines for Autocurricula in JAX
by: Jiang, Minqi, et al.
Published: (2023)

Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
by: Samvelyan, Mikayel, et al.
Published: (2024)

SpecBench: Measuring Reward Hacking in Long-Horizon Coding Agents
by: Zhao, Bingchen, et al.
Published: (2026)