Saved in:
| Main Authors: | Zhang, Jenny, Zhao, Bingchen, Yang, Wannan, Foerster, Jakob, Clune, Jeff, Jiang, Minqi, Devlin, Sam, Shavrina, Tatiana |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.19461 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
APRES: An Agentic Paper Revision and Evaluation System
by: Zhao, Bingchen, et al.
Published: (2026)
by: Zhao, Bingchen, et al.
Published: (2026)
OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code
by: Faldor, Maxence, et al.
Published: (2024)
by: Faldor, Maxence, et al.
Published: (2024)
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
by: Lu, Chris, et al.
Published: (2024)
by: Lu, Chris, et al.
Published: (2024)
OMNI: Open-endedness via Models of human Notions of Interestingness
by: Zhang, Jenny, et al.
Published: (2023)
by: Zhang, Jenny, et al.
Published: (2023)
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents
by: Zhang, Jenny, et al.
Published: (2025)
by: Zhang, Jenny, et al.
Published: (2025)
Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
by: Hu, Shengran, et al.
Published: (2023)
by: Hu, Shengran, et al.
Published: (2023)
First-Explore, then Exploit: Meta-Learning to Solve Hard Exploration-Exploitation Trade-Offs
by: Norman, Ben, et al.
Published: (2023)
by: Norman, Ben, et al.
Published: (2023)
Asking the Right Questions: Improving Reasoning with Generated Stepping Stones
by: Hu, Hengyuan, et al.
Published: (2026)
by: Hu, Hengyuan, et al.
Published: (2026)
Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization
by: Ding, Li, et al.
Published: (2023)
by: Ding, Li, et al.
Published: (2023)
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
by: Yamada, Yutaro, et al.
Published: (2025)
by: Yamada, Yutaro, et al.
Published: (2025)
Refining Minimax Regret for Unsupervised Environment Design
by: Beukman, Michael, et al.
Published: (2024)
by: Beukman, Michael, et al.
Published: (2024)
Learning to Continually Learn via Meta-learning Agentic Memory Designs
by: Xiong, Yiming, et al.
Published: (2026)
by: Xiong, Yiming, et al.
Published: (2026)
Automated Design of Agentic Systems
by: Hu, Shengran, et al.
Published: (2024)
by: Hu, Shengran, et al.
Published: (2024)
Foundation Model Self-Play: Open-Ended Strategy Innovation via Foundation Models
by: Dharna, Aaron, et al.
Published: (2025)
by: Dharna, Aaron, et al.
Published: (2025)
Automated Capability Discovery via Foundation Model Self-Exploration
by: Lu, Cong, et al.
Published: (2025)
by: Lu, Cong, et al.
Published: (2025)
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
by: Lu, Cong, et al.
Published: (2024)
by: Lu, Cong, et al.
Published: (2024)
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements
by: Zhao, Bingchen, et al.
Published: (2025)
by: Zhao, Bingchen, et al.
Published: (2025)
AI & Human Co-Improvement for Safer Co-Superintelligence
by: Weston, Jason, et al.
Published: (2025)
by: Weston, Jason, et al.
Published: (2025)
Continual learning under domain transfer with sparse synaptic bursting
by: Beaulieu, Shawn L., et al.
Published: (2021)
by: Beaulieu, Shawn L., et al.
Published: (2021)
Learning to Act without Actions
by: Schmidt, Dominik, et al.
Published: (2023)
by: Schmidt, Dominik, et al.
Published: (2023)
PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition
by: Zhang, Ziyang, et al.
Published: (2024)
by: Zhang, Ziyang, et al.
Published: (2024)
JaxUED: A simple and useable UED library in Jax
by: Coward, Samuel, et al.
Published: (2024)
by: Coward, Samuel, et al.
Published: (2024)
QuanForge: A Mutation Testing Framework for Quantum Neural Networks
by: Shao, Minqi, et al.
Published: (2026)
by: Shao, Minqi, et al.
Published: (2026)
A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem
by: Barde, Paul, et al.
Published: (2023)
by: Barde, Paul, et al.
Published: (2023)
AgentBreeder: Mitigating the AI Safety Risks of Multi-Agent Scaffolds via Self-Improvement
by: Rosser, J, et al.
Published: (2025)
by: Rosser, J, et al.
Published: (2025)
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
by: Nathani, Deepak, et al.
Published: (2025)
by: Nathani, Deepak, et al.
Published: (2025)
AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench
by: Toledo, Edan, et al.
Published: (2025)
by: Toledo, Edan, et al.
Published: (2025)
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
by: Matthews, Michael, et al.
Published: (2024)
by: Matthews, Michael, et al.
Published: (2024)
AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability
by: Yang, Siwei, et al.
Published: (2024)
by: Yang, Siwei, et al.
Published: (2024)
The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind
by: Lupu, Andrei, et al.
Published: (2025)
by: Lupu, Andrei, et al.
Published: (2025)
JaxLife: An Open-Ended Agentic Simulator
by: Lu, Chris, et al.
Published: (2024)
by: Lu, Chris, et al.
Published: (2024)
Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis
by: Zhang, Letian, et al.
Published: (2025)
by: Zhang, Letian, et al.
Published: (2025)
Hallucination reduction with CASAL: Contrastive Activation Steering For Amortized Learning
by: Wannan, et al.
Published: (2025)
by: Wannan, et al.
Published: (2025)
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
by: Sims, Anya, et al.
Published: (2024)
by: Sims, Anya, et al.
Published: (2024)
Mirror Learning: A Unifying Framework of Policy Optimisation
by: Kuba, Jakub Grudzien, et al.
Published: (2022)
by: Kuba, Jakub Grudzien, et al.
Published: (2022)
Learning Multi-Agent Communication with Contrastive Learning
by: Lo, Yat Long, et al.
Published: (2023)
by: Lo, Yat Long, et al.
Published: (2023)
The Generalization Gap in Offline Reinforcement Learning
by: Mediratta, Ishita, et al.
Published: (2023)
by: Mediratta, Ishita, et al.
Published: (2023)
minimax: Efficient Baselines for Autocurricula in JAX
by: Jiang, Minqi, et al.
Published: (2023)
by: Jiang, Minqi, et al.
Published: (2023)
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
by: Samvelyan, Mikayel, et al.
Published: (2024)
by: Samvelyan, Mikayel, et al.
Published: (2024)
SpecBench: Measuring Reward Hacking in Long-Horizon Coding Agents
by: Zhao, Bingchen, et al.
Published: (2026)
by: Zhao, Bingchen, et al.
Published: (2026)
Similar Items
-
APRES: An Agentic Paper Revision and Evaluation System
by: Zhao, Bingchen, et al.
Published: (2026) -
OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code
by: Faldor, Maxence, et al.
Published: (2024) -
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
by: Lu, Chris, et al.
Published: (2024) -
OMNI: Open-endedness via Models of human Notions of Interestingness
by: Zhang, Jenny, et al.
Published: (2023) -
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents
by: Zhang, Jenny, et al.
Published: (2025)