:: Library Catalog

Bibliographic Details
Main Author:	Chojecki, Przemyslaw
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence; Machine Learning
Online Access:	https://arxiv.org/abs/2512.02731

Similar Items

Psychometric Tests for AI Agents and Their Moduli Space
by: Chojecki, Przemyslaw
Published: (2025)

Mathematics and Coding are Universal AI Benchmarks
by: Chojecki, Przemyslaw
Published: (2025)

The Geometry of Benchmarks: A New Path Toward AGI
by: Chojecki, Przemyslaw
Published: (2025)

An Operational Kardashev-Style Scale for Autonomous AI - Towards AGI and Superintelligence
by: Chojecki, Przemyslaw
Published: (2025)

Learning Robust Reasoning through Guided Adversarial Self-Play
by: Li, Shuozhe, et al.
Published: (2026)

On The Statistical Limits of Self-Improving Agents
by: Wang, Charles L., et al.
Published: (2025)

Toward Training Superintelligent Software Agents through Self-Play SWE-RL
by: Wei, Yuxiang, et al.
Published: (2025)

VideoAgent: Self-Improving Video Generation
by: Soni, Achint, et al.
Published: (2024)

Superhuman AI for Stratego Using Self-Play Reinforcement Learning and Test-Time Search
by: Sokota, Samuel, et al.
Published: (2025)

Experiential Reflective Learning for Self-Improving LLM Agents
by: Allard, Marc-Antoine, et al.
Published: (2026)

Memory Self-Regeneration: Uncovering Hidden Knowledge in Unlearned Models
by: Polowczyk, Agnieszka, et al.
Published: (2025)

Robust Autonomy Emerges from Self-Play
by: Cusumano-Towner, Marco, et al.
Published: (2025)

Continual Harness: Online Adaptation for Self-Improving Foundation Agents
by: Karten, Seth, et al.
Published: (2026)

TriPlay-RL: Tri-Role Self-Play Reinforcement Learning for LLM Safety Alignment
by: Tan, Zhewen, et al.
Published: (2026)

Self-Improving LLM Agents at Test-Time
by: Acikgoz, Emre Can, et al.
Published: (2025)

RSPO: Regularized Self-Play Alignment of Large Language Models
by: Tang, Xiaohang, et al.
Published: (2025)

Self-Harmony: Learning to Harmonize Self-Supervision and Self-Play in Test-Time Reinforcement Learning
by: Wang, Ru, et al.
Published: (2025)

Self-Play Reinforcement Learning under Imperfect Information in Big 2
by: Patwa, Aalok
Published: (2026)

Soft Self-Consistency Improves Language Model Agents
by: Wang, Han, et al.
Published: (2024)

Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance
by: He, Yufei, et al.
Published: (2025)

Model Science: getting serious about verification, explanation and control of AI systems
by: Biecek, Przemyslaw, et al.
Published: (2025)

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
by: Cheng, Jiale, et al.
Published: (2024)

Self-Play Preference Optimization for Language Model Alignment
by: Wu, Yue, et al.
Published: (2024)

CoSPlay: Cooperative Self-Play at Test-Time with Self-Generated Code and Unit Test
by: Hu, Zhangyi, et al.
Published: (2026)

Improving LLM Code Reasoning via Semantic Equivalence Self-Play with Formal Verification
by: Barone, Antonio Valerio Miceli, et al.
Published: (2026)

Foundation Model Self-Play: Open-Ended Strategy Innovation via Foundation Models
by: Dharna, Aaron, et al.
Published: (2025)

WIST: Web-Grounded Iterative Self-Play Tree for Domain-Targeted Reasoning Improvement
by: Li, Fangyuan, et al.
Published: (2026)

Self-Improving Robust Preference Optimization
by: Choi, Eugene, et al.
Published: (2024)

Heterogeneous Self-Play for Realistic Highway Traffic Simulation
by: Qiu, Jinkai, et al.
Published: (2026)

Training Agents to Self-Report Misbehavior
by: Lee, Bruce W., et al.
Published: (2026)

Combee: Scaling Prompt Learning for Self-Improving Language Model Agents
by: Li, Hanchen, et al.
Published: (2026)

Large Language Models Can Self-Improve At Web Agent Tasks
by: Patel, Ajay, et al.
Published: (2024)

Recursive Introspection: Teaching Language Model Agents How to Self-Improve
by: Qu, Yuxiao, et al.
Published: (2024)

Differentially Private Reinforcement Learning with Self-Play
by: Qiao, Dan, et al.
Published: (2024)

AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement
by: Aggarwal, Pranjal, et al.
Published: (2024)

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
by: Liu, Bo, et al.
Published: (2025)

Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints
by: Qiao, Dan, et al.
Published: (2024)

Self-Improving Diffusion Models with Synthetic Data
by: Alemohammad, Sina, et al.
Published: (2024)

The Attacker in the Mirror: Breaking Self-Consistency in Safety via Anchored Bipolicy Self-Play
by: La Malfa, Gabriele, et al.
Published: (2026)

A Self-Evolving AI Agent System for Climate Science
by: Guo, Zijie, et al.
Published: (2025)