:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Paglieri, Davide, Cupiał, Bartłomiej, Coward, Samuel, Piterbarg, Ulyana, Wolczyk, Maciej, Khan, Akbir, Pignatelli, Eduardo, Kuciński, Łukasz, Pinto, Lerrel, Fergus, Rob, Foerster, Jakob Nicolaus, Parker-Holder, Jack, Rocktäschel, Tim
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2411.13543
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents
by: Paglieri, Davide, et al.
Published: (2025)

diff History for Neural Language Agents
by: Piterbarg, Ulyana, et al.
Published: (2023)

Training Language Models on Synthetic Edit Sequences Improves Code Synthesis
by: Piterbarg, Ulyana, et al.
Published: (2024)

Outliers and Calibration Sets have Diminishing Effect on Quantization of Modern LLMs
by: Paglieri, Davide, et al.
Published: (2024)

Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
by: Wołczyk, Maciej, et al.
Published: (2024)

Multi-Agent Diagnostics for Robustness via Illuminated Diversity
by: Samvelyan, Mikayel, et al.
Published: (2024)

Programming by Backprop: An Instruction is Worth 100 Examples When Finetuning LLMs
by: Cook, Jonathan, et al.
Published: (2025)

DéjàQ: Open-Ended Evolution of Diverse, Learnable and Verifiable Problems
by: Röpke, Willem, et al.
Published: (2026)

Imagined Autocurricula
by: Güzel, Ahmet H., et al.
Published: (2025)

Scaling Opponent Shaping to High Dimensional Games
by: Khan, Akbir, et al.
Published: (2023)

Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RL
by: Pignatelli, Eduardo, et al.
Published: (2024)

Preference-Based Alignment of Discrete Diffusion Models
by: Borso, Umberto, et al.
Published: (2025)

JaxUED: A simple and useable UED library in Jax
by: Coward, Samuel, et al.
Published: (2024)

Learning Multi-Agent Coordination via Sheaf-ADMM
by: Seely, Jeffrey, et al.
Published: (2026)

GUIDE: Guidance-based Incremental Learning with Diffusion Models
by: Cywiński, Bartosz, et al.
Published: (2024)

TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation
by: Cook, Jonathan, et al.
Published: (2024)

Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
by: Samvelyan, Mikayel, et al.
Published: (2024)

There and Back Again: On the relation between Noise and Image Inversions in Diffusion Models
by: Staniszewski, Łukasz, et al.
Published: (2024)

Open-Endedness is Essential for Artificial Superhuman Intelligence
by: Hughes, Edward, et al.
Published: (2024)

Kendall-Cancer-Lab/PAX3-FOXO1_invivo_6hpf: code for publication
by: Jack Kucinski, et al.
Published: (2025)

Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation
by: Haldar, Siddhant, et al.
Published: (2025)

Factorio Learning Environment
by: Hopkins, Jack, et al.
Published: (2025)

Seeing Through Their Eyes: Evaluating Visual Perspective Taking in Vision Language Models
by: Góral, Gracjan, et al.
Published: (2024)

State Soup: In-Context Skill Learning, Retrieval and Mixing
by: Pióro, Maciej, et al.
Published: (2024)

JaxLife: An Open-Ended Agentic Simulator
by: Lu, Chris, et al.
Published: (2024)

Low regularity potentials in heterogeneous Cahn--Hilliard functionals
by: Cristoferi, Riccardo, et al.
Published: (2026)

Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
by: Güzel, Ahmet H., et al.
Published: (2025)

Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication
by: Kuciński, Łukasz, et al.
Published: (2021)

Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning
by: Matthews, Michael, et al.
Published: (2024)

Refining Minimax Regret for Unsupervised Environment Design
by: Beukman, Michael, et al.
Published: (2024)

BAKU: An Efficient Transformer for Multi-Task Policy Learning
by: Haldar, Siddhant, et al.
Published: (2024)

Stochastic Video Generation with a Learned Prior
by: Denton, Remi, et al.
Published: (2018)

An evaluation of the BALROG and RoboBA algorithms for determining the position of Fermi/GBM GRBs
by: López, K. Océlotl. C., et al.
Published: (2024)

Jornalismo, saúde e cidadania
by: Bernardo Kucinski
Published: (2000)

Synthesis of Alkynylsilanes: A Review of the State of the Art
by: Krzysztof Kuciński
Published: (2024)

RapidDock: Unlocking Proteome-scale Molecular Docking
by: Powalski, Rafał, et al.
Published: (2024)

Una aproximación empírica al análisis de las percepciones del consumidor sobre el envase
by: Paola Pignatelli
Published: (2020)

Antropologia em Portugal nos últimos 50 anos: introdução
by: Marina Pignatelli
Published: (2014)

Simple formulas of π in terms of ϕ
by: Pignatelli, Angelo
Published: (2024)

On canonical threefolds near the Noether line
by: Pignatelli, Roberto
Published: (2024)