Saved in:
| Main Authors: | Paglieri, Davide, Cupiał, Bartłomiej, Coward, Samuel, Piterbarg, Ulyana, Wolczyk, Maciej, Khan, Akbir, Pignatelli, Eduardo, Kuciński, Łukasz, Pinto, Lerrel, Fergus, Rob, Foerster, Jakob Nicolaus, Parker-Holder, Jack, Rocktäschel, Tim |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.13543 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents
by: Paglieri, Davide, et al.
Published: (2025)
by: Paglieri, Davide, et al.
Published: (2025)
diff History for Neural Language Agents
by: Piterbarg, Ulyana, et al.
Published: (2023)
by: Piterbarg, Ulyana, et al.
Published: (2023)
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis
by: Piterbarg, Ulyana, et al.
Published: (2024)
by: Piterbarg, Ulyana, et al.
Published: (2024)
Outliers and Calibration Sets have Diminishing Effect on Quantization of Modern LLMs
by: Paglieri, Davide, et al.
Published: (2024)
by: Paglieri, Davide, et al.
Published: (2024)
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
by: Wołczyk, Maciej, et al.
Published: (2024)
by: Wołczyk, Maciej, et al.
Published: (2024)
Multi-Agent Diagnostics for Robustness via Illuminated Diversity
by: Samvelyan, Mikayel, et al.
Published: (2024)
by: Samvelyan, Mikayel, et al.
Published: (2024)
Programming by Backprop: An Instruction is Worth 100 Examples When Finetuning LLMs
by: Cook, Jonathan, et al.
Published: (2025)
by: Cook, Jonathan, et al.
Published: (2025)
DéjàQ: Open-Ended Evolution of Diverse, Learnable and Verifiable Problems
by: Röpke, Willem, et al.
Published: (2026)
by: Röpke, Willem, et al.
Published: (2026)
Imagined Autocurricula
by: Güzel, Ahmet H., et al.
Published: (2025)
by: Güzel, Ahmet H., et al.
Published: (2025)
Scaling Opponent Shaping to High Dimensional Games
by: Khan, Akbir, et al.
Published: (2023)
by: Khan, Akbir, et al.
Published: (2023)
Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RL
by: Pignatelli, Eduardo, et al.
Published: (2024)
by: Pignatelli, Eduardo, et al.
Published: (2024)
Preference-Based Alignment of Discrete Diffusion Models
by: Borso, Umberto, et al.
Published: (2025)
by: Borso, Umberto, et al.
Published: (2025)
JaxUED: A simple and useable UED library in Jax
by: Coward, Samuel, et al.
Published: (2024)
by: Coward, Samuel, et al.
Published: (2024)
Learning Multi-Agent Coordination via Sheaf-ADMM
by: Seely, Jeffrey, et al.
Published: (2026)
by: Seely, Jeffrey, et al.
Published: (2026)
GUIDE: Guidance-based Incremental Learning with Diffusion Models
by: Cywiński, Bartosz, et al.
Published: (2024)
by: Cywiński, Bartosz, et al.
Published: (2024)
TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation
by: Cook, Jonathan, et al.
Published: (2024)
by: Cook, Jonathan, et al.
Published: (2024)
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
by: Samvelyan, Mikayel, et al.
Published: (2024)
by: Samvelyan, Mikayel, et al.
Published: (2024)
There and Back Again: On the relation between Noise and Image Inversions in Diffusion Models
by: Staniszewski, Łukasz, et al.
Published: (2024)
by: Staniszewski, Łukasz, et al.
Published: (2024)
Open-Endedness is Essential for Artificial Superhuman Intelligence
by: Hughes, Edward, et al.
Published: (2024)
by: Hughes, Edward, et al.
Published: (2024)
Kendall-Cancer-Lab/PAX3-FOXO1_invivo_6hpf: code for publication
by: Jack Kucinski, et al.
Published: (2025)
by: Jack Kucinski, et al.
Published: (2025)
Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation
by: Haldar, Siddhant, et al.
Published: (2025)
by: Haldar, Siddhant, et al.
Published: (2025)
Factorio Learning Environment
by: Hopkins, Jack, et al.
Published: (2025)
by: Hopkins, Jack, et al.
Published: (2025)
Seeing Through Their Eyes: Evaluating Visual Perspective Taking in Vision Language Models
by: Góral, Gracjan, et al.
Published: (2024)
by: Góral, Gracjan, et al.
Published: (2024)
State Soup: In-Context Skill Learning, Retrieval and Mixing
by: Pióro, Maciej, et al.
Published: (2024)
by: Pióro, Maciej, et al.
Published: (2024)
JaxLife: An Open-Ended Agentic Simulator
by: Lu, Chris, et al.
Published: (2024)
by: Lu, Chris, et al.
Published: (2024)
Low regularity potentials in heterogeneous Cahn--Hilliard functionals
by: Cristoferi, Riccardo, et al.
Published: (2026)
by: Cristoferi, Riccardo, et al.
Published: (2026)
Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
by: Güzel, Ahmet H., et al.
Published: (2025)
by: Güzel, Ahmet H., et al.
Published: (2025)
Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication
by: Kuciński, Łukasz, et al.
Published: (2021)
by: Kuciński, Łukasz, et al.
Published: (2021)
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning
by: Matthews, Michael, et al.
Published: (2024)
by: Matthews, Michael, et al.
Published: (2024)
Refining Minimax Regret for Unsupervised Environment Design
by: Beukman, Michael, et al.
Published: (2024)
by: Beukman, Michael, et al.
Published: (2024)
BAKU: An Efficient Transformer for Multi-Task Policy Learning
by: Haldar, Siddhant, et al.
Published: (2024)
by: Haldar, Siddhant, et al.
Published: (2024)
Stochastic Video Generation with a Learned Prior
by: Denton, Remi, et al.
Published: (2018)
by: Denton, Remi, et al.
Published: (2018)
An evaluation of the BALROG and RoboBA algorithms for determining the position of Fermi/GBM GRBs
by: López, K. Océlotl. C., et al.
Published: (2024)
by: López, K. Océlotl. C., et al.
Published: (2024)
Jornalismo, saúde e cidadania
by: Bernardo Kucinski
Published: (2000)
by: Bernardo Kucinski
Published: (2000)
Synthesis of Alkynylsilanes: A Review of the State of the Art
by: Krzysztof Kuciński
Published: (2024)
by: Krzysztof Kuciński
Published: (2024)
RapidDock: Unlocking Proteome-scale Molecular Docking
by: Powalski, Rafał, et al.
Published: (2024)
by: Powalski, Rafał, et al.
Published: (2024)
Una aproximación empírica al análisis de las percepciones del consumidor sobre el envase
by: Paola Pignatelli
Published: (2020)
by: Paola Pignatelli
Published: (2020)
Antropologia em Portugal nos últimos 50 anos: introdução
by: Marina Pignatelli
Published: (2014)
by: Marina Pignatelli
Published: (2014)
Simple formulas of π in terms of ϕ
by: Pignatelli, Angelo
Published: (2024)
by: Pignatelli, Angelo
Published: (2024)
On canonical threefolds near the Noether line
by: Pignatelli, Roberto
Published: (2024)
by: Pignatelli, Roberto
Published: (2024)
Similar Items
-
Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents
by: Paglieri, Davide, et al.
Published: (2025) -
diff History for Neural Language Agents
by: Piterbarg, Ulyana, et al.
Published: (2023) -
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis
by: Piterbarg, Ulyana, et al.
Published: (2024) -
Outliers and Calibration Sets have Diminishing Effect on Quantization of Modern LLMs
by: Paglieri, Davide, et al.
Published: (2024) -
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
by: Wołczyk, Maciej, et al.
Published: (2024)