| Main Authors: | Bengio, Yoshua, Hinton, Geoffrey, Yao, Andrew, Song, Dawn, Abbeel, Pieter, Darrell, Trevor, Harari, Yuval Noah, Zhang, Ya-Qin, Xue, Lan, Shalev-Shwartz, Shai, Hadfield, Gillian, Clune, Jeff, Maharaj, Tegan, Hutter, Frank, Baydin, Atılım Güneş, McIlraith, Sheila, Gao, Qiqi, Acharya, Ashwin, Krueger, David, Dragan, Anca, Torr, Philip, Russell, Stuart, Kahneman, Daniel, Brauner, Jan, Mindermann, Sören |
|---|---|
| Format: | Preprint |
| Published: | 2023 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2310.17688 |
Similar Items
Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers
by: Lifshitz, Shalev, et al.
Published: (2025)
Closing the Gap Between SGP4 and High-Precision Propagation via Differentiable Programming
by: Acciarini, Giacomo, et al.
Published: (2024)
From Reasoning to Super-Intelligence: A Search-Theoretic Perspective
by: Shalev-Shwartz, Shai, et al.
Published: (2025)
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
by: Lifshitz, Shalev, et al.
Published: (2023)
Second-Order Forward-Mode Automatic Differentiation for Optimization
by: Cobb, Adam D., et al.
Published: (2024)
Gaussian Processes for Probabilistic Estimates of Earthquake Ground Shaking: A 1-D Proof-of-Concept
by: Scivier, Sam A., et al.
Published: (2024)
Research Program: Theory of Learning in Dynamical Systems
by: Hazan, Elad, et al.
Published: (2025)
Contextual Plackett-Luce: An Efficient Neural Model for Probabilistic Sequence Selection under Ambiguity
by: Mizrachi, Noam, et al.
Published: (2026)
Untangling Lariats: Subgradient Following of Variationally Penalized Objectives
by: Mo, Kai-Chia, et al.
Published: (2024)
Cooperative Inverse Reinforcement Learning
by: Hadfield-Menell, Dylan, et al.
Published: (2016)
Now, Later, and Lasting: Ten Priorities for AI Research, Policy, and Practice
by: Horvitz, Eric, et al.
Published: (2024)
Pluralistic Alignment Over Time
by: Klassen, Toryn Q., et al.
Published: (2024)
Formal Methods Meet LLMs: Auditing, Monitoring, and Intervention for Compliance of Advanced AI Systems
by: Alamdari, Parand A., et al.
Published: (2026)
Why Hawks Win
by: Kahneman, Daniel
Published: (2007)
Probabilistic Forecasting of Radiation Exposure for Spaceflight
by: Gurav, Rutuja, et al.
Published: (2024)
Language Models For Generalised PDDL Planning: Synthesising Sound and Programmatic Policies
by: Chen, Dillon Z., et al.
Published: (2025)
Machine learning and information theory concepts towards an AI Mathematician
by: Bengio, Yoshua, et al.
Published: (2024)
Satisficing and Optimal Generalised Planning via Goal Regression (Extended Version)
by: Chen, Dillon Z., et al.
Published: (2025)
Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision Making
by: Alamdari, Parand A., et al.
Published: (2023)
Learning Bilevel Policies over Symbolic World Models for Long-Horizon Planning
by: Chen, Dillon Z., et al.
Published: (2026)
Baking Symmetry into GFlowNets
by: Ma, George, et al.
Published: (2024)
Being Considerate as a Pathway Towards Pluralistic Alignment for Agentic AI
by: Alamdari, Parand A., et al.
Published: (2024)
Forecasting the Ionosphere from Sparse GNSS Data with Temporal-Fusion Transformers
by: Acciarini, Giacomo, et al.
Published: (2025)
Better Training Data Attribution via Better Inverse Hessian-Vector Products
by: Wang, Andrew, et al.
Published: (2025)
Noticing the Watcher: LLM Agents Can Infer CoT Monitoring from Blocking Feedback
by: Jiralerspong, Thomas, et al.
Published: (2026)
Ground-Compose-Reinforce: Grounding Language in Agentic Behaviours using Limited Data
by: Li, Andrew C., et al.
Published: (2025)
Pushdown Reward Machines for Reinforcement Learning
by: Varricchione, Giovanni, et al.
Published: (2025)
Psicología de las preferencias
by: Kahneman, Daniel and Tversky, Amos
Published: (1982)
Beyond Predictive Algorithms in Child Welfare
by: Moon, Erina Seh-Young, et al.
Published: (2024)
Implicit meta-learning may lead language models to trust more reliable sources
by: Krasheninnikov, Dmitrii, et al.
Published: (2023)
Single-Frame Super-Resolution of Solar Magnetograms: Investigating Physics-Based Metrics & Losses
by: Jungbluth, Anna, et al.
Published: (2019)
A Foundation Model for the Solar Dynamics Observatory
by: Walsh, James, et al.
Published: (2024)
Reward Machines for Deep RL in Noisy and Uncertain Environments
by: Li, Andrew C., et al.
Published: (2024)
Gauss-Newton Unlearning for the LLM Era
by: McKinney, Lev, et al.
Published: (2026)
On Generalization for Generative Flow Networks
by: Krichel, Anas, et al.
Published: (2024)
Interventional Causal Representation Learning
by: Ahuja, Kartik, et al.
Published: (2022)
A Complexity-Based Theory of Compositionality
by: Elmoznino, Eric, et al.
Published: (2024)
Visual symbolic mechanisms: Emergent symbol processing in vision language models
by: Assouel, Rim, et al.
Published: (2025)
Relative Trajectory Balance is equivalent to Trust-PCL
by: Deleu, Tristan, et al.
Published: (2025)
Fast Monte Carlo Tree Diffusion: 100x Speedup via Parallel Sparse Planning
by: Yoon, Jaesik, et al.
Published: (2025)