:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Bengio, Yoshua, Hinton, Geoffrey, Yao, Andrew, Song, Dawn, Abbeel, Pieter, Darrell, Trevor, Harari, Yuval Noah, Zhang, Ya-Qin, Xue, Lan, Shalev-Shwartz, Shai, Hadfield, Gillian, Clune, Jeff, Maharaj, Tegan, Hutter, Frank, Baydin, Atılım Güneş, McIlraith, Sheila, Gao, Qiqi, Acharya, Ashwin, Krueger, David, Dragan, Anca, Torr, Philip, Russell, Stuart, Kahneman, Daniel, Brauner, Jan, Mindermann, Sören
Format:	Preprint
Published:	2023
Subjects:	Computers and Society Artificial Intelligence Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2310.17688
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers
by: Lifshitz, Shalev, et al.
Published: (2025)

Closing the Gap Between SGP4 and High-Precision Propagation via Differentiable Programming
by: Acciarini, Giacomo, et al.
Published: (2024)

From Reasoning to Super-Intelligence: A Search-Theoretic Perspective
by: Shalev-Shwartz, Shai, et al.
Published: (2025)

STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
by: Lifshitz, Shalev, et al.
Published: (2023)

Second-Order Forward-Mode Automatic Differentiation for Optimization
by: Cobb, Adam D., et al.
Published: (2024)

Gaussian Processes for Probabilistic Estimates of Earthquake Ground Shaking: A 1-D Proof-of-Concept
by: Scivier, Sam A., et al.
Published: (2024)

Research Program: Theory of Learning in Dynamical Systems
by: Hazan, Elad, et al.
Published: (2025)

Contextual Plackett-Luce: An Efficient Neural Model for Probabilistic Sequence Selection under Ambiguity
by: Mizrachi, Noam, et al.
Published: (2026)

Untangling Lariats: Subgradient Following of Variationally Penalized Objectives
by: Mo, Kai-Chia, et al.
Published: (2024)

Cooperative Inverse Reinforcement Learning
by: Hadfield-Menell, Dylan, et al.
Published: (2016)

Now, Later, and Lasting: Ten Priorities for AI Research, Policy, and Practice
by: Horvitz, Eric, et al.
Published: (2024)

Pluralistic Alignment Over Time
by: Klassen, Toryn Q., et al.
Published: (2024)

Formal Methods Meet LLMs: Auditing, Monitoring, and Intervention for Compliance of Advanced AI Systems
by: Alamdari, Parand A., et al.
Published: (2026)

Why Hawks Win
by: Kahneman, Daniel
Published: (2007)

Probabilistic Forecasting of Radiation Exposure for Spaceflight
by: Gurav, Rutuja, et al.
Published: (2024)

Language Models For Generalised PDDL Planning: Synthesising Sound and Programmatic Policies
by: Chen, Dillon Z., et al.
Published: (2025)

Machine learning and information theory concepts towards an AI Mathematician
by: Bengio, Yoshua, et al.
Published: (2024)

Satisficing and Optimal Generalised Planning via Goal Regression (Extended Version)
by: Chen, Dillon Z., et al.
Published: (2025)

Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision Making
by: Alamdari, Parand A., et al.
Published: (2023)

Learning Bilevel Policies over Symbolic World Models for Long-Horizon Planning
by: Chen, Dillon Z., et al.
Published: (2026)

Baking Symmetry into GFlowNets
by: Ma, George, et al.
Published: (2024)

Being Considerate as a Pathway Towards Pluralistic Alignment for Agentic AI
by: Alamdari, Parand A., et al.
Published: (2024)

Forecasting the Ionosphere from Sparse GNSS Data with Temporal-Fusion Transformers
by: Acciarini, Giacomo, et al.
Published: (2025)

Better Training Data Attribution via Better Inverse Hessian-Vector Products
by: Wang, Andrew, et al.
Published: (2025)

Noticing the Watcher: LLM Agents Can Infer CoT Monitoring from Blocking Feedback
by: Jiralerspong, Thomas, et al.
Published: (2026)

Ground-Compose-Reinforce: Grounding Language in Agentic Behaviours using Limited Data
by: Li, Andrew C., et al.
Published: (2025)

Pushdown Reward Machines for Reinforcement Learning
by: Varricchione, Giovanni, et al.
Published: (2025)

Psicología de las preferencias
by: Kahneman, Daniel y Tversky, Amos
Published: (1982)

Beyond Predictive Algorithms in Child Welfare
by: Moon, Erina Seh-Young, et al.
Published: (2024)

Implicit meta-learning may lead language models to trust more reliable sources
by: Krasheninnikov, Dmitrii, et al.
Published: (2023)

Single-Frame Super-Resolution of Solar Magnetograms: Investigating Physics-Based Metrics & Losses
by: Jungbluth, Anna, et al.
Published: (2019)

A Foundation Model for the Solar Dynamics Observatory
by: Walsh, James, et al.
Published: (2024)

Reward Machines for Deep RL in Noisy and Uncertain Environments
by: Li, Andrew C., et al.
Published: (2024)

Gauss-Newton Unlearning for the LLM Era
by: McKinney, Lev, et al.
Published: (2026)

On Generalization for Generative Flow Networks
by: Krichel, Anas, et al.
Published: (2024)

Interventional Causal Representation Learning
by: Ahuja, Kartik, et al.
Published: (2022)

A Complexity-Based Theory of Compositionality
by: Elmoznino, Eric, et al.
Published: (2024)

Visual symbolic mechanisms: Emergent symbol processing in vision language models
by: Assouel, Rim, et al.
Published: (2025)

Relative Trajectory Balance is equivalent to Trust-PCL
by: Deleu, Tristan, et al.
Published: (2025)

Fast Monte Carlo Tree Diffusion: 100x Speedup via Parallel Sparse Planning
by: Yoon, Jaesik, et al.
Published: (2025)