:: Library Catalog

Bibliographic Details
Main Authors:	Zhang, Simpson; Liu, Tennison; van der Schaar, Mihaela
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.20120

Similar Items

Truly Self-Improving Agents Require Intrinsic Metacognitive Learning
by: Liu, Tennison, et al.
Published: (2025)

Strategic Self-Improvement for Competitive Agents in AI Labour Markets
by: Chiu, Christopher, et al.
Published: (2025)

Hypothesis Hunting with Evolving Networks of Autonomous Scientific Agents
by: Liu, Tennison, et al.
Published: (2025)

Large Language Models to Enhance Bayesian Optimization
by: Liu, Tennison, et al.
Published: (2024)

Active Task Disambiguation with LLMs
by: Kobalczyk, Katarzyna, et al.
Published: (2025)

Unveiling the Power of Sparse Neural Networks for Feature Selection
by: Atashgahi, Zahra, et al.
Published: (2024)

Learning Reasoning Rewards from Expert Demonstrations with Inverse Reinforcement Learning
by: Fanconi, Claudio, et al.
Published: (2025)

CauSim: Scaling Causal Reasoning with Increasingly Complex Causal Simulators
by: Astorga, Nicolás, et al.
Published: (2026)

Machine Learning with Requirements: a Manifesto
by: Giunchiglia, Eleonora, et al.
Published: (2023)

Simulating Viva Voce Examinations to Evaluate Clinical Reasoning in Large Language Models
by: Chiu, Christopher, et al.
Published: (2025)

Not All Explanations for Deep Learning Phenomena Are Equally Valuable
by: Jeffares, Alan, et al.
Published: (2025)

Preference Learning for AI Alignment: a Causal Perspective
by: Kobalczyk, Katarzyna, et al.
Published: (2025)

Cascaded Language Models for Cost-effective Human-AI Decision-Making
by: Fanconi, Claudio, et al.
Published: (2025)

Active Timepoint Selection for Learning Measure-Valued Trajectories
by: Huynh, Nicolas, et al.
Published: (2026)

Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
by: Sun, Hao, et al.
Published: (2024)

Hyperparameter Trajectory Inference with Conditional Lagrangian Optimal Transport
by: Amad, Harry, et al.
Published: (2026)

Discovery of Hidden Miscalibration Regimes
by: Kobalczyk, Katarzyna, et al.
Published: (2026)

Language Bottleneck Models for Qualitative Knowledge State Modeling
by: Berthon, Antonin, et al.
Published: (2025)

Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities
by: Sun, Hao, et al.
Published: (2025)

GameTalk: Training LLMs for Strategic Conversation
by: Vendrell, Victor Conchello, et al.
Published: (2026)

Decision Tree Induction Through LLMs via Semantically-Aware Evolution
by: Liu, Tennison, et al.
Published: (2025)

Automatically Learning Hybrid Digital Twins of Dynamical Systems
by: Holt, Samuel, et al.
Published: (2024)

OpenReview Should be Protected and Leveraged as a Community Asset for Research in the Era of Large Language Models
by: Sun, Hao, et al.
Published: (2025)

Deep Learning Through A Telescoping Lens: A Simple Model Provides Empirical Insights On Grokking, Gradient Boosting & Beyond
by: Jeffares, Alan, et al.
Published: (2024)

Eliciting Numerical Predictive Distributions of LLMs Without Autoregression
by: Piskorz, Julianna, et al.
Published: (2026)

Timely Clinical Diagnosis through Active Test Selection
by: Estévez, Silas Ruhrberg, et al.
Published: (2025)

Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
by: Sun, Hao, et al.
Published: (2023)

Semantic-KG: Using Knowledge Graphs to Construct Benchmarks for Measuring Semantic Similarity
by: Wei, Qiyao, et al.
Published: (2025)

DC-Check: A Data-Centric AI checklist to guide the development of reliable machine learning systems
by: Seedat, Nabeel, et al.
Published: (2022)

Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimes
by: Seedat, Nabeel, et al.
Published: (2023)

Autoformulation of Mathematical Optimization Models Using LLMs
by: Astorga, Nicolás, et al.
Published: (2024)

You can't handle the (dirty) truth: Data-centric insights improve pseudo-labeling
by: Seedat, Nabeel, et al.
Published: (2024)

Causal Deep Learning
by: Berrevoets, Jeroen, et al.
Published: (2023)

Few-shot Steerable Alignment: Adapting Rewards and LLM Policies with Neural Processes
by: Kobalczyk, Katarzyna, et al.
Published: (2024)

Self-Healing Machine Learning: A Framework for Autonomous Adaptation in Real-World Environments
by: Rauba, Paulius, et al.
Published: (2024)

Time Series Diffusion in the Frequency Domain
by: Crabbé, Jonathan, et al.
Published: (2024)

The Synergy of LLMs & RL Unlocks Offline Learning of Generalizable Language-Conditioned Policies with Low-fidelity Data
by: Pouplin, Thomas, et al.
Published: (2024)

Meta-Learners for Partially-Identified Treatment Effects Across Multiple Environments
by: Schweisthal, Jonas, et al.
Published: (2024)

Defining Expertise: Applications to Treatment Effect Estimation
by: Hüyük, Alihan, et al.
Published: (2024)

TimeTok: Granularity-Controllable Time-Series Generation via Hierarchical Tokenization
by: Lee, Seokhyun, et al.
Published: (2026)