| Main Authors: | Zhang, Simpson; Liu, Tennison; van der Schaar, Mihaela |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2505.20120 |
Similar Items
Truly Self-Improving Agents Require Intrinsic Metacognitive Learning
by: Liu, Tennison, et al.
Published: (2025)
Strategic Self-Improvement for Competitive Agents in AI Labour Markets
by: Chiu, Christopher, et al.
Published: (2025)
Hypothesis Hunting with Evolving Networks of Autonomous Scientific Agents
by: Liu, Tennison, et al.
Published: (2025)
Large Language Models to Enhance Bayesian Optimization
by: Liu, Tennison, et al.
Published: (2024)
Active Task Disambiguation with LLMs
by: Kobalczyk, Katarzyna, et al.
Published: (2025)
Unveiling the Power of Sparse Neural Networks for Feature Selection
by: Atashgahi, Zahra, et al.
Published: (2024)
Learning Reasoning Rewards from Expert Demonstrations with Inverse Reinforcement Learning
by: Fanconi, Claudio, et al.
Published: (2025)
CauSim: Scaling Causal Reasoning with Increasingly Complex Causal Simulators
by: Astorga, Nicolás, et al.
Published: (2026)
Machine Learning with Requirements: a Manifesto
by: Giunchiglia, Eleonora, et al.
Published: (2023)
Simulating Viva Voce Examinations to Evaluate Clinical Reasoning in Large Language Models
by: Chiu, Christopher, et al.
Published: (2025)
Not All Explanations for Deep Learning Phenomena Are Equally Valuable
by: Jeffares, Alan, et al.
Published: (2025)
Preference Learning for AI Alignment: a Causal Perspective
by: Kobalczyk, Katarzyna, et al.
Published: (2025)
Cascaded Language Models for Cost-effective Human-AI Decision-Making
by: Fanconi, Claudio, et al.
Published: (2025)
Active Timepoint Selection for Learning Measure-Valued Trajectories
by: Huynh, Nicolas, et al.
Published: (2026)
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
by: Sun, Hao, et al.
Published: (2024)
Hyperparameter Trajectory Inference with Conditional Lagrangian Optimal Transport
by: Amad, Harry, et al.
Published: (2026)
Discovery of Hidden Miscalibration Regimes
by: Kobalczyk, Katarzyna, et al.
Published: (2026)
Language Bottleneck Models for Qualitative Knowledge State Modeling
by: Berthon, Antonin, et al.
Published: (2025)
Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities
by: Sun, Hao, et al.
Published: (2025)
GameTalk: Training LLMs for Strategic Conversation
by: Vendrell, Victor Conchello, et al.
Published: (2026)
Decision Tree Induction Through LLMs via Semantically-Aware Evolution
by: Liu, Tennison, et al.
Published: (2025)
Automatically Learning Hybrid Digital Twins of Dynamical Systems
by: Holt, Samuel, et al.
Published: (2024)
OpenReview Should be Protected and Leveraged as a Community Asset for Research in the Era of Large Language Models
by: Sun, Hao, et al.
Published: (2025)
Deep Learning Through A Telescoping Lens: A Simple Model Provides Empirical Insights On Grokking, Gradient Boosting & Beyond
by: Jeffares, Alan, et al.
Published: (2024)
Eliciting Numerical Predictive Distributions of LLMs Without Autoregression
by: Piskorz, Julianna, et al.
Published: (2026)
Timely Clinical Diagnosis through Active Test Selection
by: Estévez, Silas Ruhrberg, et al.
Published: (2025)
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
by: Sun, Hao, et al.
Published: (2023)
Semantic-KG: Using Knowledge Graphs to Construct Benchmarks for Measuring Semantic Similarity
by: Wei, Qiyao, et al.
Published: (2025)
DC-Check: A Data-Centric AI checklist to guide the development of reliable machine learning systems
by: Seedat, Nabeel, et al.
Published: (2022)
Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimes
by: Seedat, Nabeel, et al.
Published: (2023)
Autoformulation of Mathematical Optimization Models Using LLMs
by: Astorga, Nicolás, et al.
Published: (2024)
You can't handle the (dirty) truth: Data-centric insights improve pseudo-labeling
by: Seedat, Nabeel, et al.
Published: (2024)
Causal Deep Learning
by: Berrevoets, Jeroen, et al.
Published: (2023)
Few-shot Steerable Alignment: Adapting Rewards and LLM Policies with Neural Processes
by: Kobalczyk, Katarzyna, et al.
Published: (2024)
Self-Healing Machine Learning: A Framework for Autonomous Adaptation in Real-World Environments
by: Rauba, Paulius, et al.
Published: (2024)
Time Series Diffusion in the Frequency Domain
by: Crabbé, Jonathan, et al.
Published: (2024)
The Synergy of LLMs & RL Unlocks Offline Learning of Generalizable Language-Conditioned Policies with Low-fidelity Data
by: Pouplin, Thomas, et al.
Published: (2024)
Meta-Learners for Partially-Identified Treatment Effects Across Multiple Environments
by: Schweisthal, Jonas, et al.
Published: (2024)
Defining Expertise: Applications to Treatment Effect Estimation
by: Hüyük, Alihan, et al.
Published: (2024)
TimeTok: Granularity-Controllable Time-Series Generation via Hierarchical Tokenization
by: Lee, Seokhyun, et al.
Published: (2026)