:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Saveliev, Evgeny S., Holt, Samuel, Seedat, Nabeel, Bentley, David L., Weatherall, Jim, van der Schaar, Mihaela
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.29184
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Towards Human-Guided, Data-Centric LLM Co-Pilots
by: Saveliev, Evgeny, et al.
Published: (2025)

Matchmaker: Self-Improving Large Language Model Programs for Schema Matching
by: Seedat, Nabeel, et al.
Published: (2024)

Deep Generative Symbolic Regression
by: Holt, Samuel, et al.
Published: (2023)

Relaxed Quantile Regression: Prediction Intervals for Asymmetric Noise
by: Pouplin, Thomas, et al.
Published: (2024)

Dissecting Sample Hardness: A Fine-Grained Analysis of Hardness Characterization Methods for Data-Centric AI
by: Seedat, Nabeel, et al.
Published: (2024)

DC-Check: A Data-Centric AI checklist to guide the development of reliable machine learning systems
by: Seedat, Nabeel, et al.
Published: (2022)

What's the next frontier for Data-centric AI? Data Savvy Agents
by: Seedat, Nabeel, et al.
Published: (2025)

Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimes
by: Seedat, Nabeel, et al.
Published: (2023)

Unlocking Historical Clinical Trial Data with ALIGN: A Compositional Large Language Model System for Medical Coding
by: Seedat, Nabeel, et al.
Published: (2024)

Knowledge-Informed Kernel State Reconstruction from Heterogeneous Partial Observations
by: Muscarnera, Luca, et al.
Published: (2026)

You can't handle the (dirty) truth: Data-centric insights improve pseudo-labeling
by: Seedat, Nabeel, et al.
Published: (2024)

Large Language Models to Enhance Bayesian Optimization
by: Liu, Tennison, et al.
Published: (2024)

Self-Healing Machine Learning: A Framework for Autonomous Adaptation in Real-World Environments
by: Rauba, Paulius, et al.
Published: (2024)

Shape Arithmetic Expressions: Advancing Scientific Discovery Beyond Closed-Form Equations
by: Kacprzyk, Krzysztof, et al.
Published: (2024)

Context-Aware Testing: A New Paradigm for Model Testing with Large Language Models
by: Rauba, Paulius, et al.
Published: (2024)

Improving LLM Agent Planning with In-Context Learning via Atomic Fact Augmentation and Lookahead Search
by: Holt, Samuel, et al.
Published: (2025)

When is Off-Policy Evaluation (Reward Modeling) Useful in Contextual Bandits? A Data-Centric Perspective
by: Sun, Hao, et al.
Published: (2023)

Technical Report: Facilitating the Adoption of Causal Inference Methods Through LLM-Empowered Co-Pilot
by: Berrevoets, Jeroen, et al.
Published: (2025)

Dense Reward for Free in Reinforcement Learning from Human Feedback
by: Chan, Alex J., et al.
Published: (2024)

DAGnosis: Localized Identification of Data Inconsistencies using Structures
by: Huynh, Nicolas, et al.
Published: (2024)

Automatically Learning Hybrid Digital Twins of Dynamical Systems
by: Holt, Samuel, et al.
Published: (2024)

CliMB: An AI-enabled Partner for Clinical Predictive Modeling
by: Saveliev, Evgeny, et al.
Published: (2024)

ODE Discovery for Longitudinal Heterogeneous Treatment Effects Inference
by: Kacprzyk, Krzysztof, et al.
Published: (2024)

Hypothesis Hunting with Evolving Networks of Autonomous Scientific Agents
by: Liu, Tennison, et al.
Published: (2025)

L2MAC: Large Language Model Automatic Computer for Extensive Code Generation
by: Holt, Samuel, et al.
Published: (2023)

Discovery of Hidden Miscalibration Regimes
by: Kobalczyk, Katarzyna, et al.
Published: (2026)

Guideline-Grounded Evidence Accumulation for High-Stakes Agent Verification
by: Zhang, Yichi, et al.
Published: (2026)

Retrieval Augmented Thought Process for Private Data Handling in Healthcare
by: Pouplin, Thomas, et al.
Published: (2024)

No Equations Needed: Learning System Dynamics Without Relying on Closed-Form ODEs
by: Kacprzyk, Krzysztof, et al.
Published: (2025)

G-Sim: Generative Simulations with Large Language Models and Gradient-Free Calibration
by: Holt, Samuel, et al.
Published: (2025)

The AI Imperative: Scaling High-Quality Peer Review in Machine Learning
by: Wei, Qiyao, et al.
Published: (2025)

The Reasoning-Creativity Trade-off: Toward Creativity-Driven Problem Solving
by: Luyten, Max Ruiz, et al.
Published: (2026)

Prior-Guided Symbolic Regression: Towards Scientific Consistency in Equation Discovery
by: Xiao, Jing, et al.
Published: (2026)

Using Machine Learning to Individualize Treatment Effect Estimation: Challenges and Opportunities
by: Alicia Curth, et al.
Published: (2024)

A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring
by: Anwar, Usman, et al.
Published: (2026)

Interpretable DNA Sequence Classification via Dynamic Feature Generation in Decision Trees
by: Huynh, Nicolas, et al.
Published: (2026)

TimeTok: Granularity-Controllable Time-Series Generation via Hierarchical Tokenization
by: Lee, Seokhyun, et al.
Published: (2026)

Not All Explanations for Deep Learning Phenomena Are Equally Valuable
by: Jeffares, Alan, et al.
Published: (2025)

Language Bottleneck Models for Qualitative Knowledge State Modeling
by: Berthon, Antonin, et al.
Published: (2025)

Preference Learning for AI Alignment: a Causal Perspective
by: Kobalczyk, Katarzyna, et al.
Published: (2025)