| Main Authors: | Saveliev, Evgeny S., Holt, Samuel, Seedat, Nabeel, Bentley, David L., Weatherall, Jim, van der Schaar, Mihaela |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2605.29184 |
Similar Items
Towards Human-Guided, Data-Centric LLM Co-Pilots
by: Saveliev, Evgeny, et al.
Published: (2025)
Matchmaker: Self-Improving Large Language Model Programs for Schema Matching
by: Seedat, Nabeel, et al.
Published: (2024)
Deep Generative Symbolic Regression
by: Holt, Samuel, et al.
Published: (2023)
Relaxed Quantile Regression: Prediction Intervals for Asymmetric Noise
by: Pouplin, Thomas, et al.
Published: (2024)
Dissecting Sample Hardness: A Fine-Grained Analysis of Hardness Characterization Methods for Data-Centric AI
by: Seedat, Nabeel, et al.
Published: (2024)
DC-Check: A Data-Centric AI checklist to guide the development of reliable machine learning systems
by: Seedat, Nabeel, et al.
Published: (2022)
What's the next frontier for Data-centric AI? Data Savvy Agents
by: Seedat, Nabeel, et al.
Published: (2025)
Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimes
by: Seedat, Nabeel, et al.
Published: (2023)
Unlocking Historical Clinical Trial Data with ALIGN: A Compositional Large Language Model System for Medical Coding
by: Seedat, Nabeel, et al.
Published: (2024)
Knowledge-Informed Kernel State Reconstruction from Heterogeneous Partial Observations
by: Muscarnera, Luca, et al.
Published: (2026)
You can't handle the (dirty) truth: Data-centric insights improve pseudo-labeling
by: Seedat, Nabeel, et al.
Published: (2024)
Large Language Models to Enhance Bayesian Optimization
by: Liu, Tennison, et al.
Published: (2024)
Self-Healing Machine Learning: A Framework for Autonomous Adaptation in Real-World Environments
by: Rauba, Paulius, et al.
Published: (2024)
Shape Arithmetic Expressions: Advancing Scientific Discovery Beyond Closed-Form Equations
by: Kacprzyk, Krzysztof, et al.
Published: (2024)
Context-Aware Testing: A New Paradigm for Model Testing with Large Language Models
by: Rauba, Paulius, et al.
Published: (2024)
Improving LLM Agent Planning with In-Context Learning via Atomic Fact Augmentation and Lookahead Search
by: Holt, Samuel, et al.
Published: (2025)
When is Off-Policy Evaluation (Reward Modeling) Useful in Contextual Bandits? A Data-Centric Perspective
by: Sun, Hao, et al.
Published: (2023)
Technical Report: Facilitating the Adoption of Causal Inference Methods Through LLM-Empowered Co-Pilot
by: Berrevoets, Jeroen, et al.
Published: (2025)
Dense Reward for Free in Reinforcement Learning from Human Feedback
by: Chan, Alex J., et al.
Published: (2024)
DAGnosis: Localized Identification of Data Inconsistencies using Structures
by: Huynh, Nicolas, et al.
Published: (2024)
Automatically Learning Hybrid Digital Twins of Dynamical Systems
by: Holt, Samuel, et al.
Published: (2024)
CliMB: An AI-enabled Partner for Clinical Predictive Modeling
by: Saveliev, Evgeny, et al.
Published: (2024)
ODE Discovery for Longitudinal Heterogeneous Treatment Effects Inference
by: Kacprzyk, Krzysztof, et al.
Published: (2024)
Hypothesis Hunting with Evolving Networks of Autonomous Scientific Agents
by: Liu, Tennison, et al.
Published: (2025)
L2MAC: Large Language Model Automatic Computer for Extensive Code Generation
by: Holt, Samuel, et al.
Published: (2023)
Discovery of Hidden Miscalibration Regimes
by: Kobalczyk, Katarzyna, et al.
Published: (2026)
Guideline-Grounded Evidence Accumulation for High-Stakes Agent Verification
by: Zhang, Yichi, et al.
Published: (2026)
Retrieval Augmented Thought Process for Private Data Handling in Healthcare
by: Pouplin, Thomas, et al.
Published: (2024)
No Equations Needed: Learning System Dynamics Without Relying on Closed-Form ODEs
by: Kacprzyk, Krzysztof, et al.
Published: (2025)
G-Sim: Generative Simulations with Large Language Models and Gradient-Free Calibration
by: Holt, Samuel, et al.
Published: (2025)
The AI Imperative: Scaling High-Quality Peer Review in Machine Learning
by: Wei, Qiyao, et al.
Published: (2025)
The Reasoning-Creativity Trade-off: Toward Creativity-Driven Problem Solving
by: Luyten, Max Ruiz, et al.
Published: (2026)
Prior-Guided Symbolic Regression: Towards Scientific Consistency in Equation Discovery
by: Xiao, Jing, et al.
Published: (2026)
Using Machine Learning to Individualize Treatment Effect Estimation: Challenges and Opportunities
by: Curth, Alicia, et al.
Published: (2024)
A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring
by: Anwar, Usman, et al.
Published: (2026)
Interpretable DNA Sequence Classification via Dynamic Feature Generation in Decision Trees
by: Huynh, Nicolas, et al.
Published: (2026)
TimeTok: Granularity-Controllable Time-Series Generation via Hierarchical Tokenization
by: Lee, Seokhyun, et al.
Published: (2026)
Not All Explanations for Deep Learning Phenomena Are Equally Valuable
by: Jeffares, Alan, et al.
Published: (2025)
Language Bottleneck Models for Qualitative Knowledge State Modeling
by: Berthon, Antonin, et al.
Published: (2025)
Preference Learning for AI Alignment: a Causal Perspective
by: Kobalczyk, Katarzyna, et al.
Published: (2025)