:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Huang, Tzu-Heng, Bilkhu, Manjot, Cooper, John, Sala, Frederic, Movellan, Javier
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2501.06708
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning
by: Huang, Tzu-Heng, et al.
Published: (2026)

MixAtlas: Uncertainty-aware Data Mixture Optimization for Multimodal LLM Midtraining
by: Wen, Bingbing, et al.
Published: (2026)

Time To Impeach LLM-as-a-Judge: Programs are the Future of Evaluation
by: Huang, Tzu-Heng, et al.
Published: (2025)

The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators
by: Huang, Tzu-Heng, et al.
Published: (2024)

Weak-to-Strong Generalization Through the Data-Centric Lens
by: Shin, Changho, et al.
Published: (2024)

R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training
by: Ge, Albert, et al.
Published: (2025)

MoRe Fine-Tuning with 10x Fewer Parameters
by: Tan, Wenxuan, et al.
Published: (2024)

CARE: Confounder-Aware Aggregation for Reliable LLM Evaluation
by: Zhao, Jitian, et al.
Published: (2026)

Efficient Data Selection for Multimodal Models via Incremental Optimization Utility
by: Jing, Jinhao, et al.
Published: (2026)

Zero-Shot Robustification of Zero-Shot Models
by: Adila, Dyah, et al.
Published: (2023)

Learning with Imbalanced Noisy Data by Preventing Bias in Sample Selection
by: Liu, Huafeng, et al.
Published: (2024)

COSMOS: Predictable and Cost-Effective Adaptation of LLMs
by: Wang, Jiayu, et al.
Published: (2025)

Efficient Utility-Preserving Machine Unlearning with Implicit Gradient Surgery
by: Zhou, Shiji, et al.
Published: (2025)

OTTER: Effortless Label Distribution Adaptation of Zero-shot Models
by: Shin, Changho, et al.
Published: (2024)

Quantifying Structure in CLIP Embeddings: A Statistical Framework for Concept Interpretation
by: Zhao, Jitian, et al.
Published: (2025)

DISCO: Diversifying Sample Condensation for Efficient Model Evaluation
by: Rubinstein, Alexander, et al.
Published: (2025)

Learning from Less: Measuring the Effectiveness of RLVR in Low Data and Compute Regimes
by: Bauer, Justin, et al.
Published: (2026)

Promises and Pitfalls of Threshold-based Auto-labeling
by: Vishwakarma, Harit, et al.
Published: (2022)

Learning Multi-Indicator Weights for Data Selection: A Joint Task-Model Adaptation Framework with Efficient Proxies
by: Song, Jingze, et al.
Published: (2026)

On Evaluation Protocols for Data Augmentation in a Limited Data Scenario
by: Piedboeuf, Frédéric, et al.
Published: (2024)

Data-Efficient Training by Evolved Sampling
by: Cheng, Ziheng, et al.
Published: (2025)

When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
by: Liu, Vincent, et al.
Published: (2023)

SkillOrchestra: Learning to Route Agents via Skill Transfer
by: Wang, Jiayu, et al.
Published: (2026)

Max-Utility Based Arm Selection Strategy For Sequential Query Recommendations
by: Parambath, Shameem A. Puthiya, et al.
Published: (2021)

On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond
by: Nguyen-Tang, Thanh, et al.
Published: (2024)

Sampling and Loss Weights in Multi-Domain Training
by: Salmani, Mahdi, et al.
Published: (2025)

DaMoC: Efficiently Selecting the Optimal Large Language Model for Fine-tuning Domain Tasks Based on Data and Model Compression
by: Huang, Wei, et al.
Published: (2025)

Model-Agnostic Dynamic Feature Selection with Uncertainty Quantification
by: Fumanal-Idocin, Javier, et al.
Published: (2025)

Joint Optimization of Resource Allocation and Data Selection for Fast and Cost-Efficient Federated Edge Learning
by: Jia, Yunjian, et al.
Published: (2024)

UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning
by: Patil, Vaidehi, et al.
Published: (2025)

Know When to Abstain: Optimal Selective Classification with Likelihood Ratios
by: Heng, Alvin, et al.
Published: (2025)

Evaluating Causal Discovery Algorithms for Path-Specific Fairness and Utility in Healthcare
by: Nagesh, Nitish, et al.
Published: (2026)

Fair Class-Incremental Learning using Sample Weighting
by: Park, Jaeyoung, et al.
Published: (2024)

Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol
by: Liu, Pai, et al.
Published: (2025)

COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism
by: He, Jianing, et al.
Published: (2024)

Gradient Weight-normalized Low-rank Projection for Efficient LLM Training
by: Huang, Jia-Hong, et al.
Published: (2024)

Preference Models assume Proportional Hazards of Utilities
by: Nagpal, Chirag
Published: (2025)

Filter-then-Weight: Online Data Selection and Reweighting for LLM Fine-Tuning
by: Wang, Fangxin, et al.
Published: (2026)

An Interpretable and Scalable Framework for Evaluating Large Language Models
by: Qu, Xinhao, et al.
Published: (2026)

Training Data Selection with Gradient Orthogonality for Efficient Domain Adaptation
by: Zhang, Xiyang, et al.
Published: (2026)