Saved in:
| Main Authors: | Huang, Tzu-Heng, Bilkhu, Manjot, Cooper, John, Sala, Frederic, Movellan, Javier |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.06708 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning
by: Huang, Tzu-Heng, et al.
Published: (2026)
by: Huang, Tzu-Heng, et al.
Published: (2026)
MixAtlas: Uncertainty-aware Data Mixture Optimization for Multimodal LLM Midtraining
by: Wen, Bingbing, et al.
Published: (2026)
by: Wen, Bingbing, et al.
Published: (2026)
Time To Impeach LLM-as-a-Judge: Programs are the Future of Evaluation
by: Huang, Tzu-Heng, et al.
Published: (2025)
by: Huang, Tzu-Heng, et al.
Published: (2025)
The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators
by: Huang, Tzu-Heng, et al.
Published: (2024)
by: Huang, Tzu-Heng, et al.
Published: (2024)
Weak-to-Strong Generalization Through the Data-Centric Lens
by: Shin, Changho, et al.
Published: (2024)
by: Shin, Changho, et al.
Published: (2024)
R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training
by: Ge, Albert, et al.
Published: (2025)
by: Ge, Albert, et al.
Published: (2025)
MoRe Fine-Tuning with 10x Fewer Parameters
by: Tan, Wenxuan, et al.
Published: (2024)
by: Tan, Wenxuan, et al.
Published: (2024)
CARE: Confounder-Aware Aggregation for Reliable LLM Evaluation
by: Zhao, Jitian, et al.
Published: (2026)
by: Zhao, Jitian, et al.
Published: (2026)
Efficient Data Selection for Multimodal Models via Incremental Optimization Utility
by: Jing, Jinhao, et al.
Published: (2026)
by: Jing, Jinhao, et al.
Published: (2026)
Zero-Shot Robustification of Zero-Shot Models
by: Adila, Dyah, et al.
Published: (2023)
by: Adila, Dyah, et al.
Published: (2023)
Learning with Imbalanced Noisy Data by Preventing Bias in Sample Selection
by: Liu, Huafeng, et al.
Published: (2024)
by: Liu, Huafeng, et al.
Published: (2024)
COSMOS: Predictable and Cost-Effective Adaptation of LLMs
by: Wang, Jiayu, et al.
Published: (2025)
by: Wang, Jiayu, et al.
Published: (2025)
Efficient Utility-Preserving Machine Unlearning with Implicit Gradient Surgery
by: Zhou, Shiji, et al.
Published: (2025)
by: Zhou, Shiji, et al.
Published: (2025)
OTTER: Effortless Label Distribution Adaptation of Zero-shot Models
by: Shin, Changho, et al.
Published: (2024)
by: Shin, Changho, et al.
Published: (2024)
Quantifying Structure in CLIP Embeddings: A Statistical Framework for Concept Interpretation
by: Zhao, Jitian, et al.
Published: (2025)
by: Zhao, Jitian, et al.
Published: (2025)
DISCO: Diversifying Sample Condensation for Efficient Model Evaluation
by: Rubinstein, Alexander, et al.
Published: (2025)
by: Rubinstein, Alexander, et al.
Published: (2025)
Learning from Less: Measuring the Effectiveness of RLVR in Low Data and Compute Regimes
by: Bauer, Justin, et al.
Published: (2026)
by: Bauer, Justin, et al.
Published: (2026)
Promises and Pitfalls of Threshold-based Auto-labeling
by: Vishwakarma, Harit, et al.
Published: (2022)
by: Vishwakarma, Harit, et al.
Published: (2022)
Learning Multi-Indicator Weights for Data Selection: A Joint Task-Model Adaptation Framework with Efficient Proxies
by: Song, Jingze, et al.
Published: (2026)
by: Song, Jingze, et al.
Published: (2026)
On Evaluation Protocols for Data Augmentation in a Limited Data Scenario
by: Piedboeuf, Frédéric, et al.
Published: (2024)
by: Piedboeuf, Frédéric, et al.
Published: (2024)
Data-Efficient Training by Evolved Sampling
by: Cheng, Ziheng, et al.
Published: (2025)
by: Cheng, Ziheng, et al.
Published: (2025)
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
by: Liu, Vincent, et al.
Published: (2023)
by: Liu, Vincent, et al.
Published: (2023)
SkillOrchestra: Learning to Route Agents via Skill Transfer
by: Wang, Jiayu, et al.
Published: (2026)
by: Wang, Jiayu, et al.
Published: (2026)
Max-Utility Based Arm Selection Strategy For Sequential Query Recommendations
by: Parambath, Shameem A. Puthiya, et al.
Published: (2021)
by: Parambath, Shameem A. Puthiya, et al.
Published: (2021)
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond
by: Nguyen-Tang, Thanh, et al.
Published: (2024)
by: Nguyen-Tang, Thanh, et al.
Published: (2024)
Sampling and Loss Weights in Multi-Domain Training
by: Salmani, Mahdi, et al.
Published: (2025)
by: Salmani, Mahdi, et al.
Published: (2025)
DaMoC: Efficiently Selecting the Optimal Large Language Model for Fine-tuning Domain Tasks Based on Data and Model Compression
by: Huang, Wei, et al.
Published: (2025)
by: Huang, Wei, et al.
Published: (2025)
Model-Agnostic Dynamic Feature Selection with Uncertainty Quantification
by: Fumanal-Idocin, Javier, et al.
Published: (2025)
by: Fumanal-Idocin, Javier, et al.
Published: (2025)
Joint Optimization of Resource Allocation and Data Selection for Fast and Cost-Efficient Federated Edge Learning
by: Jia, Yunjian, et al.
Published: (2024)
by: Jia, Yunjian, et al.
Published: (2024)
UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning
by: Patil, Vaidehi, et al.
Published: (2025)
by: Patil, Vaidehi, et al.
Published: (2025)
Know When to Abstain: Optimal Selective Classification with Likelihood Ratios
by: Heng, Alvin, et al.
Published: (2025)
by: Heng, Alvin, et al.
Published: (2025)
Evaluating Causal Discovery Algorithms for Path-Specific Fairness and Utility in Healthcare
by: Nagesh, Nitish, et al.
Published: (2026)
by: Nagesh, Nitish, et al.
Published: (2026)
Fair Class-Incremental Learning using Sample Weighting
by: Park, Jaeyoung, et al.
Published: (2024)
by: Park, Jaeyoung, et al.
Published: (2024)
Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol
by: Liu, Pai, et al.
Published: (2025)
by: Liu, Pai, et al.
Published: (2025)
COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism
by: He, Jianing, et al.
Published: (2024)
by: He, Jianing, et al.
Published: (2024)
Gradient Weight-normalized Low-rank Projection for Efficient LLM Training
by: Huang, Jia-Hong, et al.
Published: (2024)
by: Huang, Jia-Hong, et al.
Published: (2024)
Preference Models assume Proportional Hazards of Utilities
by: Nagpal, Chirag
Published: (2025)
by: Nagpal, Chirag
Published: (2025)
Filter-then-Weight: Online Data Selection and Reweighting for LLM Fine-Tuning
by: Wang, Fangxin, et al.
Published: (2026)
by: Wang, Fangxin, et al.
Published: (2026)
An Interpretable and Scalable Framework for Evaluating Large Language Models
by: Qu, Xinhao, et al.
Published: (2026)
by: Qu, Xinhao, et al.
Published: (2026)
Training Data Selection with Gradient Orthogonality for Efficient Domain Adaptation
by: Zhang, Xiyang, et al.
Published: (2026)
by: Zhang, Xiyang, et al.
Published: (2026)
Similar Items
-
RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning
by: Huang, Tzu-Heng, et al.
Published: (2026) -
MixAtlas: Uncertainty-aware Data Mixture Optimization for Multimodal LLM Midtraining
by: Wen, Bingbing, et al.
Published: (2026) -
Time To Impeach LLM-as-a-Judge: Programs are the Future of Evaluation
by: Huang, Tzu-Heng, et al.
Published: (2025) -
The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators
by: Huang, Tzu-Heng, et al.
Published: (2024) -
Weak-to-Strong Generalization Through the Data-Centric Lens
by: Shin, Changho, et al.
Published: (2024)