| Main Authors: | He, Yifei, Zhou, Shiji, Zhang, Guojun, Yun, Hyokun, Xu, Yi, Zeng, Belinda, Chilimbi, Trishul, Zhao, Han |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.02009 |
Similar Items
AutoMixAlign: Adaptive Data Mixing for Multi-Task Preference Optimization in LLMs
by: Corrado, Nicholas E., et al.
Published: (2025)
Evolutionary Contrastive Distillation for Language Model Alignment
by: Katz-Samuels, Julian, et al.
Published: (2024)
VidLA: Video-Language Alignment at Scale
by: Rizve, Mamshad Nayeem, et al.
Published: (2024)
DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models
by: Ram, Shwetha, et al.
Published: (2024)
X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs
by: Swetha, Sirnam, et al.
Published: (2024)
InfoPO: On Mutual Information Maximization for Large Language Model Alignment
by: Xiao, Teng, et al.
Published: (2025)
Mitigating Accuracy-Robustness Trade-off via Balanced Multi-Teacher Adversarial Distillation
by: Zhao, Shiji, et al.
Published: (2023)
Tensorized Clustered LoRA Merging for Multi-Task Interference
by: Su, Zhan, et al.
Published: (2025)
Task Vector Bases: A Unified and Scalable Framework for Compressed Task Arithmetic
by: Zeng, Siqi, et al.
Published: (2025)
Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective
by: Zhang, Zhi, et al.
Published: (2024)
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
by: Wei, Zhepei, et al.
Published: (2025)
Lightweight and Robust Federated Data Valuation
by: Tang, Guojun, et al.
Published: (2025)
Improving Sampling Efficiency in RLVR through Adaptive Rollout and Response Reuse
by: Zhang, Yuheng, et al.
Published: (2025)
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
by: He, Yifei, et al.
Published: (2024)
Robust Fast Adaptation from Adversarially Explicit Task Distribution Generation
by: Wang, Cheems, et al.
Published: (2024)
MergeBench: A Benchmark for Merging Domain-Specialized LLMs
by: He, Yifei, et al.
Published: (2025)
Moment Alignment: Unifying Gradient and Hessian Matching for Domain Generalization
by: Chen, Yuen, et al.
Published: (2025)
Efficient Stochastic Approximation of Minimax Excess Risk Optimization
by: Zhang, Lijun, et al.
Published: (2023)
Zeroth-Order Stochastic Mirror Descent Algorithms for Minimax Excess Risk Optimization
by: Gu, Zhihao, et al.
Published: (2024)
Efficient Utility-Preserving Machine Unlearning with Implicit Gradient Surgery
by: Zhou, Shiji, et al.
Published: (2025)
Robust Bayesian Dynamic Programming for On-policy Risk-sensitive Reinforcement Learning
by: Han, Shanyu, et al.
Published: (2025)
Ask a Strong LLM Judge when Your Reward Model is Uncertain
by: Xu, Zhenghao, et al.
Published: (2025)
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
by: Xiong, Guojun, et al.
Published: (2024)
MoSLD: An Extremely Parameter-Efficient Mixture-of-Shared LoRAs for Multi-Task Learning
by: Zhao, Lulu, et al.
Published: (2024)
Multi-Task Learning with Feature-Similarity Laplacian Graphs for Predicting Alzheimer's Disease Progression
by: Xu, Zixiang, et al.
Published: (2025)
Improving Adversarial Robust Fairness via Anti-Bias Soft Label Distillation
by: Zhao, Shiji, et al.
Published: (2023)
RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users
by: Ye, Suyu, et al.
Published: (2025)
Distributionally Robust Multi-Task Reinforcement Learning via Adaptive Task Sampling
by: Corrado, Nicholas E., et al.
Published: (2026)
Delve into the Applicability of Advanced Optimizers for Multi-Task Learning
by: Zhou, Zhipeng, et al.
Published: (2026)
Injecting Imbalance Sensitivity for Multi-Task Learning
by: Zhou, Zhipeng, et al.
Published: (2025)
Minimax Excess Risk of First-Order Methods for Statistical Learning with Data-Dependent Oracles
by: Scaman, Kevin, et al.
Published: (2023)
Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
by: Kim, Sungnyun, et al.
Published: (2025)
Sharper Risk Bound for Multi-Task Learning with Multi-Graph Dependent Data
by: Shao, Xiao, et al.
Published: (2025)
Gradual Domain Adaptation: Theory and Algorithms
by: He, Yifei, et al.
Published: (2023)
Optimal Excess Risk Bounds for Empirical Risk Minimization on $p$-Norm Linear Regression
by: Hanchi, Ayoub El, et al.
Published: (2023)
Exploring Correlations of Self-Supervised Tasks for Graphs
by: Fang, Taoran, et al.
Published: (2024)
Better Representations via Adversarial Training in Pre-Training: A Theoretical Perspective
by: Xing, Yue, et al.
Published: (2024)
SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks
by: Zhou, Yifei, et al.
Published: (2025)
A Reinforcement-Learning-Based Multiple-Column Selection Strategy for Column Generation
by: Yuan, Haofeng, et al.
Published: (2023)
Bounding the Excess Risk for Linear Models Trained on Marginal-Preserving, Differentially-Private, Synthetic Data
by: Zhou, Yvonne, et al.
Published: (2024)