Saved in:
| Main Authors: | Corrado, Nicholas E., Katz-Samuels, Julian, Devraj, Adithya, Yun, Hyokun, Zhang, Chao, Xu, Yi, Pan, Yi, Yin, Bing, Chilimbi, Trishul |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.00569 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Evolutionary Contrastive Distillation for Language Model Alignment
by: Katz-Samuels, Julian, et al.
Published: (2024)
by: Katz-Samuels, Julian, et al.
Published: (2024)
Robust Multi-Task Learning with Excess Risks
by: He, Yifei, et al.
Published: (2024)
by: He, Yifei, et al.
Published: (2024)
InfoPO: On Mutual Information Maximization for Large Language Model Alignment
by: Xiao, Teng, et al.
Published: (2025)
by: Xiao, Teng, et al.
Published: (2025)
DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models
by: Ram, Shwetha, et al.
Published: (2024)
by: Ram, Shwetha, et al.
Published: (2024)
Aligning Large Language Models with Implicit Preferences from User-Generated Content
by: Tan, Zhaoxuan, et al.
Published: (2025)
by: Tan, Zhaoxuan, et al.
Published: (2025)
Improving Sampling Efficiency in RLVR through Adaptive Rollout and Response Reuse
by: Zhang, Yuheng, et al.
Published: (2025)
by: Zhang, Yuheng, et al.
Published: (2025)
Listwise Direct Preference Optimization with Multi-Dimensional Preference Mixing
by: Sun, Yuhui, et al.
Published: (2025)
by: Sun, Yuhui, et al.
Published: (2025)
M-LLM Based Video Frame Selection for Efficient Video Understanding
by: Hu, Kai, et al.
Published: (2025)
by: Hu, Kai, et al.
Published: (2025)
Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization
by: Gu, Yi, et al.
Published: (2024)
by: Gu, Yi, et al.
Published: (2024)
Distributionally Robust Multi-Task Reinforcement Learning via Adaptive Task Sampling
by: Corrado, Nicholas E., et al.
Published: (2026)
by: Corrado, Nicholas E., et al.
Published: (2026)
CoLLM: A Large Language Model for Composed Image Retrieval
by: Huynh, Chuong, et al.
Published: (2025)
by: Huynh, Chuong, et al.
Published: (2025)
Evaluating and Aligning Human Economic Risk Preferences in LLMs
by: Liu, Jiaxin, et al.
Published: (2025)
by: Liu, Jiaxin, et al.
Published: (2025)
VAO: Validation-Aligned Optimization for Cross-Task Generative Auto-Bidding
by: Lv, Yiqin, et al.
Published: (2025)
by: Lv, Yiqin, et al.
Published: (2025)
ScaleBITS: Scalable Bitwidth Search for Hardware-Aligned Mixed-Precision LLMs
by: Li, Xinlin, et al.
Published: (2026)
by: Li, Xinlin, et al.
Published: (2026)
Open Vocabulary Multi-Label Video Classification
by: Gupta, Rohit, et al.
Published: (2024)
by: Gupta, Rohit, et al.
Published: (2024)
X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs
by: Swetha, Sirnam, et al.
Published: (2024)
by: Swetha, Sirnam, et al.
Published: (2024)
Learning to Align Human Code Preferences
by: Yin, Xin, et al.
Published: (2025)
by: Yin, Xin, et al.
Published: (2025)
AutoMix: Automatically Mixing Language Models
by: Aggarwal, Pranjal, et al.
Published: (2023)
by: Aggarwal, Pranjal, et al.
Published: (2023)
Bradley-Terry Policy Optimization for Generative Preference Modeling
by: Feng, Shengyu, et al.
Published: (2025)
by: Feng, Shengyu, et al.
Published: (2025)
Aligning CodeLLMs with Direct Preference Optimization
by: Miao, Yibo, et al.
Published: (2024)
by: Miao, Yibo, et al.
Published: (2024)
HYPO: Hyperspherical Out-of-Distribution Generalization
by: Bai, Haoyue, et al.
Published: (2024)
by: Bai, Haoyue, et al.
Published: (2024)
The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning
by: Borkar, Vivek, et al.
Published: (2021)
by: Borkar, Vivek, et al.
Published: (2021)
VidLA: Video-Language Alignment at Scale
by: Rizve, Mamshad Nayeem, et al.
Published: (2024)
by: Rizve, Mamshad Nayeem, et al.
Published: (2024)
Exploring Reasoning-Infused Text Embedding with Large Language Models for Zero-Shot Dense Retrieval
by: Liu, Yuxiang, et al.
Published: (2025)
by: Liu, Yuxiang, et al.
Published: (2025)
Matroidal Mixed Eulerian Numbers
by: Katz, Eric, et al.
Published: (2023)
by: Katz, Eric, et al.
Published: (2023)
AutoScale: Scale-Aware Data Mixing for Pre-Training LLMs
by: Kang, Feiyang, et al.
Published: (2024)
by: Kang, Feiyang, et al.
Published: (2024)
Exposing Privacy Gaps: Membership Inference Attack on Preference Data for LLM Alignment
by: Feng, Qizhang, et al.
Published: (2024)
by: Feng, Qizhang, et al.
Published: (2024)
DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents
by: Shi, Kai, et al.
Published: (2025)
by: Shi, Kai, et al.
Published: (2025)
AdaViP: Aligning Multi-modal LLMs via Adaptive Vision-enhanced Preference Optimization
by: Lu, Jinda, et al.
Published: (2025)
by: Lu, Jinda, et al.
Published: (2025)
MixRec: Individual and Collective Mixing Empowers Data Augmentation for Recommender Systems
by: Zhang, Yi, et al.
Published: (2025)
by: Zhang, Yi, et al.
Published: (2025)
Synthetic Users, Real Differences: an Evaluation Framework for User Simulation in Multi-Turn Conversations
by: Liu, Yu Lu, et al.
Published: (2026)
by: Liu, Yu Lu, et al.
Published: (2026)
LLMs are the Ideal Candidate for Mixed-Initiative Game Design Pillar Workflows
by: Geheeb, Julian, et al.
Published: (2026)
by: Geheeb, Julian, et al.
Published: (2026)
POPI: Personalizing LLMs via Optimized Natural Language Preference Inference
by: Chen, Yizhuo, et al.
Published: (2025)
by: Chen, Yizhuo, et al.
Published: (2025)
DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
by: Zhou, Zhenglin, et al.
Published: (2025)
by: Zhou, Zhenglin, et al.
Published: (2025)
Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generation of Diffusion Models
by: Wang, Fu-Yun, et al.
Published: (2025)
by: Wang, Fu-Yun, et al.
Published: (2025)
Optimizing Mixed Quantum Channels via Projected Gradient Dynamics
by: Lin, Matthew M., et al.
Published: (2025)
by: Lin, Matthew M., et al.
Published: (2025)
Less is More: Learning Graph Tasks with Just LLMs
by: Shirai, Sola, et al.
Published: (2025)
by: Shirai, Sola, et al.
Published: (2025)
PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment
by: Liu, Dongxu, et al.
Published: (2024)
by: Liu, Dongxu, et al.
Published: (2024)
Centralized Adaptive Sampling for Reliable Co-Training of Independent Multi-Agent Policies
by: Corrado, Nicholas E., et al.
Published: (2025)
by: Corrado, Nicholas E., et al.
Published: (2025)
APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs
by: Bouzouad, Meriem, et al.
Published: (2026)
by: Bouzouad, Meriem, et al.
Published: (2026)
Similar Items
-
Evolutionary Contrastive Distillation for Language Model Alignment
by: Katz-Samuels, Julian, et al.
Published: (2024) -
Robust Multi-Task Learning with Excess Risks
by: He, Yifei, et al.
Published: (2024) -
InfoPO: On Mutual Information Maximization for Large Language Model Alignment
by: Xiao, Teng, et al.
Published: (2025) -
DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models
by: Ram, Shwetha, et al.
Published: (2024) -
Aligning Large Language Models with Implicit Preferences from User-Generated Content
by: Tan, Zhaoxuan, et al.
Published: (2025)