| Main Authors: | Falahati, Ali, Amiri, Mohammad Mohammadi, Larson, Kate, Golab, Lukasz |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.07724 |
Similar Items
The Alignment Game: A Theory of Long-Horizon Alignment Through Recursive Curation
by: Falahati, Ali, et al.
Published: (2025)
Disentangled Structural and Featural Representation for Task-Agnostic Graph Valuation
by: Falahati, Ali, et al.
Published: (2024)
A Median Perspective on Unlabeled Data for Out-of-Distribution Detection
by: Abbas, Momin, et al.
Published: (2025)
SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching
by: Zhu, Yuxuan, et al.
Published: (2025)
Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy and Research
by: Cooper, A. Feder, et al.
Published: (2024)
Infinite Width Models That Work: Why Feature Learning Doesn't Matter as Much as You Think
by: Sernau, Luke
Published: (2024)
One LR Doesn't Fit All: Heavy-Tail Guided Layerwise Learning Rates for LLMs
by: He, Di, et al.
Published: (2026)
Toward Efficient Influence Function: Dropout as a Compression Tool
by: Zhang, Yuchen, et al.
Published: (2025)
Optimal Singular Damage: Efficient LLM Inference in Low Storage Regimes
by: Alipour, Mohammadsajad, et al.
Published: (2025)
MedCalc-Bench Doesn't Measure What You Think: A Benchmark Audit and the Case for Open-Book Evaluation
by: Krohn-Grimberghe, Artus
Published: (2026)
GASTON: Graph-Aware Social Transformer for Online Networks
by: Wloch, Olha, et al.
Published: (2026)
OFMU: Optimization-Driven Framework for Machine Unlearning
by: Asif, Sadia, et al.
Published: (2025)
Towards Reversible Model Merging For Low-rank Weights
by: Alipour, Mohammadsajad, et al.
Published: (2025)
DriftXpress: Faster Drifting Models via Projected RKHS Fields
by: Falahati, Ali, et al.
Published: (2026)
MixDPO: Modeling Preference Strength for Pluralistic Alignment
by: Imai, Saki, et al.
Published: (2026)
Revisiting Intermediate-Layer Matching in Knowledge Distillation: Layer-Selection Strategy Doesn't Matter (Much)
by: Yu, Zony, et al.
Published: (2025)
APPA: Adaptive Preference Pluralistic Alignment for Fair Federated RLHF of LLMs
by: Srewa, Mahmoud, et al.
Published: (2026)
Multi-modal Synthetic Data Training and Model Collapse: Insights from VLMs and Diffusion Models
by: Hu, Zizhao, et al.
Published: (2025)
Bayesian Elicitation with LLMs: Model Size Helps, Extra "Reasoning" Doesn't Always
by: Hobor, Luka, et al.
Published: (2026)
ChatGPT Doesn't Trust Chargers Fans: Guardrail Sensitivity in Context
by: Li, Victoria R., et al.
Published: (2024)
RefusalGuard: Geometry-Preserving Fine-Tuning for Safety in LLMs
by: Asif, Sadia, et al.
Published: (2026)
Self-Training Doesn't Flatten Language -- It Restructures It: Surface Markers Amplify While Deep Syntax Dies
by: Liu, Ming
Published: (2026)
Recurrent Off-Policy Deep Reinforcement Learning Doesn't Have to be Slow
by: Clark, Tyler, et al.
Published: (2025)
Explaining Expert Search and Team Formation Systems with ExES
by: Golzadeh, Kiarash, et al.
Published: (2024)
Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World
by: Kazdan, Joshua, et al.
Published: (2024)
Reflective Verbal Reward Design for Pluralistic Alignment
by: Blair, Carter, et al.
Published: (2025)
Order Doesn't Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation
by: He, Qianxi, et al.
Published: (2025)
Adaptive Alignment: Dynamic Preference Adjustments via Multi-Objective Reinforcement Learning for Pluralistic AI
by: Harland, Hadassah, et al.
Published: (2024)
Power to the Clients: Federated Learning in a Dictatorship Setting
by: Alipour, Mohammadsajad, et al.
Published: (2025)
Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't
by: Taguchi, Chihiro, et al.
Published: (2024)
Training-Free Geospatial Place Representation Learning from Large-Scale Point-of-Interest Graph Data
by: Hashemi, Mohammad, et al.
Published: (2025)
Four Things People Should Know About Migraines
by: Parsa, Mohammad S., et al.
Published: (2025)
REPA Works Until It Doesn't: Early-Stopped, Holistic Alignment Supercharges Diffusion Training
by: Wang, Ziqiao, et al.
Published: (2025)
Pairwise Calibrated Rewards for Pluralistic Alignment
by: Halpern, Daniel, et al.
Published: (2025)
A Note on Shumailov et al. (2024): `AI Models Collapse When Trained on Recursively Generated Data'
by: Borji, Ali
Published: (2024)
Liquid Democracy for Low-Cost Ensemble Pruning
by: Armstrong, Ben, et al.
Published: (2024)
Liquid Ensemble Selection for Continual Learning
by: Blair, Carter, et al.
Published: (2024)
Pluralistic Alignment Over Time
by: Klassen, Toryn Q., et al.
Published: (2024)
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy
by: Liu, Chris Yuhao, et al.
Published: (2025)
Self-Consuming Generative Models with Curated Data Provably Optimize Human Preferences
by: Ferbach, Damien, et al.
Published: (2024)