| Main Authors: | Falahati, Ali, Amiri, Mohammad Mohammadi, Larson, Kate, Golab, Lukasz |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.12804 |
Similar Items
Curated Synthetic Data Doesn't Have to Collapse: A Theoretical Study of Generative Retraining with Pluralistic Preferences
by: Falahati, Ali, et al.
Published: (2026)
Disentangled Structural and Featural Representation for Task-Agnostic Graph Valuation
by: Falahati, Ali, et al.
Published: (2024)
A Median Perspective on Unlabeled Data for Out-of-Distribution Detection
by: Abbas, Momin, et al.
Published: (2025)
SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching
by: Zhu, Yuxuan, et al.
Published: (2025)
Toward Efficient Influence Function: Dropout as a Compression Tool
by: Zhang, Yuchen, et al.
Published: (2025)
Jackpot! Alignment as a Maximal Lottery
by: Maura-Rivero, Roberto-Rafael, et al.
Published: (2025)
Optimal Singular Damage: Efficient LLM Inference in Low Storage Regimes
by: Alipour, Mohammadsajad, et al.
Published: (2025)
GASTON: Graph-Aware Social Transformer for Online Networks
by: Wloch, Olha, et al.
Published: (2026)
DriftXpress: Faster Drifting Models via Projected RKHS Fields
by: Falahati, Ali, et al.
Published: (2026)
OFMU: Optimization-Driven Framework for Machine Unlearning
by: Asif, Sadia, et al.
Published: (2025)
Towards Reversible Model Merging For Low-rank Weights
by: Alipour, Mohammadsajad, et al.
Published: (2025)
Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs
by: Sel, Bilgehan, et al.
Published: (2024)
Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models
by: Wang, Fei, et al.
Published: (2024)
RefusalGuard: Geometry-Preserving Fine-Tuning for Safety in LLMs
by: Asif, Sadia, et al.
Published: (2026)
When and How Human Curation Backfires: Preference Alignment under Multi-Model Self-Consuming Loop
by: Zhang, Yang, et al.
Published: (2026)
Reward Shaping for Inference-Time Alignment: A Stackelberg Game Perspective
by: Wang, Haichuan, et al.
Published: (2026)
Towards a Learning Theory of Representation Alignment
by: Insulla, Francesco, et al.
Published: (2025)
LLM Probability Concentration: How Alignment Shrinks the Generative Horizon
by: Yang, Chenghao, et al.
Published: (2025)
Learning Hyperspectral Images with Curated Text Prompts for Efficient Multimodal Alignment
by: Chatterjee, Abhiroop, et al.
Published: (2025)
Tokenized Bandit for LLM Decoding and Alignment
by: Shin, Suho, et al.
Published: (2025)
SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement
by: Sahoo, Subramanyam, et al.
Published: (2026)
Information as Structural Alignment: A Dynamical Theory of Continual Learning
by: Negulescu, Radu
Published: (2026)
Manifold Approximation leads to Robust Kernel Alignment
by: Islam, Mohammad Tariqul, et al.
Published: (2025)
Pharmacist: Safety Alignment Data Curation for Large Language Models against Harmful Fine-tuning
by: Liu, Guozhi, et al.
Published: (2025)
Direct Alignment with Heterogeneous Preferences
by: Shirali, Ali, et al.
Published: (2025)
Four Things People Should Know About Migraines
by: Parsa, Mohammad S., et al.
Published: (2025)
RDD: Retrieval-Based Demonstration Decomposer for Planner Alignment in Long-Horizon Tasks
by: Yan, Mingxuan, et al.
Published: (2025)
Liquid Democracy for Low-Cost Ensemble Pruning
by: Armstrong, Ben, et al.
Published: (2024)
Liquid Ensemble Selection for Continual Learning
by: Blair, Carter, et al.
Published: (2024)
Power to the Clients: Federated Learning in a Dictatorship Setting
by: Alipour, Mohammadsajad, et al.
Published: (2025)
The Sign Estimator: LLM Alignment in the Face of Choice Heterogeneity
by: Aouad, Ali, et al.
Published: (2025)
Fair Dataset Distillation via Cross-Group Barycenter Alignment
by: Moslemi, Mohammad Hossein, et al.
Published: (2026)
Automated Meta Prompt Engineering for Alignment with the Theory of Mind
by: Baughman, Aaron, et al.
Published: (2025)
Prior-informed optimization of treatment recommendation via bandit algorithms trained on large language model-processed historical records
by: Nessari, Saman, et al.
Published: (2025)
Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment
by: Zhou, Weichao, et al.
Published: (2024)
Training-Free Geospatial Place Representation Learning from Large-Scale Point-of-Interest Graph Data
by: Hashemi, Mohammad, et al.
Published: (2025)
Model Alignment Search
by: Grant, Satchel
Published: (2025)
Towards Generalisable Imitation Learning Through Conditioned Transition Estimation and Online Behaviour Alignment
by: Gavenski, Nathan, et al.
Published: (2026)
ECLIPTICA -- A Framework for Switchable LLM Alignment via CITA - Contrastive Instruction-Tuned Alignment
by: Wanaskar, Kapil, et al.
Published: (2026)
Scalable, Explainable and Provably Robust Anomaly Detection with One-Step Flow Matching
by: Li, Zhong, et al.
Published: (2025)