:: Library Catalog

Bibliographic Details
Main Authors:	Falahati, Ali; Amiri, Mohammad Mohammadi; Larson, Kate; Golab, Lukasz
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.12804

Similar Items

Curated Synthetic Data Doesn't Have to Collapse: A Theoretical Study of Generative Retraining with Pluralistic Preferences
by: Falahati, Ali, et al.
Published: (2026)

Disentangled Structural and Featural Representation for Task-Agnostic Graph Valuation
by: Falahati, Ali, et al.
Published: (2024)

A Median Perspective on Unlabeled Data for Out-of-Distribution Detection
by: Abbas, Momin, et al.
Published: (2025)

SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching
by: Zhu, Yuxuan, et al.
Published: (2025)

Toward Efficient Influence Function: Dropout as a Compression Tool
by: Zhang, Yuchen, et al.
Published: (2025)

Jackpot! Alignment as a Maximal Lottery
by: Maura-Rivero, Roberto-Rafael, et al.
Published: (2025)

Optimal Singular Damage: Efficient LLM Inference in Low Storage Regimes
by: Alipour, Mohammadsajad, et al.
Published: (2025)

GASTON: Graph-Aware Social Transformer for Online Networks
by: Wloch, Olha, et al.
Published: (2026)

DriftXpress: Faster Drifting Models via Projected RKHS Fields
by: Falahati, Ali, et al.
Published: (2026)

OFMU: Optimization-Driven Framework for Machine Unlearning
by: Asif, Sadia, et al.
Published: (2025)

Towards Reversible Model Merging For Low-rank Weights
by: Alipour, Mohammadsajad, et al.
Published: (2025)

Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs
by: Sel, Bilgehan, et al.
Published: (2024)

Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models
by: Wang, Fei, et al.
Published: (2024)

RefusalGuard: Geometry-Preserving Fine-Tuning for Safety in LLMs
by: Asif, Sadia, et al.
Published: (2026)

When and How Human Curation Backfires: Preference Alignment under Multi-Model Self-Consuming Loop
by: Zhang, Yang, et al.
Published: (2026)

Reward Shaping for Inference-Time Alignment: A Stackelberg Game Perspective
by: Wang, Haichuan, et al.
Published: (2026)

Towards a Learning Theory of Representation Alignment
by: Insulla, Francesco, et al.
Published: (2025)

LLM Probability Concentration: How Alignment Shrinks the Generative Horizon
by: Yang, Chenghao, et al.
Published: (2025)

Learning Hyperspectral Images with Curated Text Prompts for Efficient Multimodal Alignment
by: Chatterjee, Abhiroop, et al.
Published: (2025)

Tokenized Bandit for LLM Decoding and Alignment
by: Shin, Suho, et al.
Published: (2025)

SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement
by: Sahoo, Subramanyam, et al.
Published: (2026)

Information as Structural Alignment: A Dynamical Theory of Continual Learning
by: Negulescu, Radu
Published: (2026)

Manifold Approximation leads to Robust Kernel Alignment
by: Islam, Mohammad Tariqul, et al.
Published: (2025)

Pharmacist: Safety Alignment Data Curation for Large Language Models against Harmful Fine-tuning
by: Liu, Guozhi, et al.
Published: (2025)

Direct Alignment with Heterogeneous Preferences
by: Shirali, Ali, et al.
Published: (2025)

Four Things People Should Know About Migraines
by: Parsa, Mohammad S., et al.
Published: (2025)

RDD: Retrieval-Based Demonstration Decomposer for Planner Alignment in Long-Horizon Tasks
by: Yan, Mingxuan, et al.
Published: (2025)

Liquid Democracy for Low-Cost Ensemble Pruning
by: Armstrong, Ben, et al.
Published: (2024)

Liquid Ensemble Selection for Continual Learning
by: Blair, Carter, et al.
Published: (2024)

Power to the Clients: Federated Learning in a Dictatorship Setting
by: Alipour, Mohammadsajad, et al.
Published: (2025)

The Sign Estimator: LLM Alignment in the Face of Choice Heterogeneity
by: Aouad, Ali, et al.
Published: (2025)

Fair Dataset Distillation via Cross-Group Barycenter Alignment
by: Moslemi, Mohammad Hossein, et al.
Published: (2026)

Automated Meta Prompt Engineering for Alignment with the Theory of Mind
by: Baughman, Aaron, et al.
Published: (2025)

Prior-informed optimization of treatment recommendation via bandit algorithms trained on large language model-processed historical records
by: Nessari, Saman, et al.
Published: (2025)

Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment
by: Zhou, Weichao, et al.
Published: (2024)

Training-Free Geospatial Place Representation Learning from Large-Scale Point-of-Interest Graph Data
by: Hashemi, Mohammad, et al.
Published: (2025)

Model Alignment Search
by: Grant, Satchel
Published: (2025)

Towards Generalisable Imitation Learning Through Conditioned Transition Estimation and Online Behaviour Alignment
by: Gavenski, Nathan, et al.
Published: (2026)

ECLIPTICA -- A Framework for Switchable LLM Alignment via CITA - Contrastive Instruction-Tuned Alignment
by: Wanaskar, Kapil, et al.
Published: (2026)

Scalable, Explainable and Provably Robust Anomaly Detection with One-Step Flow Matching
by: Li, Zhong, et al.
Published: (2025)