Saved in:
| Main Authors: | Zhong, Yifan, Ma, Chengdong, Zhang, Xiaoyuan, Yang, Ziran, Chen, Haojun, Zhang, Qingfu, Qi, Siyuan, Yang, Yaodong |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.02030 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
by: Zhang, Zhaowei, et al.
Published: (2025)
by: Zhang, Zhaowei, et al.
Published: (2025)
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment
by: Wang, Mingzhi, et al.
Published: (2024)
by: Wang, Mingzhi, et al.
Published: (2024)
Evolving Diverse Red-team Language Models in Multi-round Multi-agent Games
by: Ma, Chengdong, et al.
Published: (2023)
by: Ma, Chengdong, et al.
Published: (2023)
Dealing with Structure Constraints in Evolutionary Pareto Set Learning
by: Lin, Xi, et al.
Published: (2023)
by: Lin, Xi, et al.
Published: (2023)
In-Context Editing: Learning Knowledge from Self-Induced Distributions
by: Qi, Siyuan, et al.
Published: (2024)
by: Qi, Siyuan, et al.
Published: (2024)
PKU-SafeRLHF: Towards Multi-Level Safety Alignment for LLMs with Human Preference
by: Ji, Jiaming, et al.
Published: (2024)
by: Ji, Jiaming, et al.
Published: (2024)
Heterogeneous Value Alignment Evaluation for Large Language Models
by: Zhang, Zhaowei, et al.
Published: (2023)
by: Zhang, Zhaowei, et al.
Published: (2023)
World Models Should Prioritize the Unification of Physical and Social Dynamics
by: Zhang, Xiaoyuan, et al.
Published: (2025)
by: Zhang, Xiaoyuan, et al.
Published: (2025)
Efficient Model-agnostic Alignment via Bayesian Persuasion
by: Bai, Fengshuo, et al.
Published: (2024)
by: Bai, Fengshuo, et al.
Published: (2024)
PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs
by: Liu, An, et al.
Published: (2024)
by: Liu, An, et al.
Published: (2024)
SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset
by: Dai, Josef, et al.
Published: (2024)
by: Dai, Josef, et al.
Published: (2024)
CATNIP: LLM Unlearning via Calibrated and Tokenized Negative Preference Alignment
by: Yang, Zhengbang, et al.
Published: (2026)
by: Yang, Zhengbang, et al.
Published: (2026)
Can LLMs Understand Unvoiced Speech? Exploring EMG-to-Text Conversion with LLMs
by: Mohapatra, Payal, et al.
Published: (2025)
by: Mohapatra, Payal, et al.
Published: (2025)
Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment
by: Li, Moxin, et al.
Published: (2025)
by: Li, Moxin, et al.
Published: (2025)
Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering
by: Zhang, Yichi, et al.
Published: (2023)
by: Zhang, Yichi, et al.
Published: (2023)
Roadmap on Incentive Compatibility for AI Alignment and Governance in Sociotechnical Systems
by: Zhang, Zhaowei, et al.
Published: (2024)
by: Zhang, Zhaowei, et al.
Published: (2024)
UMOEA/D: A Multiobjective Evolutionary Algorithm for Uniform Pareto Objectives based on Decomposition
by: Zhang, Xiaoyuan, et al.
Published: (2024)
by: Zhang, Xiaoyuan, et al.
Published: (2024)
LLMs Know More Than Words: A Genre Study with Syntax, Metaphor & Phonetics
by: Shi, Weiye, et al.
Published: (2025)
by: Shi, Weiye, et al.
Published: (2025)
Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction
by: Lou, Hantao, et al.
Published: (2025)
by: Lou, Hantao, et al.
Published: (2025)
Does LLM Alignment Really Need Diversity? An Empirical Study of Adapting RLVR Methods for Moral Reasoning
by: Zhang, Zhaowei, et al.
Published: (2026)
by: Zhang, Zhaowei, et al.
Published: (2026)
PSD: Pushing the Pareto Frontier of Diffusion LLMs via Parallel Speculative Decoding
by: Sun, Shengyin, et al.
Published: (2026)
by: Sun, Shengyin, et al.
Published: (2026)
Safety Alignment as Continual Learning: Mitigating the Alignment Tax via Orthogonal Gradient Projection
by: Sun, Guanglong, et al.
Published: (2026)
by: Sun, Guanglong, et al.
Published: (2026)
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
by: Wang, Haoxiang, et al.
Published: (2024)
by: Wang, Haoxiang, et al.
Published: (2024)
Evaluating and Aligning CodeLLMs on Human Preference
by: Yang, Jian, et al.
Published: (2024)
by: Yang, Jian, et al.
Published: (2024)
TokAlign++: Advancing Vocabulary Adaptation via Better Token Alignment
by: Li, Chong, et al.
Published: (2026)
by: Li, Chong, et al.
Published: (2026)
ProgressGym: Alignment with a Millennium of Moral Progress
by: Qiu, Tianyi, et al.
Published: (2024)
by: Qiu, Tianyi, et al.
Published: (2024)
Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation
by: Bai, Fengshuo, et al.
Published: (2024)
by: Bai, Fengshuo, et al.
Published: (2024)
PICACO: Pluralistic In-Context Value Alignment of LLMs via Total Correlation Optimization
by: Jiang, Han, et al.
Published: (2025)
by: Jiang, Han, et al.
Published: (2025)
UC-MOA: Utility-Conditioned Multi-Objective Alignment for Distributional Pareto-Optimality
by: Cheng, Zelei, et al.
Published: (2025)
by: Cheng, Zelei, et al.
Published: (2025)
DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization
by: Zhang, Chao, et al.
Published: (2025)
by: Zhang, Chao, et al.
Published: (2025)
Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment
by: Yang, Wen, et al.
Published: (2025)
by: Yang, Wen, et al.
Published: (2025)
SAE-V: Interpreting Multimodal Models for Enhanced Alignment
by: Lou, Hantao, et al.
Published: (2025)
by: Lou, Hantao, et al.
Published: (2025)
Self-supervised Attribute-aware Dynamic Preference Ranking Alignment
by: Yang, Hongyu, et al.
Published: (2025)
by: Yang, Hongyu, et al.
Published: (2025)
Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM Game
by: Cheng, Pengyu, et al.
Published: (2023)
by: Cheng, Pengyu, et al.
Published: (2023)
ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models using Pareto High-quality Data
by: Gu, Haoran, et al.
Published: (2025)
by: Gu, Haoran, et al.
Published: (2025)
MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment
by: Wang, Tianze, et al.
Published: (2025)
by: Wang, Tianze, et al.
Published: (2025)
Speech-Worthy Alignment for Japanese SpeechLLMs via Direct Preference Optimization
by: Zhao, Mengjie, et al.
Published: (2026)
by: Zhao, Mengjie, et al.
Published: (2026)
Accelerating Robotic Reinforcement Learning with Agent Guidance
by: Chen, Haojun, et al.
Published: (2026)
by: Chen, Haojun, et al.
Published: (2026)
PACIFIC: Can LLMs Discern the Traits Influencing Your Preferences? Evaluating Personality-Driven Preference Alignment in LLMs
by: Zhao, Tianyu, et al.
Published: (2026)
by: Zhao, Tianyu, et al.
Published: (2026)
HCAttention: Extreme KV Cache Compression via Heterogeneous Attention Computing for LLMs
by: Yang, Dongquan, et al.
Published: (2025)
by: Yang, Dongquan, et al.
Published: (2025)
Similar Items
-
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
by: Zhang, Zhaowei, et al.
Published: (2025) -
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment
by: Wang, Mingzhi, et al.
Published: (2024) -
Evolving Diverse Red-team Language Models in Multi-round Multi-agent Games
by: Ma, Chengdong, et al.
Published: (2023) -
Dealing with Structure Constraints in Evolutionary Pareto Set Learning
by: Lin, Xi, et al.
Published: (2023) -
In-Context Editing: Learning Knowledge from Self-Induced Distributions
by: Qi, Siyuan, et al.
Published: (2024)