Saved in:
| Main Authors: | Uhlig, Matthias, Schacht, Sigurd, Barkur, Sudarshan Kamath |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.10580 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Deception in LLMs: Self-Preservation and Autonomous Goals in Large Language Models
by: Barkur, Sudarshan Kamath, et al.
Published: (2025)
by: Barkur, Sudarshan Kamath, et al.
Published: (2025)
Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations
by: Donisch, Leo, et al.
Published: (2024)
by: Donisch, Leo, et al.
Published: (2024)
Cross-lingual Human-Preference Alignment for Neural Machine Translation with Direct Quality Optimization
by: Uhlig, Kaden, et al.
Published: (2024)
by: Uhlig, Kaden, et al.
Published: (2024)
A Review of Common Online Speaker Diarization Methods
by: Aperdannier, Roman, et al.
Published: (2024)
by: Aperdannier, Roman, et al.
Published: (2024)
Systematic Evaluation of Online Speaker Diarization Systems Regarding their Latency
by: Aperdannier, Roman, et al.
Published: (2024)
by: Aperdannier, Roman, et al.
Published: (2024)
An approach to optimize inference of the DIART speaker diarization pipeline
by: Aperdannier, Roman, et al.
Published: (2024)
by: Aperdannier, Roman, et al.
Published: (2024)
Language Models Largely Exhibit Human-like Constituent Ordering Preferences
by: Tur, Ada Defne, et al.
Published: (2025)
by: Tur, Ada Defne, et al.
Published: (2025)
eDIF: A European Deep Inference Fabric for Remote Interpretability of LLM
by: Guggenberger, Irma Heithoff. Marc, et al.
Published: (2025)
by: Guggenberger, Irma Heithoff. Marc, et al.
Published: (2025)
Direct Judgement Preference Optimization
by: Wang, Peifeng, et al.
Published: (2024)
by: Wang, Peifeng, et al.
Published: (2024)
BPO: Revisiting Preference Modeling in Direct Preference Optimization
by: Sun, Lin, et al.
Published: (2025)
by: Sun, Lin, et al.
Published: (2025)
Direct Multi-Turn Preference Optimization for Language Agents
by: Shi, Wentao, et al.
Published: (2024)
by: Shi, Wentao, et al.
Published: (2024)
New Desiderata for Direct Preference Optimization
by: Hu, Xiangkun, et al.
Published: (2024)
by: Hu, Xiangkun, et al.
Published: (2024)
BiasDPO: Mitigating Bias in Language Models through Direct Preference Optimization
by: Allam, Ahmed
Published: (2024)
by: Allam, Ahmed
Published: (2024)
Benchmarking Direct Preference Optimization for Medical Large Vision-Language Models
by: Kim, Dain, et al.
Published: (2026)
by: Kim, Dain, et al.
Published: (2026)
On Extending Direct Preference Optimization to Accommodate Ties
by: Chen, Jinghong, et al.
Published: (2024)
by: Chen, Jinghong, et al.
Published: (2024)
Token-weighted Direct Preference Optimization with Attention
by: Huang, Chengyu, et al.
Published: (2026)
by: Huang, Chengyu, et al.
Published: (2026)
Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment
by: Xiao, Teng, et al.
Published: (2024)
by: Xiao, Teng, et al.
Published: (2024)
Knowledge Editing in Language Models via Adapted Direct Preference Optimization
by: Rozner, Amit, et al.
Published: (2024)
by: Rozner, Amit, et al.
Published: (2024)
Length Desensitization in Direct Preference Optimization
by: Liu, Wei, et al.
Published: (2024)
by: Liu, Wei, et al.
Published: (2024)
Token-level Direct Preference Optimization
by: Zeng, Yongcheng, et al.
Published: (2024)
by: Zeng, Yongcheng, et al.
Published: (2024)
Filtered Direct Preference Optimization
by: Morimura, Tetsuro, et al.
Published: (2024)
by: Morimura, Tetsuro, et al.
Published: (2024)
Direct Preference Optimization with an Offset
by: Amini, Afra, et al.
Published: (2024)
by: Amini, Afra, et al.
Published: (2024)
Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training
by: Wu, Linjuan, et al.
Published: (2025)
by: Wu, Linjuan, et al.
Published: (2025)
DPO-Shift: Shifting the Distribution of Direct Preference Optimization
by: Yang, Xiliang, et al.
Published: (2025)
by: Yang, Xiliang, et al.
Published: (2025)
Ambiguity Awareness Optimization: Towards Semantic Disambiguation for Direct Preference Optimization
by: Li, Jian, et al.
Published: (2025)
by: Li, Jian, et al.
Published: (2025)
Does Synthetic Data Help Named Entity Recognition for Low-Resource Languages?
by: Kamath, Gaurav, et al.
Published: (2025)
by: Kamath, Gaurav, et al.
Published: (2025)
Understanding Reference Policies in Direct Preference Optimization
by: Liu, Yixin, et al.
Published: (2024)
by: Liu, Yixin, et al.
Published: (2024)
Accelerating Direct Preference Optimization with Prefix Sharing
by: Wang, Franklin, et al.
Published: (2024)
by: Wang, Franklin, et al.
Published: (2024)
Entropy Controllable Direct Preference Optimization
by: Omura, Motoki, et al.
Published: (2024)
by: Omura, Motoki, et al.
Published: (2024)
Orthogonal Finetuning for Direct Preference Optimization
by: Yang, Chenxu, et al.
Published: (2024)
by: Yang, Chenxu, et al.
Published: (2024)
System Message Generation for User Preferences using Open-Source Models
by: Jeong, Minbyul, et al.
Published: (2025)
by: Jeong, Minbyul, et al.
Published: (2025)
Direct Preference Knowledge Distillation for Large Language Models
by: Li, Yixing, et al.
Published: (2024)
by: Li, Yixing, et al.
Published: (2024)
Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
by: Jiang, Yuxin, et al.
Published: (2024)
by: Jiang, Yuxin, et al.
Published: (2024)
Backtranslation Augmented Direct Preference Optimization for Neural Machine Translation
by: Ghassabi, Mehrdad, et al.
Published: (2026)
by: Ghassabi, Mehrdad, et al.
Published: (2026)
DGPO: Beyond Pairwise Preferences with Directional Consistent Groupwise Optimization
by: Deng, Mengyi, et al.
Published: (2026)
by: Deng, Mengyi, et al.
Published: (2026)
Uncertainty-Aware Exploratory Direct Preference Optimization for Multimodal Large Language Models
by: Zhang, Huatian, et al.
Published: (2026)
by: Zhang, Huatian, et al.
Published: (2026)
Fine Tuning Large Language Models for Medicine: The Role and Importance of Direct Preference Optimization
by: Savage, Thomas, et al.
Published: (2024)
by: Savage, Thomas, et al.
Published: (2024)
FedPDPO: Federated Personalized Direct Preference Optimization for Large Language Model Alignment
by: Zhu, Kewen, et al.
Published: (2026)
by: Zhu, Kewen, et al.
Published: (2026)
Difficulty-Controllable Multiple-Choice Question Generation Using Large Language Models and Direct Preference Optimization
by: Tomikawa, Yuto, et al.
Published: (2025)
by: Tomikawa, Yuto, et al.
Published: (2025)
The Crucial Role of Samplers in Online Direct Preference Optimization
by: Shi, Ruizhe, et al.
Published: (2024)
by: Shi, Ruizhe, et al.
Published: (2024)
Similar Items
-
Deception in LLMs: Self-Preservation and Autonomous Goals in Large Language Models
by: Barkur, Sudarshan Kamath, et al.
Published: (2025) -
Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations
by: Donisch, Leo, et al.
Published: (2024) -
Cross-lingual Human-Preference Alignment for Neural Machine Translation with Direct Quality Optimization
by: Uhlig, Kaden, et al.
Published: (2024) -
A Review of Common Online Speaker Diarization Methods
by: Aperdannier, Roman, et al.
Published: (2024) -
Systematic Evaluation of Online Speaker Diarization Systems Regarding their Latency
by: Aperdannier, Roman, et al.
Published: (2024)