Saved in:
| Main Authors: | Wang, Jianing, Zhou, Yang, Zhang, Xiaocheng, Bao, Mengjiao, Yan, Peng |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.11212 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
TrendFact: A Benchmark for Explainable Hotspot Perception in Fact-Checking with Natural Language Explanation
by: Zhang, Xiaocheng, et al.
Published: (2024)
by: Zhang, Xiaocheng, et al.
Published: (2024)
Self-Steering Optimization: Autonomous Preference Optimization for Large Language Models
by: Xiang, Hao, et al.
Published: (2024)
by: Xiang, Hao, et al.
Published: (2024)
InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models
by: Gu, Yanggan, et al.
Published: (2025)
by: Gu, Yanggan, et al.
Published: (2025)
Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages
by: Zhang, Yuanchi, et al.
Published: (2024)
by: Zhang, Yuanchi, et al.
Published: (2024)
Advancing Large Language Model Attribution through Self-Improving
by: Huang, Lei, et al.
Published: (2024)
by: Huang, Lei, et al.
Published: (2024)
Self-supervised Preference Optimization: Enhance Your Language Model with Preference Degree Awareness
by: Li, Jian, et al.
Published: (2024)
by: Li, Jian, et al.
Published: (2024)
Uncertainty-Aware Exploratory Direct Preference Optimization for Multimodal Large Language Models
by: Zhang, Huatian, et al.
Published: (2026)
by: Zhang, Huatian, et al.
Published: (2026)
Prejudge-Before-Think: Enhancing Large Language Models at Test-Time by Process Prejudge Reasoning
by: Wang, Jianing, et al.
Published: (2025)
by: Wang, Jianing, et al.
Published: (2025)
Topology-Enhanced Alignment for Large Language Models: Trajectory Topology Loss and Topological Preference Optimization
by: Pan, Yurui, et al.
Published: (2026)
by: Pan, Yurui, et al.
Published: (2026)
Dynamic Noise Preference Optimization: Self-Improvement of Large Language Models with Self-Synthetic Data
by: Yang, Haoyan, et al.
Published: (2025)
by: Yang, Haoyan, et al.
Published: (2025)
Large Language Model for Multi-objective Evolutionary Optimization
by: Liu, Fei, et al.
Published: (2023)
by: Liu, Fei, et al.
Published: (2023)
InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment
by: Wang, Jianing, et al.
Published: (2024)
by: Wang, Jianing, et al.
Published: (2024)
Weights-Rotated Preference Optimization for Large Language Models
by: Yang, Chenxu, et al.
Published: (2025)
by: Yang, Chenxu, et al.
Published: (2025)
Uncertainty is Fragile: Manipulating Uncertainty in Large Language Models
by: Zeng, Qingcheng, et al.
Published: (2024)
by: Zeng, Qingcheng, et al.
Published: (2024)
Enhancing Non-English Capabilities of English-Centric Large Language Models through Deep Supervision Fine-Tuning
by: Huo, Wenshuai, et al.
Published: (2025)
by: Huo, Wenshuai, et al.
Published: (2025)
Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment
by: Yin, Yueqin, et al.
Published: (2024)
by: Yin, Yueqin, et al.
Published: (2024)
LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
by: Chen, Guanzheng, et al.
Published: (2025)
by: Chen, Guanzheng, et al.
Published: (2025)
Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling
by: Hou, Bairu, et al.
Published: (2023)
by: Hou, Bairu, et al.
Published: (2023)
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
by: Wang, Weiyun, et al.
Published: (2024)
by: Wang, Weiyun, et al.
Published: (2024)
Aligning Large Language Models with Human Preferences through Representation Engineering
by: Liu, Wenhao, et al.
Published: (2023)
by: Liu, Wenhao, et al.
Published: (2023)
Users as Annotators: LLM Preference Learning from Comparison Mode
by: Cai, Zhongze, et al.
Published: (2025)
by: Cai, Zhongze, et al.
Published: (2025)
Enhancing Multilingual Counterfactual Generation through Alignment-as-Preference Optimization
by: Wang, Yilong, et al.
Published: (2026)
by: Wang, Yilong, et al.
Published: (2026)
Self-Play Preference Optimization for Language Model Alignment
by: Wu, Yue, et al.
Published: (2024)
by: Wu, Yue, et al.
Published: (2024)
Preference-Oriented Supervised Fine-Tuning: Favoring Target Model Over Aligned Large Language Models
by: Fan, Yuchen, et al.
Published: (2024)
by: Fan, Yuchen, et al.
Published: (2024)
Aligning Large Language Models with Searcher Preferences
by: Wu, Wei, et al.
Published: (2026)
by: Wu, Wei, et al.
Published: (2026)
Preference Packing: Efficient Preference Optimization for Large Language Models
by: Cho, Jaekyung
Published: (2026)
by: Cho, Jaekyung
Published: (2026)
GENUINE: Graph Enhanced Multi-level Uncertainty Estimation for Large Language Models
by: Wang, Tuo, et al.
Published: (2025)
by: Wang, Tuo, et al.
Published: (2025)
HIPPO: Enhancing the Table Understanding Capability of LLMs through Hybrid-Modal Preference Optimization
by: Wang, Haolan, et al.
Published: (2025)
by: Wang, Haolan, et al.
Published: (2025)
Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization
by: Pi, Renjie, et al.
Published: (2024)
by: Pi, Renjie, et al.
Published: (2024)
mDPO: Conditional Preference Optimization for Multimodal Large Language Models
by: Wang, Fei, et al.
Published: (2024)
by: Wang, Fei, et al.
Published: (2024)
Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?
by: Jiang, Jin, et al.
Published: (2025)
by: Jiang, Jin, et al.
Published: (2025)
DiffPO: Diffusion-styled Preference Optimization for Efficient Inference-Time Alignment of Large Language Models
by: Chen, Ruizhe, et al.
Published: (2025)
by: Chen, Ruizhe, et al.
Published: (2025)
Self-Boosting Large Language Models with Synthetic Preference Data
by: Dong, Qingxiu, et al.
Published: (2024)
by: Dong, Qingxiu, et al.
Published: (2024)
Reasoning Strategies in Large Language Models: Can They Follow, Prefer, and Optimize?
by: Zhang, Yanjian, et al.
Published: (2025)
by: Zhang, Yanjian, et al.
Published: (2025)
Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration
by: Huang, Yichong, et al.
Published: (2024)
by: Huang, Yichong, et al.
Published: (2024)
Enhancing Knowledge Distillation of Large Language Models through Efficient Multi-Modal Distribution Alignment
by: Peng, Tianyu, et al.
Published: (2024)
by: Peng, Tianyu, et al.
Published: (2024)
Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models
by: Singh, Joykirat, et al.
Published: (2025)
by: Singh, Joykirat, et al.
Published: (2025)
Observations and Remedies for Large Language Model Bias in Self-Consuming Performative Loop
by: Wang, Yaxuan, et al.
Published: (2026)
by: Wang, Yaxuan, et al.
Published: (2026)
METEOR: Evolutionary Journey of Large Language Models from Guidance to Self-Growth
by: Li, Jiawei, et al.
Published: (2024)
by: Li, Jiawei, et al.
Published: (2024)
From Implicit to Explicit: Enhancing Self-Recognition in Large Language Models
by: Zhou, Yinghan, et al.
Published: (2025)
by: Zhou, Yinghan, et al.
Published: (2025)
Similar Items
-
TrendFact: A Benchmark for Explainable Hotspot Perception in Fact-Checking with Natural Language Explanation
by: Zhang, Xiaocheng, et al.
Published: (2024) -
Self-Steering Optimization: Autonomous Preference Optimization for Large Language Models
by: Xiang, Hao, et al.
Published: (2024) -
InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models
by: Gu, Yanggan, et al.
Published: (2025) -
Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages
by: Zhang, Yuanchi, et al.
Published: (2024) -
Advancing Large Language Model Attribution through Self-Improving
by: Huang, Lei, et al.
Published: (2024)