Saved in:
| Main Authors: | Wang, Shanyong, Lin, Shuhang, Zhao, Yining, Zhu, Xi, Zhang, Yongfeng |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.12479 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Cache Mechanism for Agent RAG Systems
by: Lin, Shuhang, et al.
Published: (2025)
by: Lin, Shuhang, et al.
Published: (2025)
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting
by: Diao, Muxi, et al.
Published: (2026)
by: Diao, Muxi, et al.
Published: (2026)
OmniRouter: Budget and Performance Controllable Multi-LLM Routing
by: Mei, Kai, et al.
Published: (2025)
by: Mei, Kai, et al.
Published: (2025)
LRHP: Learning Representations for Human Preferences via Preference Pairs
by: Wang, Chenglong, et al.
Published: (2024)
by: Wang, Chenglong, et al.
Published: (2024)
ReaGAN: Node-as-Agent-Reasoning Graph Agentic Network
by: Guo, Minghao, et al.
Published: (2025)
by: Guo, Minghao, et al.
Published: (2025)
TruthfulRAG: Resolving Factual-level Conflicts in Retrieval-Augmented Generation with Knowledge Graphs
by: Liu, Shuyi, et al.
Published: (2025)
by: Liu, Shuyi, et al.
Published: (2025)
UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models
by: Zhang, Xueyan, et al.
Published: (2025)
by: Zhang, Xueyan, et al.
Published: (2025)
MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment
by: Wang, Tianze, et al.
Published: (2025)
by: Wang, Tianze, et al.
Published: (2025)
MVPBench: A Benchmark and Fine-Tuning Framework for Aligning Large Language Models with Diverse Human Values
by: Liang, Yao, et al.
Published: (2025)
by: Liang, Yao, et al.
Published: (2025)
Triplets Better Than Pairs: Towards Stable and Effective Self-Play Fine-Tuning for LLMs
by: Wang, Yibo, et al.
Published: (2026)
by: Wang, Yibo, et al.
Published: (2026)
Resolving Knowledge Conflicts in Large Language Models
by: Wang, Yike, et al.
Published: (2023)
by: Wang, Yike, et al.
Published: (2023)
DEFT: Distribution-guided Efficient Fine-Tuning for Human Alignment
by: Zhu, Liang, et al.
Published: (2026)
by: Zhu, Liang, et al.
Published: (2026)
ConflictRAG: Detecting and Resolving Knowledge Conflicts in Retrieval Augmented Generation
by: Wang, Chenyu, et al.
Published: (2026)
by: Wang, Chenyu, et al.
Published: (2026)
SOLAR: Towards Characterizing Subjectivity of Individuals through Modeling Value Conflicts and Trade-offs
by: Lee, Younghun, et al.
Published: (2025)
by: Lee, Younghun, et al.
Published: (2025)
Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment
by: Zhang, Jianfei, et al.
Published: (2024)
by: Zhang, Jianfei, et al.
Published: (2024)
$λ$-GRPO: Unifying the GRPO Frameworks with Learnable Token Preferences
by: Wang, Yining, et al.
Published: (2025)
by: Wang, Yining, et al.
Published: (2025)
Reinforcement Fine-Tuning Naturally Mitigates Forgetting in Continual Post-Training
by: Lai, Song, et al.
Published: (2025)
by: Lai, Song, et al.
Published: (2025)
LiteCUA: Computer as MCP Server for Computer-Use Agent on AIOS
by: Mei, Kai, et al.
Published: (2025)
by: Mei, Kai, et al.
Published: (2025)
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback
by: Gao, Songyang, et al.
Published: (2024)
by: Gao, Songyang, et al.
Published: (2024)
Align Generative Artificial Intelligence with Human Preferences: A Novel Large Language Model Fine-Tuning Method for Online Review Management
by: Wang, Yanan, et al.
Published: (2026)
by: Wang, Yanan, et al.
Published: (2026)
Transitivity Meets Cyclicity: Explicit Preference Decomposition for Dynamic Large Language Model Alignment
by: Huang, Yucong, et al.
Published: (2026)
by: Huang, Yucong, et al.
Published: (2026)
LoRA-PAR: A Flexible Dual-System LoRA Partitioning Approach to Efficient LLM Fine-Tuning
by: Huang, Yining, et al.
Published: (2025)
by: Huang, Yining, et al.
Published: (2025)
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values
by: P Team, et al.
Published: (2025)
by: P Team, et al.
Published: (2025)
SoRFT: Issue Resolving with Subtask-oriented Reinforced Fine-Tuning
by: Ma, Zexiong, et al.
Published: (2025)
by: Ma, Zexiong, et al.
Published: (2025)
Fine-Tuning LLMs with Fine-Grained Human Feedback on Text Spans
by: CH-Wang, Sky, et al.
Published: (2025)
by: CH-Wang, Sky, et al.
Published: (2025)
ValueSim: Generating Backstories to Model Individual Value Systems
by: Du, Bangde, et al.
Published: (2025)
by: Du, Bangde, et al.
Published: (2025)
AlignSum: Data Pyramid Hierarchical Fine-tuning for Aligning with Human Summarization Preference
by: Han, Yang, et al.
Published: (2024)
by: Han, Yang, et al.
Published: (2024)
BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis
by: Lin, Shuhang, et al.
Published: (2024)
by: Lin, Shuhang, et al.
Published: (2024)
TuCo: Measuring the Contribution of Fine-Tuning to Individual Responses of LLMs
by: Nuti, Felipe, et al.
Published: (2025)
by: Nuti, Felipe, et al.
Published: (2025)
Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities
by: Hua, Wenyue, et al.
Published: (2024)
by: Hua, Wenyue, et al.
Published: (2024)
Beyond Single-Reward: Multi-Pair, Multi-Perspective Preference Optimization for Machine Translation
by: Wang, Hao, et al.
Published: (2025)
by: Wang, Hao, et al.
Published: (2025)
ConflictBench: Evaluating Human-AI Conflict via Interactive and Visually Grounded Environments
by: Zhao, Weixiang, et al.
Published: (2026)
by: Zhao, Weixiang, et al.
Published: (2026)
Generalization to Political Beliefs from Fine-Tuning on Sports Team Preferences
by: Terry, Owen
Published: (2026)
by: Terry, Owen
Published: (2026)
Retrieval-Augmented Fine-Tuning With Preference Optimization For Visual Program Generation
by: Kang, Deokhyung, et al.
Published: (2025)
by: Kang, Deokhyung, et al.
Published: (2025)
Preference-Oriented Supervised Fine-Tuning: Favoring Target Model Over Aligned Large Language Models
by: Fan, Yuchen, et al.
Published: (2024)
by: Fan, Yuchen, et al.
Published: (2024)
Can Language Models Reason about Individualistic Human Values and Preferences?
by: Jiang, Liwei, et al.
Published: (2024)
by: Jiang, Liwei, et al.
Published: (2024)
PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs
by: Zhang, Rongzhi, et al.
Published: (2024)
by: Zhang, Rongzhi, et al.
Published: (2024)
Parameter-Efficient Fine-Tuning of Large Language Models via Deconvolution in Subspace
by: Zhang, Jia-Chen, et al.
Published: (2025)
by: Zhang, Jia-Chen, et al.
Published: (2025)
LoRA$^2$ : Multi-Scale Low-Rank Approximations for Fine-Tuning Large Language Models
by: Zhang, Jia-Chen, et al.
Published: (2024)
by: Zhang, Jia-Chen, et al.
Published: (2024)
Dynamics of Instruction Fine-Tuning for Chinese Large Language Models
by: Song, Chiyu, et al.
Published: (2023)
by: Song, Chiyu, et al.
Published: (2023)
Similar Items
-
Cache Mechanism for Agent RAG Systems
by: Lin, Shuhang, et al.
Published: (2025) -
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting
by: Diao, Muxi, et al.
Published: (2026) -
OmniRouter: Budget and Performance Controllable Multi-LLM Routing
by: Mei, Kai, et al.
Published: (2025) -
LRHP: Learning Representations for Human Preferences via Preference Pairs
by: Wang, Chenglong, et al.
Published: (2024) -
ReaGAN: Node-as-Agent-Reasoning Graph Agentic Network
by: Guo, Minghao, et al.
Published: (2025)