Fu, L., & Xu, Y. (2026). TPMM-DPO: Trajectory-aware Preference-guided Model Merging for Iterative Direct Preference Optimization.
Chicago Style (17th ed.) CitationFu, Lingling, and Yongfu Xu. TPMM-DPO: Trajectory-aware Preference-guided Model Merging for Iterative Direct Preference Optimization. 2026.
MLA (9th ed.) CitationFu, Lingling, and Yongfu Xu. TPMM-DPO: Trajectory-aware Preference-guided Model Merging for Iterative Direct Preference Optimization. 2026.
Warning: These citations may not always be 100% accurate.