| Main Authors: | Pham, Khiem, Nguyen, Quang, Nguyen, Tung, Zhu, Jingsen, Santacatterina, Michele, Metaxas, Dimitris, Zabih, Ramin |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.06195 |
Similar Items
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection
by: Behrouz, Ali, et al.
Published: (2024)
SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion
by: Nguyen, Trong-Tung, et al.
Published: (2024)
MPDiT: Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model
by: Dao, Quan, et al.
Published: (2026)
Score-Guided Diffusion for 3D Human Recovery
by: Stathopoulos, Anastasis, et al.
Published: (2024)
FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing
by: Nguyen, Trong-Tung, et al.
Published: (2024)
AVC-DPO: Aligned Video Captioning via Direct Preference Optimization
by: Tang, Jiyang, et al.
Published: (2025)
DreamWalk: Style Space Exploration using Diffusion Guidance
by: Shu, Michelle, et al.
Published: (2024)
DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models
by: Wu, Ziyi, et al.
Published: (2025)
Chimera: Effectively Modeling Multivariate Time Series with 2-Dimensional State Space Models
by: Behrouz, Ali, et al.
Published: (2024)
SEE-DPO: Self Entropy Enhanced Direct Preference Optimization
by: Shekhar, Shivanshu, et al.
Published: (2024)
DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization
by: Ayupov, Shamil, et al.
Published: (2025)
SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation
by: Pham, Duc-Hai, et al.
Published: (2024)
PainDiffusion: Learning to Express Pain
by: Dam, Quang Tien, et al.
Published: (2024)
SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Models
by: Nguyen, Hung, et al.
Published: (2024)
SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization
by: Jia, Hongrui, et al.
Published: (2024)
InverFill: One-Step Inversion for Enhanced Few-Step Diffusion Inpainting
by: Vu, Duc, et al.
Published: (2026)
SwiftPie: Lightning-fast Subject-driven Image Personalization via One step Diffusion
by: Duong, Huy, et al.
Published: (2026)
On Inference Stability for Diffusion Models
by: Nguyen, Viet, et al.
Published: (2023)
VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
by: Huang, Haojian, et al.
Published: (2025)
$ϕ$-DPO: Fairness Direct Preference Optimization Approach to Continual Learning in Large Multimodal Models
by: Truong, Thanh-Dat, et al.
Published: (2026)
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
by: Liu, Ziyu, et al.
Published: (2024)
UIT-DarkCow team at ImageCLEFmedical Caption 2024: Diagnostic Captioning for Radiology Images Efficiency with Transformer Models
by: Van Nguyen, Quan, et al.
Published: (2024)
Stable Messenger: Steganography for Message-Concealed Image Generation
by: Nguyen, Quang, et al.
Published: (2023)
Linear-DPO: Linear Direct Preference Optimization for Diffusion and Flow-Matching Generative Models
by: Li, Kesong, et al.
Published: (2026)
EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM
by: Nguyen, Quang, et al.
Published: (2024)
PrefGen: Multimodal Preference Learning for Preference-Conditioned Image Generation
by: Mo, Wenyi, et al.
Published: (2025)
Rethinking Direct Preference Optimization in Diffusion Models
by: Kang, Junyong, et al.
Published: (2025)
HuViDPO: Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
by: Jiang, Lifan, et al.
Published: (2025)
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
by: Zhang, Zhixing, et al.
Published: (2022)
S2H-DPO: Hardness-Aware Preference Optimization for Vision-Language Models
by: Shukla, Nitish, et al.
Published: (2026)
Diffusion-SDPO: Safeguarded Direct Preference Optimization for Diffusion Models
by: Fu, Minghao, et al.
Published: (2025)
ViOCRVQA: Novel Benchmark Dataset and Vision Reader for Visual Question Answering by Understanding Vietnamese Text in Images
by: Pham, Huy Quang, et al.
Published: (2024)
Curriculum-DPO++: Direct Preference Optimization via Data and Model Curricula for Text-to-Image Generation
by: Croitoru, Florinel-Alin, et al.
Published: (2026)
CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models
by: Nguyen, Quang-Binh, et al.
Published: (2025)
Reg-DPO: SFT-Regularized Direct Preference Optimization with GT-Pair for Improving Video Generation
by: Du, Jie, et al.
Published: (2025)
ConPro: Learning Severity Representation for Medical Images using Contrastive Learning and Preference Optimization
by: Nguyen, Hong, et al.
Published: (2024)
Toward Fine-Grained Speech Inpainting Forensics: A Dataset, Method, and Metric for Multi-Region Tampering Localization
by: Vu, Tung, et al.
Published: (2026)
AutoEdit: Automatic Hyperparameter Tuning for Image Editing
by: Pham, Chau, et al.
Published: (2025)
V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization
by: Xie, Yuxi, et al.
Published: (2024)
UMAMI: Unifying Masked Autoregressive Models and Deterministic Rendering for View Synthesis
by: Le, Thanh-Tung, et al.
Published: (2025)