:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chen, Yan-Ting, Chen, Hao-Wei, Hsiao, Tsu-Ching, Lee, Chun-Yi
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2605.19865
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Confronting Ambiguity in 6D Object Pose Estimation via Score-Based Diffusion on SE(3)
by: Hsiao, Tsu-Ching, et al.
Published: (2023)

Precise Pick-and-Place using Score-Based Diffusion Networks
by: Guo, Shih-Wei, et al.
Published: (2024)

MENTOR: Multilingual tExt detectioN TOward leaRning by analogy
by: Lin, Hsin-Ju, et al.
Published: (2024)

Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models
by: Hsueh, Hao-Chien, et al.
Published: (2025)

Virtual Guidance as a Mid-level Representation for Navigation with Augmented Reality
by: Yang, Hsuan-Kung, et al.
Published: (2023)

Boosting Diffusion Guidance via Learning Degradation-Aware Models for Blind Super Resolution
by: Lu, Shao-Hao, et al.
Published: (2025)

Zero-shot Adaptation of Stable Diffusion via Plug-in Hierarchical Degradation Representation for Real-World Super-Resolution
by: Liao, Yi-Cheng, et al.
Published: (2025)

DynFaceRestore: Balancing Fidelity and Quality in Diffusion-Guided Blind Face Restoration with Dynamic Blur-Level Mapping and Guidance
by: Do, Huu-Phu, et al.
Published: (2025)

DreamJourney: Perpetual View Generation with Video Diffusion Models
by: Pan, Bo, et al.
Published: (2025)

Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts
by: Chin, Zhi-Yi, et al.
Published: (2023)

Arbitrary-Resolution and Arbitrary-Scale Face Super-Resolution with Implicit Representation Networks
by: Tsai, Yi Ting, et al.
Published: (2025)

Task-Oriented Human Grasp Synthesis via Context- and Task-Aware Diffusers
by: Liu, An-Lun, et al.
Published: (2025)

A Geometric Perspective on Diffusion Models
by: Chen, Defang, et al.
Published: (2023)

UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
by: Fu, Tsu-Jui, et al.
Published: (2025)

Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification
by: Li, Shuhan, et al.
Published: (2024)

ExReg: Wide-range Photo Exposure Correction via a Multi-dimensional Regressor with Attention
by: Do, Huu-Phu, et al.
Published: (2022)

Plug-and-Play Diffusion Distillation
by: Hsiao, Yi-Ting, et al.
Published: (2024)

Aerial View River Landform Video segmentation: A Weakly Supervised Context-aware Temporal Consistency Distillation Approach
by: Chen, Chi-Han, et al.
Published: (2025)

Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion
by: Guizilini, Vitor, et al.
Published: (2025)

Human-Free Automated Prompting for Vision-Language Anomaly Detection: Prompt Optimization with Meta-guiding Prompt Scheme
by: Chen, Pi-Wei, et al.
Published: (2024)

AutoSketch: VLM-assisted Style-Aware Vector Sketch Completion
by: Chin, Hsiao-Yuan, et al.
Published: (2025)

RC-AutoCalib: An End-to-End Radar-Camera Automatic Calibration Network
by: Luu, Van-Tin, et al.
Published: (2025)

HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior
by: Tsao, Li-Yuan, et al.
Published: (2024)

Beyond 3D VQAs: Injecting 3D Spatial Priors into Vision-Language Models for Enhanced Geometric Reasoning
by: Yeh, Chun-Hsiao, et al.
Published: (2026)

TARA: Token-Aware LoRA for Composable Personalization in Diffusion Models
by: Peng, Yuqi, et al.
Published: (2025)

Sortblock: Similarity-Aware Feature Reuse for Diffusion Model
by: Chen, Hanqi, et al.
Published: (2025)

Debiasing Diffusion Model: Enhancing Fairness through Latent Representation Learning in Stable Diffusion Model
by: Huang, Lin-Chun, et al.
Published: (2025)

Scale-Aware UAV-to-Satellite Cross-View Geo-Localization: A Semantic Geometric Approach
by: Ye, Yibin, et al.
Published: (2026)

DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation
by: Chen, Chen, et al.
Published: (2025)

Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
by: Yeh, Chun-Hsiao, et al.
Published: (2025)

BLEnD-Vis: Benchmarking Multimodal Cultural Understanding in Vision Language Models
by: Tan, Bryan Chen Zhengyu, et al.
Published: (2025)

DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
by: Yeh, Chang-Han, et al.
Published: (2024)

Parallel Sampling of Diffusion Models on $SO(3)$
by: Chen, Yan-Ting, et al.
Published: (2025)

Reliev3R: Relieving Feed-forward Reconstruction from Multi-View Geometric Annotations
by: Chen, Youyu, et al.
Published: (2026)

Top2Ground: A Height-Aware Dual Conditioning Diffusion Model for Robust Aerial-to-Ground View Generation
by: Lee, Jae Joong, et al.
Published: (2025)

EAMamba: Efficient All-Around Vision State Space Model for Image Restoration
by: Lin, Yu-Cheng, et al.
Published: (2025)

Lightweight and Accurate Multi-View Stereo with Confidence-Aware Diffusion Model
by: Wang, Fangjinhua, et al.
Published: (2025)

DiSR-NeRF: Diffusion-Guided View-Consistent Super-Resolution NeRF
by: Lee, Jie Long, et al.
Published: (2024)

Taming Outlier Tokens in Diffusion Transformers
by: Wu, Xiaoyu, et al.
Published: (2026)

ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models
by: Shih, Meng-Li, et al.
Published: (2024)