| Main Authors: | Chen, Yan-Ting, Chen, Hao-Wei, Hsiao, Tsu-Ching, Lee, Chun-Yi |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.19865 |
Similar Items
Confronting Ambiguity in 6D Object Pose Estimation via Score-Based Diffusion on SE(3)
by: Hsiao, Tsu-Ching, et al.
Published: (2023)
Precise Pick-and-Place using Score-Based Diffusion Networks
by: Guo, Shih-Wei, et al.
Published: (2024)
MENTOR: Multilingual tExt detectioN TOward leaRning by analogy
by: Lin, Hsin-Ju, et al.
Published: (2024)
Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models
by: Hsueh, Hao-Chien, et al.
Published: (2025)
Virtual Guidance as a Mid-level Representation for Navigation with Augmented Reality
by: Yang, Hsuan-Kung, et al.
Published: (2023)
Boosting Diffusion Guidance via Learning Degradation-Aware Models for Blind Super Resolution
by: Lu, Shao-Hao, et al.
Published: (2025)
Zero-shot Adaptation of Stable Diffusion via Plug-in Hierarchical Degradation Representation for Real-World Super-Resolution
by: Liao, Yi-Cheng, et al.
Published: (2025)
DynFaceRestore: Balancing Fidelity and Quality in Diffusion-Guided Blind Face Restoration with Dynamic Blur-Level Mapping and Guidance
by: Do, Huu-Phu, et al.
Published: (2025)
DreamJourney: Perpetual View Generation with Video Diffusion Models
by: Pan, Bo, et al.
Published: (2025)
Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts
by: Chin, Zhi-Yi, et al.
Published: (2023)
Arbitrary-Resolution and Arbitrary-Scale Face Super-Resolution with Implicit Representation Networks
by: Tsai, Yi Ting, et al.
Published: (2025)
Task-Oriented Human Grasp Synthesis via Context- and Task-Aware Diffusers
by: Liu, An-Lun, et al.
Published: (2025)
A Geometric Perspective on Diffusion Models
by: Chen, Defang, et al.
Published: (2023)
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
by: Fu, Tsu-Jui, et al.
Published: (2025)
Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification
by: Li, Shuhan, et al.
Published: (2024)
ExReg: Wide-range Photo Exposure Correction via a Multi-dimensional Regressor with Attention
by: Do, Huu-Phu, et al.
Published: (2022)
Plug-and-Play Diffusion Distillation
by: Hsiao, Yi-Ting, et al.
Published: (2024)
Aerial View River Landform Video segmentation: A Weakly Supervised Context-aware Temporal Consistency Distillation Approach
by: Chen, Chi-Han, et al.
Published: (2025)
Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion
by: Guizilini, Vitor, et al.
Published: (2025)
Human-Free Automated Prompting for Vision-Language Anomaly Detection: Prompt Optimization with Meta-guiding Prompt Scheme
by: Chen, Pi-Wei, et al.
Published: (2024)
AutoSketch: VLM-assisted Style-Aware Vector Sketch Completion
by: Chin, Hsiao-Yuan, et al.
Published: (2025)
RC-AutoCalib: An End-to-End Radar-Camera Automatic Calibration Network
by: Luu, Van-Tin, et al.
Published: (2025)
HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior
by: Tsao, Li-Yuan, et al.
Published: (2024)
Beyond 3D VQAs: Injecting 3D Spatial Priors into Vision-Language Models for Enhanced Geometric Reasoning
by: Yeh, Chun-Hsiao, et al.
Published: (2026)
TARA: Token-Aware LoRA for Composable Personalization in Diffusion Models
by: Peng, Yuqi, et al.
Published: (2025)
Sortblock: Similarity-Aware Feature Reuse for Diffusion Model
by: Chen, Hanqi, et al.
Published: (2025)
Debiasing Diffusion Model: Enhancing Fairness through Latent Representation Learning in Stable Diffusion Model
by: Huang, Lin-Chun, et al.
Published: (2025)
Scale-Aware UAV-to-Satellite Cross-View Geo-Localization: A Semantic Geometric Approach
by: Ye, Yibin, et al.
Published: (2026)
DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation
by: Chen, Chen, et al.
Published: (2025)
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
by: Yeh, Chun-Hsiao, et al.
Published: (2025)
BLEnD-Vis: Benchmarking Multimodal Cultural Understanding in Vision Language Models
by: Tan, Bryan Chen Zhengyu, et al.
Published: (2025)
DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
by: Yeh, Chang-Han, et al.
Published: (2024)
Parallel Sampling of Diffusion Models on $SO(3)$
by: Chen, Yan-Ting, et al.
Published: (2025)
Reliev3R: Relieving Feed-forward Reconstruction from Multi-View Geometric Annotations
by: Chen, Youyu, et al.
Published: (2026)
Top2Ground: A Height-Aware Dual Conditioning Diffusion Model for Robust Aerial-to-Ground View Generation
by: Lee, Jae Joong, et al.
Published: (2025)
EAMamba: Efficient All-Around Vision State Space Model for Image Restoration
by: Lin, Yu-Cheng, et al.
Published: (2025)
Lightweight and Accurate Multi-View Stereo with Confidence-Aware Diffusion Model
by: Wang, Fangjinhua, et al.
Published: (2025)
DiSR-NeRF: Diffusion-Guided View-Consistent Super-Resolution NeRF
by: Lee, Jie Long, et al.
Published: (2024)
Taming Outlier Tokens in Diffusion Transformers
by: Wu, Xiaoyu, et al.
Published: (2026)
ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models
by: Shih, Meng-Li, et al.
Published: (2024)