:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Efimov, Timofey, Dong, Harry, Shah, Megna, Simmons, Jeff, Donegan, Sean, Chi, Yuejie
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2410.05143
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Multimodal Diffusion to Mutually Enhance Polarized Light and Low Resolution EBSD Data
by: Dong, Harry, et al.
Published: (2026)

Cross-Modal Guidance for Fast Diffusion-Based Computed Tomography
by: Efimov, Timofey, et al.
Published: (2026)

Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction
by: Xu, Xingyu, et al.
Published: (2024)

Distributed Image Compression with Multimodal Side Information at Extremely Low Bitrates
by: Xu, Guojun, et al.
Published: (2026)

MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation
by: Shah, Ansh, et al.
Published: (2024)

Eyes on the Streets: Leveraging Street-Level Imaging to Model Urban Crime Dynamics
by: Qi, Zhixuan, et al.
Published: (2024)

Leveraging Diffusion Models for Stylization using Multiple Style Images
by: Ruta, Dan, et al.
Published: (2025)

DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
by: Shen, Sitian, et al.
Published: (2023)

FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators
by: Wang, Haiping, et al.
Published: (2023)

Leveraging Text-to-Image Diffusion Models for Unsupervised Visual Object Tracking
by: Zhang, Zhengbo, et al.
Published: (2026)

Adaptive Stochastic Coefficients for Accelerating Diffusion Sampling
by: Wang, Ruoyu, et al.
Published: (2025)

Glance: Accelerating Diffusion Models with 1 Sample
by: Dong, Zhuobai, et al.
Published: (2025)

Disciplined Diffusion: Text-to-Image Diffusion Model against NSFW Generation
by: Zhang, Chi, et al.
Published: (2026)

Leveraging Diffusion Model and Image Foundation Model for Improved Correspondence Matching in Coronary Angiography
by: Zhao, Lin, et al.
Published: (2025)

SPIRiT-Diffusion: Self-Consistency Driven Diffusion Model for Accelerated MRI
by: Cui, Zhuo-Xu, et al.
Published: (2023)

DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion
by: Shen, Liao, et al.
Published: (2024)

Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
by: Meng, Chutian, et al.
Published: (2024)

TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning
by: Xu, Junzhe, et al.
Published: (2025)

E-MMDiT: Revisiting Multimodal Diffusion Transformer Design for Fast Image Synthesis under Limited Resources
by: Shen, Tong, et al.
Published: (2025)

A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models
by: Shuai, Xincheng, et al.
Published: (2024)

Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing
by: Baldrati, Alberto, et al.
Published: (2024)

How Many Images Does It Take? Estimating Imitation Thresholds in Text-to-Image Models
by: Verma, Sahil, et al.
Published: (2024)

Leveraging Large Language Models for Multimodal Search
by: Barbany, Oriol, et al.
Published: (2024)

Cancer-Net PCa-MultiSeg: Multimodal Enhancement of Prostate Cancer Lesion Segmentation Using Synthetic Correlated Diffusion Imaging
by: Dewbury, Jarett, et al.
Published: (2025)

ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
by: Srivastava, Ashutosh, et al.
Published: (2024)

Leveraging the Powerful Attention of a Pre-trained Diffusion Model for Exemplar-based Image Colorization
by: Kosugi, Satoshi
Published: (2025)

Tag2Text: Guiding Vision-Language Model via Image Tagging
by: Huang, Xinyu, et al.
Published: (2023)

Inference-Time Search Using Side Information for Diffusion-Based Image Reconstruction
by: Farahbakhsh, Mahdi, et al.
Published: (2025)

Leveraging Pre-Trained Visual Models for AI-Generated Video Detection
by: Veeramachaneni, Keerthi, et al.
Published: (2025)

Enhancing Image Aesthetics with Dual-Conditioned Diffusion Models Guided by Multimodal Perception
by: Nan, Xinyu, et al.
Published: (2026)

H2OFlow: Grounding Human-Object Affordances with 3D Generative Models and Dense Diffused Flows
by: Zhang, Harry, et al.
Published: (2025)

Paragraph-to-Image Generation with Information-Enriched Diffusion Model
by: Wu, Weijia, et al.
Published: (2023)

Leveraging Multimodal Large Language Models for All-in-One Image Restoration via a Mixture of Frequency Experts
by: Lee, Eunho, et al.
Published: (2026)

Leveraging Generic Foundation Models for Multimodal Surgical Data Analysis
by: Pezold, Simon, et al.
Published: (2025)

Leveraging Foundation Models for Multimodal Graph-Based Action Recognition
by: Ziaeetabar, Fatemeh, et al.
Published: (2025)

Leveraging Prior Knowledge of Diffusion Model for Person Search
by: Kim, Giyeol, et al.
Published: (2025)

Semantic Image Synthesis via Diffusion Models
by: Zhou, Wengang, et al.
Published: (2022)

DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models
by: Wang, Zhendong, et al.
Published: (2025)

Side Effects of Erasing Concepts from Diffusion Models
by: Saha, Shaswati, et al.
Published: (2025)

Image-to-Brain Signal Generation for Visual Prosthesis with CLIP Guided Multimodal Diffusion Models
by: Xu, Ganxi, et al.
Published: (2025)