Saved in:
| Main Authors: | Efimov, Timofey, Dong, Harry, Shah, Megna, Simmons, Jeff, Donegan, Sean, Chi, Yuejie |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.05143 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Multimodal Diffusion to Mutually Enhance Polarized Light and Low Resolution EBSD Data
by: Dong, Harry, et al.
Published: (2026)
by: Dong, Harry, et al.
Published: (2026)
Cross-Modal Guidance for Fast Diffusion-Based Computed Tomography
by: Efimov, Timofey, et al.
Published: (2026)
by: Efimov, Timofey, et al.
Published: (2026)
Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction
by: Xu, Xingyu, et al.
Published: (2024)
by: Xu, Xingyu, et al.
Published: (2024)
Distributed Image Compression with Multimodal Side Information at Extremely Low Bitrates
by: Xu, Guojun, et al.
Published: (2026)
by: Xu, Guojun, et al.
Published: (2026)
MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation
by: Shah, Ansh, et al.
Published: (2024)
by: Shah, Ansh, et al.
Published: (2024)
Eyes on the Streets: Leveraging Street-Level Imaging to Model Urban Crime Dynamics
by: Qi, Zhixuan, et al.
Published: (2024)
by: Qi, Zhixuan, et al.
Published: (2024)
Leveraging Diffusion Models for Stylization using Multiple Style Images
by: Ruta, Dan, et al.
Published: (2025)
by: Ruta, Dan, et al.
Published: (2025)
DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
by: Shen, Sitian, et al.
Published: (2023)
by: Shen, Sitian, et al.
Published: (2023)
FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators
by: Wang, Haiping, et al.
Published: (2023)
by: Wang, Haiping, et al.
Published: (2023)
Leveraging Text-to-Image Diffusion Models for Unsupervised Visual Object Tracking
by: Zhang, Zhengbo, et al.
Published: (2026)
by: Zhang, Zhengbo, et al.
Published: (2026)
Adaptive Stochastic Coefficients for Accelerating Diffusion Sampling
by: Wang, Ruoyu, et al.
Published: (2025)
by: Wang, Ruoyu, et al.
Published: (2025)
Glance: Accelerating Diffusion Models with 1 Sample
by: Dong, Zhuobai, et al.
Published: (2025)
by: Dong, Zhuobai, et al.
Published: (2025)
Disciplined Diffusion: Text-to-Image Diffusion Model against NSFW Generation
by: Zhang, Chi, et al.
Published: (2026)
by: Zhang, Chi, et al.
Published: (2026)
Leveraging Diffusion Model and Image Foundation Model for Improved Correspondence Matching in Coronary Angiography
by: Zhao, Lin, et al.
Published: (2025)
by: Zhao, Lin, et al.
Published: (2025)
SPIRiT-Diffusion: Self-Consistency Driven Diffusion Model for Accelerated MRI
by: Cui, Zhuo-Xu, et al.
Published: (2023)
by: Cui, Zhuo-Xu, et al.
Published: (2023)
DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion
by: Shen, Liao, et al.
Published: (2024)
by: Shen, Liao, et al.
Published: (2024)
Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
by: Meng, Chutian, et al.
Published: (2024)
by: Meng, Chutian, et al.
Published: (2024)
TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning
by: Xu, Junzhe, et al.
Published: (2025)
by: Xu, Junzhe, et al.
Published: (2025)
E-MMDiT: Revisiting Multimodal Diffusion Transformer Design for Fast Image Synthesis under Limited Resources
by: Shen, Tong, et al.
Published: (2025)
by: Shen, Tong, et al.
Published: (2025)
A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models
by: Shuai, Xincheng, et al.
Published: (2024)
by: Shuai, Xincheng, et al.
Published: (2024)
Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing
by: Baldrati, Alberto, et al.
Published: (2024)
by: Baldrati, Alberto, et al.
Published: (2024)
How Many Images Does It Take? Estimating Imitation Thresholds in Text-to-Image Models
by: Verma, Sahil, et al.
Published: (2024)
by: Verma, Sahil, et al.
Published: (2024)
Leveraging Large Language Models for Multimodal Search
by: Barbany, Oriol, et al.
Published: (2024)
by: Barbany, Oriol, et al.
Published: (2024)
Cancer-Net PCa-MultiSeg: Multimodal Enhancement of Prostate Cancer Lesion Segmentation Using Synthetic Correlated Diffusion Imaging
by: Dewbury, Jarett, et al.
Published: (2025)
by: Dewbury, Jarett, et al.
Published: (2025)
ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
by: Srivastava, Ashutosh, et al.
Published: (2024)
by: Srivastava, Ashutosh, et al.
Published: (2024)
Leveraging the Powerful Attention of a Pre-trained Diffusion Model for Exemplar-based Image Colorization
by: Kosugi, Satoshi
Published: (2025)
by: Kosugi, Satoshi
Published: (2025)
Tag2Text: Guiding Vision-Language Model via Image Tagging
by: Huang, Xinyu, et al.
Published: (2023)
by: Huang, Xinyu, et al.
Published: (2023)
Inference-Time Search Using Side Information for Diffusion-Based Image Reconstruction
by: Farahbakhsh, Mahdi, et al.
Published: (2025)
by: Farahbakhsh, Mahdi, et al.
Published: (2025)
Leveraging Pre-Trained Visual Models for AI-Generated Video Detection
by: Veeramachaneni, Keerthi, et al.
Published: (2025)
by: Veeramachaneni, Keerthi, et al.
Published: (2025)
Enhancing Image Aesthetics with Dual-Conditioned Diffusion Models Guided by Multimodal Perception
by: Nan, Xinyu, et al.
Published: (2026)
by: Nan, Xinyu, et al.
Published: (2026)
H2OFlow: Grounding Human-Object Affordances with 3D Generative Models and Dense Diffused Flows
by: Zhang, Harry, et al.
Published: (2025)
by: Zhang, Harry, et al.
Published: (2025)
Paragraph-to-Image Generation with Information-Enriched Diffusion Model
by: Wu, Weijia, et al.
Published: (2023)
by: Wu, Weijia, et al.
Published: (2023)
Leveraging Multimodal Large Language Models for All-in-One Image Restoration via a Mixture of Frequency Experts
by: Lee, Eunho, et al.
Published: (2026)
by: Lee, Eunho, et al.
Published: (2026)
Leveraging Generic Foundation Models for Multimodal Surgical Data Analysis
by: Pezold, Simon, et al.
Published: (2025)
by: Pezold, Simon, et al.
Published: (2025)
Leveraging Foundation Models for Multimodal Graph-Based Action Recognition
by: Ziaeetabar, Fatemeh, et al.
Published: (2025)
by: Ziaeetabar, Fatemeh, et al.
Published: (2025)
Leveraging Prior Knowledge of Diffusion Model for Person Search
by: Kim, Giyeol, et al.
Published: (2025)
by: Kim, Giyeol, et al.
Published: (2025)
Semantic Image Synthesis via Diffusion Models
by: Zhou, Wengang, et al.
Published: (2022)
by: Zhou, Wengang, et al.
Published: (2022)
DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models
by: Wang, Zhendong, et al.
Published: (2025)
by: Wang, Zhendong, et al.
Published: (2025)
Side Effects of Erasing Concepts from Diffusion Models
by: Saha, Shaswati, et al.
Published: (2025)
by: Saha, Shaswati, et al.
Published: (2025)
Image-to-Brain Signal Generation for Visual Prosthesis with CLIP Guided Multimodal Diffusion Models
by: Xu, Ganxi, et al.
Published: (2025)
by: Xu, Ganxi, et al.
Published: (2025)
Similar Items
-
Multimodal Diffusion to Mutually Enhance Polarized Light and Low Resolution EBSD Data
by: Dong, Harry, et al.
Published: (2026) -
Cross-Modal Guidance for Fast Diffusion-Based Computed Tomography
by: Efimov, Timofey, et al.
Published: (2026) -
Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction
by: Xu, Xingyu, et al.
Published: (2024) -
Distributed Image Compression with Multimodal Side Information at Extremely Low Bitrates
by: Xu, Guojun, et al.
Published: (2026) -
MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation
by: Shah, Ansh, et al.
Published: (2024)