| Main authors: | Zhang, Pengfei; Jia, Shouqing |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2504.19600 |
Similar items
Free Lunch Alignment of Text-to-Image Diffusion Models without Preference Image Pairs
by: Xian, Jia Jun Cheng, et al.
Published: (2025)
An Ordinary Differential Equation Sampler with Stochastic Start for Diffusion Bridge Models
by: Wang, Yuang, et al.
Published: (2024)
An Ensemble Model with Attention Based Mechanism for Image Captioning
by: Badarneh, Israa Al, et al.
Published: (2025)
Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement
by: Yang, Tao, et al.
Published: (2024)
Attention Mechanism based Cognition-level Scene Understanding
by: Tang, Xuejiao, et al.
Published: (2022)
Towards Robust Unsupervised Attention Prediction in Autonomous Driving
by: Qi, Mengshi, et al.
Published: (2025)
Fast Sampling Through The Reuse Of Attention Maps In Diffusion Models
by: Hunter, Rosco, et al.
Published: (2023)
Re-Attentional Controllable Video Diffusion Editing
by: Wang, Yuanzhi, et al.
Published: (2024)
LiteAttention: A Temporal Sparse Attention for Diffusion Transformers
by: Shmilovich, Dor, et al.
Published: (2025)
Local Intrinsic Dimension Unveils Hallucinations in Diffusion Models
by: Sobieski, Bartlomiej, et al.
Published: (2026)
MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention
by: Wang, Yuhan, et al.
Published: (2025)
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
by: Zhu, Lianghui, et al.
Published: (2024)
MotionFlow: Attention-Driven Motion Transfer in Video Diffusion Models
by: Meral, Tuna Han Salih, et al.
Published: (2024)
From Structure to Detail: Hierarchical Distillation for Efficient Diffusion Model
by: Cheng, Hanbo, et al.
Published: (2025)
Latent Diffusion Model without Variational Autoencoder
by: Shi, Minglei, et al.
Published: (2025)
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation
by: Cheng, Hanbo, et al.
Published: (2024)
DraftAttention: Fast Video Diffusion via Low-Resolution Attention Guidance
by: Shen, Xuan, et al.
Published: (2025)
On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models
by: Ifriqi, Tariq Berrada, et al.
Published: (2024)
Attention in Diffusion Model: A Survey
by: Hua, Litao, et al.
Published: (2025)
AID: Attention Interpolation of Text-to-Image Diffusion
by: He, Qiyuan, et al.
Published: (2024)
PrimeComposer: Faster Progressively Combined Diffusion for Image Composition with Attention Steering
by: Wang, Yibin, et al.
Published: (2024)
USAD: End-to-End Human Activity Recognition via Diffusion Model with Spatiotemporal Attention
by: Xiao, Hang, et al.
Published: (2025)
CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention
by: Lin, Gaojie, et al.
Published: (2024)
Fast Autoregressive Video Diffusion and World Models with Temporal Cache Compression and Sparse Attention
by: Samuel, Dvir, et al.
Published: (2026)
Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
by: Zou, Siyu, et al.
Published: (2024)
Optimized Culprit Identification Using Mobilenet and Attention Mechanisms
by: J, Savitha N, et al.
Published: (2026)
VMonarch: Efficient Video Diffusion Transformers with Structured Attention
by: Liang, Cheng, et al.
Published: (2026)
Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model
by: Park, Sangjoon, et al.
Published: (2024)
You Don't Need All That Attention: Surgical Memorization Mitigation in Text-to-Image Diffusion Models
by: Zhao, Kairan, et al.
Published: (2026)
Repairing Catastrophic-Neglect in Text-to-Image Diffusion Models via Attention-Guided Feature Enhancement
by: Chang, Zhiyuan, et al.
Published: (2024)
Adaptive Dual Residual U-Net with Attention Gate and Multiscale Spatial Attention Mechanisms (ADRUwAMS)
by: Suraki, Mohsen Yaghoubi
Published: (2026)
Model Already Knows the Best Noise: Bayesian Active Noise Selection via Attention in Video Diffusion Model
by: Kim, Kwanyoung, et al.
Published: (2025)
Dynamic Attention Mechanism in Spatiotemporal Memory Networks for Object Tracking
by: Zhou, Meng, et al.
Published: (2025)
CreativeSynth: Cross-Art-Attention for Artistic Image Synthesis with Multimodal Diffusion
by: Huang, Nisha, et al.
Published: (2024)
An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models
by: Hu, Zizhao, et al.
Published: (2024)
MKSNet: Advanced Small Object Detection in Remote Sensing Imagery with Multi-Kernel and Dual Attention Mechanisms
by: Zhang, Jiahao, et al.
Published: (2025)
Multi-Scale Generative Modeling with Heat Dissipation Flow Matching
by: Ma, Jun, et al.
Published: (2026)
Controllable Coupled Image Generation via Diffusion Models
by: Yuan, Chenfei, et al.
Published: (2025)
Diffusion Attention Expert Model for Predicting and Semi-automatic Localizing STAS in Lung Cancer Histopathological Images
by: Pan, Liangrui, et al.
Published: (2026)
PersGuard: Preventing Malicious Personalization via Backdoor Attacks on Pre-trained Text-to-Image Diffusion Models
by: Liu, Xinwei, et al.
Published: (2025)