:: Library Catalog

Bibliographic Details
Main Authors:	Dong, ZiYi; Zhou, Chengxing; Deng, Weijian; Wei, Pengxu; Ji, Xiangyang; Lin, Liang
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2504.21292

Similar Items

Delving into Cascaded Instability: A Lipschitz Continuity View on Image Restoration and Object Detection Synergy
by: Zhao, Qing, et al.
Published: (2025)

Language Generation as Optimal Control: Closed-Loop Diffusion in Latent Control Space
by: Dong, ZiYi, et al.
Published: (2026)

When Preference Labels Fall Short: Aligning Diffusion Models from Real Data
by: Chen, Weiyan, et al.
Published: (2026)

Unveiling Perceptual Artifacts: A Fine-Grained Benchmark for Interpretable AI-Generated Image Detection
by: Xiao, Yao, et al.
Published: (2026)

Slot Attention with Re-Initialization and Self-Distillation
by: Zhao, Rongzhen, et al.
Published: (2025)

Attentive Convolution: Unifying the Expressivity of Self-Attention with Convolutional Efficiency
by: Yu, Hao, et al.
Published: (2025)

Decoder-Only LLMs are Better Controllers for Diffusion Models
by: Dong, Ziyi, et al.
Published: (2025)

ELA: Efficient Local Attention for Deep Convolutional Neural Networks
by: Xu, Wei, et al.
Published: (2024)

DreamArtist++: Controllable One-Shot Text-to-Image Generation via Positive-Negative Adapter
by: Dong, Ziyi, et al.
Published: (2022)

CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications
by: Zhang, Tianfang, et al.
Published: (2024)

HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval
by: He, Chao, et al.
Published: (2024)

Multi-Field De-interlacing using Deformable Convolution Residual Blocks and Self-Attention
by: Ji, Ronglei, et al.
Published: (2022)

SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer
by: Fang, Tongcheng, et al.
Published: (2026)

MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution
by: Xie, Chengxing, et al.
Published: (2024)

Three-Stream Temporal-Shift Attention Network Based on Self-Knowledge Distillation for Micro-Expression Recognition
by: Zhu, Guanghao, et al.
Published: (2024)

Partial Convolution Meets Visual Attention
by: Huang, Haiduo, et al.
Published: (2025)

Efficient Face Image Quality Assessment via Self-training and Knowledge Distillation
by: Sun, Wei, et al.
Published: (2025)

Efficient Single Image Super-Resolution with Entropy Attention and Receptive Field Augmentation
by: Zhao, Xiaole, et al.
Published: (2024)

CSAKD: Knowledge Distillation with Cross Self-Attention for Hyperspectral and Multispectral Image Fusion
by: Hsu, Chih-Chung, et al.
Published: (2024)

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
by: Zhou, Yupeng, et al.
Published: (2024)

Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers
by: Bing, Zhaodong, et al.
Published: (2025)

Progressively Normalized Self-Attention Network for Video Polyp Segmentation
by: Ji, Ge-Peng, et al.
Published: (2021)

Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention
by: Truong, Quang-Trung, et al.
Published: (2024)

Self-Attention Decomposition For Training Free Diffusion Editing
by: Anand, Tharun, et al.
Published: (2025)

Towards Understanding the Robustness of Diffusion-Based Purification: A Stochastic Perspective
by: Liu, Yiming, et al.
Published: (2024)

Veda: Scalable Video Diffusion via Distilled Sparse Attention
by: Han, Shihao, et al.
Published: (2026)

CASA: Cross-Attention over Self-Attention for Efficient Vision-Language Fusion
by: Böhle, Moritz, et al.
Published: (2025)

Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion Prior
by: Shi, Yukai, et al.
Published: (2024)

Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance
by: Ahn, Donghoon, et al.
Published: (2024)

Hybrid Convolutional and Attention Network for Hyperspectral Image Denoising
by: Hu, Shuai, et al.
Published: (2024)

Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers
by: Zhou, Yifan, et al.
Published: (2025)

Synthesizer Based Efficient Self-Attention for Vision Tasks
by: Zhu, Guangyang, et al.
Published: (2022)

LAC-Net: Linear-Fusion Attention-Guided Convolutional Network for Accurate Robotic Grasping Under the Occlusion
by: Zhang, Jinyu, et al.
Published: (2024)

ATOM: Attention Mixer for Efficient Dataset Distillation
by: Khaki, Samir, et al.
Published: (2024)

Efficient Masked Image Compression with Position-Indexed Self-Attention
by: Dai, Chengjie, et al.
Published: (2025)

S2AFormer: Strip Self-Attention for Efficient Vision Transformer
by: Xu, Guoan, et al.
Published: (2025)

VMonarch: Efficient Video Diffusion Transformers with Structured Attention
by: Liang, Cheng, et al.
Published: (2026)

GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions
by: Feng, Liang, et al.
Published: (2024)

Efficient Star Distillation Attention Network for Lightweight Image Super-Resolution
by: Hao, Fangwei, et al.
Published: (2025)

Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance
by: Sun, Wenhao, et al.
Published: (2024)