Saved in:
| Main Authors: | Dong, ZiYi, Zhou, Chengxing, Deng, Weijian, Wei, Pengxu, Ji, Xiangyang, Lin, Liang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.21292 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Delving into Cascaded Instability: A Lipschitz Continuity View on Image Restoration and Object Detection Synergy
by: Zhao, Qing, et al.
Published: (2025)
by: Zhao, Qing, et al.
Published: (2025)
Language Generation as Optimal Control: Closed-Loop Diffusion in Latent Control Space
by: Dong, ZiYi, et al.
Published: (2026)
by: Dong, ZiYi, et al.
Published: (2026)
When Preference Labels Fall Short: Aligning Diffusion Models from Real Data
by: Chen, Weiyan, et al.
Published: (2026)
by: Chen, Weiyan, et al.
Published: (2026)
Unveiling Perceptual Artifacts: A Fine-Grained Benchmark for Interpretable AI-Generated Image Detection
by: Xiao, Yao, et al.
Published: (2026)
by: Xiao, Yao, et al.
Published: (2026)
Slot Attention with Re-Initialization and Self-Distillation
by: Zhao, Rongzhen, et al.
Published: (2025)
by: Zhao, Rongzhen, et al.
Published: (2025)
Attentive Convolution: Unifying the Expressivity of Self-Attention with Convolutional Efficiency
by: Yu, Hao, et al.
Published: (2025)
by: Yu, Hao, et al.
Published: (2025)
Decoder-Only LLMs are Better Controllers for Diffusion Models
by: Dong, Ziyi, et al.
Published: (2025)
by: Dong, Ziyi, et al.
Published: (2025)
ELA: Efficient Local Attention for Deep Convolutional Neural Networks
by: Xu, Wei, et al.
Published: (2024)
by: Xu, Wei, et al.
Published: (2024)
DreamArtist++: Controllable One-Shot Text-to-Image Generation via Positive-Negative Adapter
by: Dong, Ziyi, et al.
Published: (2022)
by: Dong, Ziyi, et al.
Published: (2022)
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications
by: Zhang, Tianfang, et al.
Published: (2024)
by: Zhang, Tianfang, et al.
Published: (2024)
HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval
by: He, Chao, et al.
Published: (2024)
by: He, Chao, et al.
Published: (2024)
Multi-Field De-interlacing using Deformable Convolution Residual Blocks and Self-Attention
by: Ji, Ronglei, et al.
Published: (2022)
by: Ji, Ronglei, et al.
Published: (2022)
SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer
by: Fang, Tongcheng, et al.
Published: (2026)
by: Fang, Tongcheng, et al.
Published: (2026)
MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution
by: Xie, Chengxing, et al.
Published: (2024)
by: Xie, Chengxing, et al.
Published: (2024)
Three-Stream Temporal-Shift Attention Network Based on Self-Knowledge Distillation for Micro-Expression Recognition
by: Zhu, Guanghao, et al.
Published: (2024)
by: Zhu, Guanghao, et al.
Published: (2024)
Partial Convolution Meets Visual Attention
by: Huang, Haiduo, et al.
Published: (2025)
by: Huang, Haiduo, et al.
Published: (2025)
Efficient Face Image Quality Assessment via Self-training and Knowledge Distillation
by: Sun, Wei, et al.
Published: (2025)
by: Sun, Wei, et al.
Published: (2025)
Efficient Single Image Super-Resolution with Entropy Attention and Receptive Field Augmentation
by: Zhao, Xiaole, et al.
Published: (2024)
by: Zhao, Xiaole, et al.
Published: (2024)
CSAKD: Knowledge Distillation with Cross Self-Attention for Hyperspectral and Multispectral Image Fusion
by: Hsu, Chih-Chung, et al.
Published: (2024)
by: Hsu, Chih-Chung, et al.
Published: (2024)
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
by: Zhou, Yupeng, et al.
Published: (2024)
by: Zhou, Yupeng, et al.
Published: (2024)
Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers
by: Bing, Zhaodong, et al.
Published: (2025)
by: Bing, Zhaodong, et al.
Published: (2025)
Progressively Normalized Self-Attention Network for Video Polyp Segmentation
by: Ji, Ge-Peng, et al.
Published: (2021)
by: Ji, Ge-Peng, et al.
Published: (2021)
Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention
by: Truong, Quang-Trung, et al.
Published: (2024)
by: Truong, Quang-Trung, et al.
Published: (2024)
Self-Attention Decomposition For Training Free Diffusion Editing
by: Anand, Tharun, et al.
Published: (2025)
by: Anand, Tharun, et al.
Published: (2025)
Towards Understanding the Robustness of Diffusion-Based Purification: A Stochastic Perspective
by: Liu, Yiming, et al.
Published: (2024)
by: Liu, Yiming, et al.
Published: (2024)
Veda: Scalable Video Diffusion via Distilled Sparse Attention
by: Han, Shihao, et al.
Published: (2026)
by: Han, Shihao, et al.
Published: (2026)
CASA: Cross-Attention over Self-Attention for Efficient Vision-Language Fusion
by: Böhle, Moritz, et al.
Published: (2025)
by: Böhle, Moritz, et al.
Published: (2025)
Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion Prior
by: Shi, Yukai, et al.
Published: (2024)
by: Shi, Yukai, et al.
Published: (2024)
Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance
by: Ahn, Donghoon, et al.
Published: (2024)
by: Ahn, Donghoon, et al.
Published: (2024)
Hybrid Convolutional and Attention Network for Hyperspectral Image Denoising
by: Hu, Shuai, et al.
Published: (2024)
by: Hu, Shuai, et al.
Published: (2024)
Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers
by: Zhou, Yifan, et al.
Published: (2025)
by: Zhou, Yifan, et al.
Published: (2025)
Synthesizer Based Efficient Self-Attention for Vision Tasks
by: Zhu, Guangyang, et al.
Published: (2022)
by: Zhu, Guangyang, et al.
Published: (2022)
LAC-Net: Linear-Fusion Attention-Guided Convolutional Network for Accurate Robotic Grasping Under the Occlusion
by: Zhang, Jinyu, et al.
Published: (2024)
by: Zhang, Jinyu, et al.
Published: (2024)
ATOM: Attention Mixer for Efficient Dataset Distillation
by: Khaki, Samir, et al.
Published: (2024)
by: Khaki, Samir, et al.
Published: (2024)
Efficient Masked Image Compression with Position-Indexed Self-Attention
by: Dai, Chengjie, et al.
Published: (2025)
by: Dai, Chengjie, et al.
Published: (2025)
S2AFormer: Strip Self-Attention for Efficient Vision Transformer
by: Xu, Guoan, et al.
Published: (2025)
by: Xu, Guoan, et al.
Published: (2025)
VMonarch: Efficient Video Diffusion Transformers with Structured Attention
by: Liang, Cheng, et al.
Published: (2026)
by: Liang, Cheng, et al.
Published: (2026)
GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions
by: Feng, Liang, et al.
Published: (2024)
by: Feng, Liang, et al.
Published: (2024)
Efficient Star Distillation Attention Network for Lightweight Image Super-Resolution
by: Hao, Fangwei, et al.
Published: (2025)
by: Hao, Fangwei, et al.
Published: (2025)
Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance
by: Sun, Wenhao, et al.
Published: (2024)
by: Sun, Wenhao, et al.
Published: (2024)
Similar Items
-
Delving into Cascaded Instability: A Lipschitz Continuity View on Image Restoration and Object Detection Synergy
by: Zhao, Qing, et al.
Published: (2025) -
Language Generation as Optimal Control: Closed-Loop Diffusion in Latent Control Space
by: Dong, ZiYi, et al.
Published: (2026) -
When Preference Labels Fall Short: Aligning Diffusion Models from Real Data
by: Chen, Weiyan, et al.
Published: (2026) -
Unveiling Perceptual Artifacts: A Fine-Grained Benchmark for Interpretable AI-Generated Image Detection
by: Xiao, Yao, et al.
Published: (2026) -
Slot Attention with Re-Initialization and Self-Distillation
by: Zhao, Rongzhen, et al.
Published: (2025)