Library Catalog

Bibliographic Details
Main Authors:	Tan, Haoyue; Wang, Shengnan; Qiao, Yulin; Zhang, Juncheng; Bai, Youhui; Gong, Ping; Jin, Zewen; Li, Cheng
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.18348

Similar Items

LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention
by: Du, Zewen, et al.
Published: (2024)

AdaIAT: Adaptively Increasing Attention to Generated Text to Alleviate Hallucinations in LVLM
by: Zhong, Li'an, et al.
Published: (2026)

VideoClusterNet: Self-Supervised and Adaptive Face Clustering For Videos
by: Walawalkar, Devesh, et al.
Published: (2024)

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
by: Ni, Zanlin, et al.
Published: (2024)

Towards Robust Out-of-Distribution Generalization: Data Augmentation and Neural Architecture Search Approaches
by: Bai, Haoyue
Published: (2024)

Attention Sparsity is Input-Stable: Training-Free Sparse Attention for Video Generation via Offline Sparsity Profiling and Online QK Co-Clustering
by: Luo, Jiayi, et al.
Published: (2026)

AdaGen: Learning Adaptive Policy for Image Synthesis
by: Ni, Zanlin, et al.
Published: (2026)

Training-free and Adaptive Sparse Attention for Efficient Long Video Generation
by: Xia, Yifei, et al.
Published: (2025)

CountCluster: Training-Free Object Quantity Guidance with Cross-Attention Map Clustering for Text-to-Image Generation
by: Lee, Joohyeon, et al.
Published: (2025)

AdaMHF: Adaptive Multimodal Hierarchical Fusion for Survival Prediction
by: Zhang, Shuaiyu, et al.
Published: (2025)

Efficient Long-Context LLM Inference via KV Cache Clustering
by: Hu, Jie, et al.
Published: (2025)

AdaFlow: Efficient Long Video Editing via Adaptive Attention Slimming And Keyframe Selection
by: Zhang, Shuheng, et al.
Published: (2025)

AdaSpark: Adaptive Sparsity for Efficient Long-Video Understanding
by: Li, Handong, et al.
Published: (2026)

AVPDN: Learning Motion-Robust and Scale-Adaptive Representations for Video-Based Polyp Detection
by: Chen, Zilin, et al.
Published: (2025)

AdaVid: Adaptive Video-Language Pretraining
by: Patel, Chaitanya, et al.
Published: (2025)

AdaRD-key: Adaptive Relevance-Diversity Keyframe Sampling for Long-form Video understanding
by: Zhang, Xian, et al.
Published: (2025)

VORTA: Efficient Video Diffusion via Routing Sparse Attention
by: Sun, Wenhao, et al.
Published: (2025)

AdaTooler-V: Adaptive Tool-Use for Images and Videos
by: Wang, Chaoyang, et al.
Published: (2025)

Component Adaptive Clustering for Generalized Category Discovery
by: Yan, Mingfu, et al.
Published: (2025)

DynamicRad: Content-Adaptive Sparse Attention for Long Video Diffusion
by: Long, Yongji, et al.
Published: (2026)

Efficient Solvers for Sparse Subspace Clustering
by: Pourkamali-Anaraki, Farhad, et al.
Published: (2018)

AdaOcc: Adaptive-Resolution Occupancy Prediction
by: Chen, Chao, et al.
Published: (2024)

AdaEraser: Training-Free Object Removal via Adaptive Attention Suppression
by: Liu, Dingming
Published: (2026)

Beyond Uniform Query Distribution: Key-Driven Grouped Query Attention
by: Khan, Zohaib, et al.
Published: (2024)

Accelerating Text-to-Video Generation with Calibrated Sparse Attention
by: Yehezkel, Shai, et al.
Published: (2026)

Accelerating Long-Tail Generation in Synchronous RLHF Training via Adaptive Tensor Parallelism
by: Zhao, Long, et al.
Published: (2026)

Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention
by: Lv, Chengtao, et al.
Published: (2026)

Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition
by: Wang, Yulin, et al.
Published: (2024)

Adaptively Clustering Neighbor Elements for Image-Text Generation
by: Wang, Zihua, et al.
Published: (2023)

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation
by: Yang, Shuo, et al.
Published: (2025)

AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding
by: Xue, Zhucun, et al.
Published: (2025)

Edge-aware Hard Clustering Graph Pooling for Brain Imaging
by: Zhu, Cheng, et al.
Published: (2023)

AdaState: Self-Evolving Anchors for Streaming Video Generation
by: Dalva, Yusuf, et al.
Published: (2026)

AdaTP: Attention-Debiased Token Pruning for Video Large Language Models
by: Sun, Fengyuan, et al.
Published: (2025)

GeoQuery: Geometry-Query Diffusion for Sparse-View Reconstruction
by: Cao, Xiao, et al.
Published: (2026)

Multi-tracklet Tracking for Generic Targets with Adaptive Detection Clustering
by: Wu, Zewei, et al.
Published: (2025)

DFSAttn: Dynamic Fine-grained Sparse Attention for Efficient Video Generation
by: Hu, Jie, et al.
Published: (2026)

USV: Towards Understanding the User-generated Short-form Videos
by: Cheng, Haoyue, et al.
Published: (2026)

CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal Curve Queries and Attention
by: Bai, Yifeng, et al.
Published: (2024)

SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving
by: Zhang, Diankun, et al.
Published: (2024)