Saved in:
| Main Authors: | Tan, Haoyue, Wang, Shengnan, Qiao, Yulin, Zhang, Juncheng, Bai, Youhui, Gong, Ping, Jin, Zewen, Li, Cheng |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.18348 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention
by: Du, Zewen, et al.
Published: (2024)
by: Du, Zewen, et al.
Published: (2024)
AdaIAT: Adaptively Increasing Attention to Generated Text to Alleviate Hallucinations in LVLM
by: Zhong, Li'an, et al.
Published: (2026)
by: Zhong, Li'an, et al.
Published: (2026)
VideoClusterNet: Self-Supervised and Adaptive Face Clustering For Videos
by: Walawalkar, Devesh, et al.
Published: (2024)
by: Walawalkar, Devesh, et al.
Published: (2024)
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
by: Ni, Zanlin, et al.
Published: (2024)
by: Ni, Zanlin, et al.
Published: (2024)
Towards Robust Out-of-Distribution Generalization: Data Augmentation and Neural Architecture Search Approaches
by: Bai, Haoyue
Published: (2024)
by: Bai, Haoyue
Published: (2024)
Attention Sparsity is Input-Stable: Training-Free Sparse Attention for Video Generation via Offline Sparsity Profiling and Online QK Co-Clustering
by: Luo, Jiayi, et al.
Published: (2026)
by: Luo, Jiayi, et al.
Published: (2026)
AdaGen: Learning Adaptive Policy for Image Synthesis
by: Ni, Zanlin, et al.
Published: (2026)
by: Ni, Zanlin, et al.
Published: (2026)
Training-free and Adaptive Sparse Attention for Efficient Long Video Generation
by: Xia, Yifei, et al.
Published: (2025)
by: Xia, Yifei, et al.
Published: (2025)
CountCluster: Training-Free Object Quantity Guidance with Cross-Attention Map Clustering for Text-to-Image Generation
by: Lee, Joohyeon, et al.
Published: (2025)
by: Lee, Joohyeon, et al.
Published: (2025)
AdaMHF: Adaptive Multimodal Hierarchical Fusion for Survival Prediction
by: Zhang, Shuaiyu, et al.
Published: (2025)
by: Zhang, Shuaiyu, et al.
Published: (2025)
Efficient Long-Context LLM Inference via KV Cache Clustering
by: Hu, Jie, et al.
Published: (2025)
by: Hu, Jie, et al.
Published: (2025)
AdaFlow: Efficient Long Video Editing via Adaptive Attention Slimming And Keyframe Selection
by: Zhang, Shuheng, et al.
Published: (2025)
by: Zhang, Shuheng, et al.
Published: (2025)
AdaSpark: Adaptive Sparsity for Efficient Long-Video Understanding
by: Li, Handong, et al.
Published: (2026)
by: Li, Handong, et al.
Published: (2026)
AVPDN: Learning Motion-Robust and Scale-Adaptive Representations for Video-Based Polyp Detection
by: Chen, Zilin, et al.
Published: (2025)
by: Chen, Zilin, et al.
Published: (2025)
AdaVid: Adaptive Video-Language Pretraining
by: Patel, Chaitanya, et al.
Published: (2025)
by: Patel, Chaitanya, et al.
Published: (2025)
AdaRD-key: Adaptive Relevance-Diversity Keyframe Sampling for Long-form Video understanding
by: Zhang, Xian, et al.
Published: (2025)
by: Zhang, Xian, et al.
Published: (2025)
VORTA: Efficient Video Diffusion via Routing Sparse Attention
by: Sun, Wenhao, et al.
Published: (2025)
by: Sun, Wenhao, et al.
Published: (2025)
AdaTooler-V: Adaptive Tool-Use for Images and Videos
by: Wang, Chaoyang, et al.
Published: (2025)
by: Wang, Chaoyang, et al.
Published: (2025)
Component Adaptive Clustering for Generalized Category Discovery
by: Yan, Mingfu, et al.
Published: (2025)
by: Yan, Mingfu, et al.
Published: (2025)
DynamicRad: Content-Adaptive Sparse Attention for Long Video Diffusion
by: Long, Yongji, et al.
Published: (2026)
by: Long, Yongji, et al.
Published: (2026)
Efficient Solvers for Sparse Subspace Clustering
by: Pourkamali-Anaraki, Farhad, et al.
Published: (2018)
by: Pourkamali-Anaraki, Farhad, et al.
Published: (2018)
AdaOcc: Adaptive-Resolution Occupancy Prediction
by: Chen, Chao, et al.
Published: (2024)
by: Chen, Chao, et al.
Published: (2024)
AdaEraser: Training-Free Object Removal via Adaptive Attention Suppression
by: Liu, Dingming
Published: (2026)
by: Liu, Dingming
Published: (2026)
Beyond Uniform Query Distribution: Key-Driven Grouped Query Attention
by: Khan, Zohaib, et al.
Published: (2024)
by: Khan, Zohaib, et al.
Published: (2024)
Accelerating Text-to-Video Generation with Calibrated Sparse Attention
by: Yehezkel, Shai, et al.
Published: (2026)
by: Yehezkel, Shai, et al.
Published: (2026)
Accelerating Long-Tail Generation in Synchronous RLHF Training via Adaptive Tensor Parallelism
by: Zhao, Long, et al.
Published: (2026)
by: Zhao, Long, et al.
Published: (2026)
Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention
by: Lv, Chengtao, et al.
Published: (2026)
by: Lv, Chengtao, et al.
Published: (2026)
Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition
by: Wang, Yulin, et al.
Published: (2024)
by: Wang, Yulin, et al.
Published: (2024)
Adaptively Clustering Neighbor Elements for Image-Text Generation
by: Wang, Zihua, et al.
Published: (2023)
by: Wang, Zihua, et al.
Published: (2023)
Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation
by: Yang, Shuo, et al.
Published: (2025)
by: Yang, Shuo, et al.
Published: (2025)
AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding
by: Xue, Zhucun, et al.
Published: (2025)
by: Xue, Zhucun, et al.
Published: (2025)
Edge-aware Hard Clustering Graph Pooling for Brain Imaging
by: Zhu, Cheng, et al.
Published: (2023)
by: Zhu, Cheng, et al.
Published: (2023)
AdaState: Self-Evolving Anchors for Streaming Video Generation
by: Dalva, Yusuf, et al.
Published: (2026)
by: Dalva, Yusuf, et al.
Published: (2026)
AdaTP: Attention-Debiased Token Pruning for Video Large Language Models
by: Sun, Fengyuan, et al.
Published: (2025)
by: Sun, Fengyuan, et al.
Published: (2025)
GeoQuery: Geometry-Query Diffusion for Sparse-View Reconstruction
by: Cao, Xiao, et al.
Published: (2026)
by: Cao, Xiao, et al.
Published: (2026)
Multi-tracklet Tracking for Generic Targets with Adaptive Detection Clustering
by: Wu, Zewei, et al.
Published: (2025)
by: Wu, Zewei, et al.
Published: (2025)
DFSAttn: Dynamic Fine-grained Sparse Attention for Efficient Video Generation
by: Hu, Jie, et al.
Published: (2026)
by: Hu, Jie, et al.
Published: (2026)
USV: Towards Understanding the User-generated Short-form Videos
by: Cheng, Haoyue, et al.
Published: (2026)
by: Cheng, Haoyue, et al.
Published: (2026)
CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal Curve Queries and Attention
by: Bai, Yifeng, et al.
Published: (2024)
by: Bai, Yifeng, et al.
Published: (2024)
SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving
by: Zhang, Diankun, et al.
Published: (2024)
by: Zhang, Diankun, et al.
Published: (2024)
Similar Items
-
LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention
by: Du, Zewen, et al.
Published: (2024) -
AdaIAT: Adaptively Increasing Attention to Generated Text to Alleviate Hallucinations in LVLM
by: Zhong, Li'an, et al.
Published: (2026) -
VideoClusterNet: Self-Supervised and Adaptive Face Clustering For Videos
by: Walawalkar, Devesh, et al.
Published: (2024) -
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
by: Ni, Zanlin, et al.
Published: (2024) -
Towards Robust Out-of-Distribution Generalization: Data Augmentation and Neural Architecture Search Approaches
by: Bai, Haoyue
Published: (2024)