Saved in:
| Main Authors: | Wang, Huizheng, Wang, Hongbin, Wei, Shaojun, Hu, Yang, Yin, Shouyi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.07855 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
BitStopper: An Efficient Transformer Attention Accelerator via Stage-fusion and Early Termination
by: Wang, Huizheng, et al.
Published: (2025)
by: Wang, Huizheng, et al.
Published: (2025)
Catch-Up Distillation: You Only Need to Train Once for Accelerating Sampling
by: Shao, Shitong, et al.
Published: (2023)
by: Shao, Shitong, et al.
Published: (2023)
ELSA: Exploiting Layer-wise N:M Sparsity for Vision Transformer Acceleration
by: Huang, Ning-Chi, et al.
Published: (2024)
by: Huang, Ning-Chi, et al.
Published: (2024)
Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
by: Xi, Haocheng, et al.
Published: (2025)
by: Xi, Haocheng, et al.
Published: (2025)
SparVAR: Exploring Sparsity in Visual AutoRegressive Modeling for Training-Free Acceleration
by: Li, Zekun, et al.
Published: (2026)
by: Li, Zekun, et al.
Published: (2026)
Dynamic Sparse Training with Structured Sparsity
by: Lasby, Mike, et al.
Published: (2023)
by: Lasby, Mike, et al.
Published: (2023)
SINR: Sparsity Driven Compressed Implicit Neural Representations
by: Jayasundara, Dhananjaya, et al.
Published: (2025)
by: Jayasundara, Dhananjaya, et al.
Published: (2025)
Dynamic Domain Adaptation-Driven Physics-Informed Graph Representation Learning for AC-OPF
by: Zhu, Hongjie, et al.
Published: (2025)
by: Zhu, Hongjie, et al.
Published: (2025)
Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity
by: Liu, Shiwei, et al.
Published: (2021)
by: Liu, Shiwei, et al.
Published: (2021)
Token Caching for Diffusion Transformer Acceleration
by: Lou, Jinming, et al.
Published: (2024)
by: Lou, Jinming, et al.
Published: (2024)
Modulated Diffusion: Accelerating Generative Modeling with Modulated Quantization
by: Gao, Weizhi, et al.
Published: (2025)
by: Gao, Weizhi, et al.
Published: (2025)
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention
by: Zhang, Jintao, et al.
Published: (2025)
by: Zhang, Jintao, et al.
Published: (2025)
Robust Cross-Domain WiFi Fall Detection via Physics-Driven Attention-Enhanced Transformers
by: Wang, Yingzhe, et al.
Published: (2026)
by: Wang, Yingzhe, et al.
Published: (2026)
Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
by: Ma, Xinyin, et al.
Published: (2024)
by: Ma, Xinyin, et al.
Published: (2024)
Domain-Enhanced Dual-Branch Model for Efficient and Interpretable Accident Anticipation
by: Guan, Yanchen, et al.
Published: (2025)
by: Guan, Yanchen, et al.
Published: (2025)
FastMMoE: Accelerating Multimodal Large Language Models through Dynamic Expert Activation and Routing-Aware Token Pruning
by: Xia, Guoyang, et al.
Published: (2025)
by: Xia, Guoyang, et al.
Published: (2025)
Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient
by: Wu, Yongliang, et al.
Published: (2024)
by: Wu, Yongliang, et al.
Published: (2024)
LASERS: LAtent Space Encoding for Representations with Sparsity for Generative Modeling
by: Li, Xin, et al.
Published: (2024)
by: Li, Xin, et al.
Published: (2024)
SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity
by: Ma, Ke, et al.
Published: (2025)
by: Ma, Ke, et al.
Published: (2025)
SQ-DM: Accelerating Diffusion Models with Aggressive Quantization and Temporal Sparsity
by: Fan, Zichen, et al.
Published: (2025)
by: Fan, Zichen, et al.
Published: (2025)
Predictive Dynamic Fusion
by: Cao, Bing, et al.
Published: (2024)
by: Cao, Bing, et al.
Published: (2024)
Editable Concept Bottleneck Models
by: Hu, Lijie, et al.
Published: (2024)
by: Hu, Lijie, et al.
Published: (2024)
Learning Velocity and Acceleration: Self-Supervised Motion Consistency for Pedestrian Trajectory Prediction
by: Huang, Yizhou, et al.
Published: (2025)
by: Huang, Yizhou, et al.
Published: (2025)
Sparsity Hurts: Simple Linear Adapter Can Boost Generalized Category Discovery
by: Ye, Bo, et al.
Published: (2026)
by: Ye, Bo, et al.
Published: (2026)
Hyper-STTN: Hypergraph Augmented Spatial-Temporal Transformer Network for Trajectory Prediction
by: Wang, Weizheng, et al.
Published: (2024)
by: Wang, Weizheng, et al.
Published: (2024)
LOTUS: Improving Transformer Efficiency with Sparsity Pruning and Data Lottery Tickets
by: Upadhyay, Ojasw
Published: (2024)
by: Upadhyay, Ojasw
Published: (2024)
Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity and Performance Restoration
by: He, Shwai, et al.
Published: (2024)
by: He, Shwai, et al.
Published: (2024)
Video Prediction of Dynamic Physical Simulations With Pixel-Space Spatiotemporal Transformers
by: Slack, Dean L, et al.
Published: (2025)
by: Slack, Dean L, et al.
Published: (2025)
On-Demand Multi-Task Sparsity for Efficient Large-Model Deployment on Edge Devices
by: Huang, Lianming, et al.
Published: (2025)
by: Huang, Lianming, et al.
Published: (2025)
Stable Vision Concept Transformers for Medical Diagnosis
by: Hu, Lijie, et al.
Published: (2025)
by: Hu, Lijie, et al.
Published: (2025)
JLT: Clean-Latent Prediction in Latent Diffusion Transformers
by: Fu, Funing, et al.
Published: (2026)
by: Fu, Funing, et al.
Published: (2026)
Relational Feature Caching for Accelerating Diffusion Transformers
by: Son, Byunggwan, et al.
Published: (2026)
by: Son, Byunggwan, et al.
Published: (2026)
Guiding a Diffusion Transformer with the Internal Dynamics of Itself
by: Zhou, Xingyu, et al.
Published: (2025)
by: Zhou, Xingyu, et al.
Published: (2025)
Model Predictive Simulation Using Structured Graphical Models and Transformers
by: Lou, Xinghua, et al.
Published: (2024)
by: Lou, Xinghua, et al.
Published: (2024)
PCM-SAR: Physics-Driven Contrastive Mutual Learning for SAR Classification
by: Wang, Pengfei, et al.
Published: (2025)
by: Wang, Pengfei, et al.
Published: (2025)
LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention
by: Du, Zewen, et al.
Published: (2024)
by: Du, Zewen, et al.
Published: (2024)
FIS-DiT: Breaking the Few-Step Video Inference Barrier via Training-Free Frame Interleaved Sparsity
by: Tang, Jian, et al.
Published: (2026)
by: Tang, Jian, et al.
Published: (2026)
Vision Transformer-based Adversarial Domain Adaptation
by: Li, Yahan, et al.
Published: (2024)
by: Li, Yahan, et al.
Published: (2024)
Efficient Visual Transformer by Learnable Token Merging
by: Wang, Yancheng, et al.
Published: (2024)
by: Wang, Yancheng, et al.
Published: (2024)
Compact Vision Transformer by Reduction of Kernel Complexity
by: Wang, Yancheng, et al.
Published: (2025)
by: Wang, Yancheng, et al.
Published: (2025)
Similar Items
-
BitStopper: An Efficient Transformer Attention Accelerator via Stage-fusion and Early Termination
by: Wang, Huizheng, et al.
Published: (2025) -
Catch-Up Distillation: You Only Need to Train Once for Accelerating Sampling
by: Shao, Shitong, et al.
Published: (2023) -
ELSA: Exploiting Layer-wise N:M Sparsity for Vision Transformer Acceleration
by: Huang, Ning-Chi, et al.
Published: (2024) -
Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
by: Xi, Haocheng, et al.
Published: (2025) -
SparVAR: Exploring Sparsity in Visual AutoRegressive Modeling for Training-Free Acceleration
by: Li, Zekun, et al.
Published: (2026)