Saved in:
| Main Authors: | Chu, Huanpeng, Wu, Wei, Fen, Guanyu, Zhang, Yutao |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.16212 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
QNCD: Quantization Noise Correction for Diffusion Models
by: Chu, Huanpeng, et al.
Published: (2024)
by: Chu, Huanpeng, et al.
Published: (2024)
AdaCorrection: Adaptive Offset Cache Correction for Accurate Diffusion Transformers
by: Liu, Dong, et al.
Published: (2026)
by: Liu, Dong, et al.
Published: (2026)
FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation
by: Liu, Dong, et al.
Published: (2025)
by: Liu, Dong, et al.
Published: (2025)
Rethinking Token-wise Feature Caching: Accelerating Diffusion Transformers with Dual Feature Caching
by: Zou, Chang, et al.
Published: (2024)
by: Zou, Chang, et al.
Published: (2024)
CacheQuant: Comprehensively Accelerated Diffusion Models
by: Liu, Xuewen, et al.
Published: (2025)
by: Liu, Xuewen, et al.
Published: (2025)
BWCache: Accelerating Video Diffusion Transformers through Block-Wise Caching
by: Cui, Hanshuai, et al.
Published: (2025)
by: Cui, Hanshuai, et al.
Published: (2025)
Accelerating Diffusion Transformers with Token-wise Feature Caching
by: Zou, Chang, et al.
Published: (2024)
by: Zou, Chang, et al.
Published: (2024)
DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching
by: Zou, Chang, et al.
Published: (2026)
by: Zou, Chang, et al.
Published: (2026)
H2-Cache: A Novel Hierarchical Dual-Stage Cache for High-Performance Acceleration of Generative Diffusion Models
by: Sung, Mingyu, et al.
Published: (2025)
by: Sung, Mingyu, et al.
Published: (2025)
SpeCa: Accelerating Diffusion Transformers with Speculative Feature Caching
by: Liu, Jiacheng, et al.
Published: (2025)
by: Liu, Jiacheng, et al.
Published: (2025)
OmniDiT: Extending Diffusion Transformer to Omni-VTON Framework
by: Zeng, Weixuan, et al.
Published: (2026)
by: Zeng, Weixuan, et al.
Published: (2026)
RT-Cache: Training-Free Retrieval for Real-Time Manipulation
by: Kwon, Owen, et al.
Published: (2025)
by: Kwon, Owen, et al.
Published: (2025)
DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching
by: Aiello, Emanuele, et al.
Published: (2024)
by: Aiello, Emanuele, et al.
Published: (2024)
Efficient Long-Horizon GUI Agents via Training-Free KV Cache Compression
by: Zhou, Bowen, et al.
Published: (2026)
by: Zhou, Bowen, et al.
Published: (2026)
Fast Autoregressive Video Diffusion and World Models with Temporal Cache Compression and Sparse Attention
by: Samuel, Dvir, et al.
Published: (2026)
by: Samuel, Dvir, et al.
Published: (2026)
FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models
by: So, Junhyuk, et al.
Published: (2023)
by: So, Junhyuk, et al.
Published: (2023)
AirCache: Activating Inter-modal Relevancy KV Cache Compression for Efficient Large Vision-Language Model Inference
by: Huang, Kai, et al.
Published: (2025)
by: Huang, Kai, et al.
Published: (2025)
What Kind of Visual Tokens Do We Need? Training-free Visual Token Pruning for Multi-modal Large Language Models from the Perspective of Graph
by: Jiang, Yutao, et al.
Published: (2025)
by: Jiang, Yutao, et al.
Published: (2025)
FreqCa: Accelerating Diffusion Models via Frequency-Aware Caching
by: Liu, Jiacheng, et al.
Published: (2025)
by: Liu, Jiacheng, et al.
Published: (2025)
Real2SAM2Real: Generative 3D Caches as Complementary Context for Video Diffusion
by: Wu, Jiayi, et al.
Published: (2026)
by: Wu, Jiayi, et al.
Published: (2026)
WorldCache: Content-Aware Caching for Accelerated Video World Models
by: Nawaz, Umair, et al.
Published: (2026)
by: Nawaz, Umair, et al.
Published: (2026)
Learning Generalized and Flexible Trajectory Models from Omni-Semantic Supervision
by: Zhu, Yuanshao, et al.
Published: (2025)
by: Zhu, Yuanshao, et al.
Published: (2025)
A Survey on Cache Methods in Diffusion Models: Toward Efficient Multi-Modal Generation
by: Liu, Jiacheng, et al.
Published: (2025)
by: Liu, Jiacheng, et al.
Published: (2025)
VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion
by: Yesiltepe, Hidir, et al.
Published: (2026)
by: Yesiltepe, Hidir, et al.
Published: (2026)
LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation
by: Gao, Huanlin, et al.
Published: (2025)
by: Gao, Huanlin, et al.
Published: (2025)
FlashBlock: Attention Caching for Efficient Long-Context Block Diffusion
by: Chen, Zhuokun, et al.
Published: (2026)
by: Chen, Zhuokun, et al.
Published: (2026)
Motion-Aware Caching for Efficient Autoregressive Video Generation
by: Xu, Jing, et al.
Published: (2026)
by: Xu, Jing, et al.
Published: (2026)
RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers
by: Gokmen, Ahmet Berke, et al.
Published: (2025)
by: Gokmen, Ahmet Berke, et al.
Published: (2025)
Cached Multi-Lora Composition for Multi-Concept Image Generation
by: Zou, Xiandong, et al.
Published: (2025)
by: Zou, Xiandong, et al.
Published: (2025)
Accelerating Diffusion-based Video Editing via Heterogeneous Caching: Beyond Full Computing at Sampled Denoising Timestep
by: Liu, Tianyi, et al.
Published: (2026)
by: Liu, Tianyi, et al.
Published: (2026)
Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models
by: Ma, Xuran, et al.
Published: (2025)
by: Ma, Xuran, et al.
Published: (2025)
KVCapsule: Efficient Sequential KV Cache Compression for Vision-Language Models with Asymmetric Redundancy
by: Huang, Yingbing, et al.
Published: (2026)
by: Huang, Yingbing, et al.
Published: (2026)
Multi-Cache Enhanced Prototype Learning for Test-Time Generalization of Vision-Language Models
by: Chen, Xinyu, et al.
Published: (2025)
by: Chen, Xinyu, et al.
Published: (2025)
From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers
by: Liu, Jiacheng, et al.
Published: (2025)
by: Liu, Jiacheng, et al.
Published: (2025)
Fast Sampling Through The Reuse Of Attention Maps In Diffusion Models
by: Hunter, Rosco, et al.
Published: (2023)
by: Hunter, Rosco, et al.
Published: (2023)
EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation
by: Jagpal, Diljeet, et al.
Published: (2025)
by: Jagpal, Diljeet, et al.
Published: (2025)
Cross-Self KV Cache Pruning for Efficient Vision-Language Inference
by: Pei, Xiaohuan, et al.
Published: (2024)
by: Pei, Xiaohuan, et al.
Published: (2024)
Chipmunk: Training-Free Acceleration of Diffusion Transformers with Dynamic Column-Sparse Deltas
by: Silveria, Austin, et al.
Published: (2025)
by: Silveria, Austin, et al.
Published: (2025)
FAIRT2V: Training-Free Debiasing for Text-to-Video Diffusion Models
by: Zhong, Haonan, et al.
Published: (2026)
by: Zhong, Haonan, et al.
Published: (2026)
OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation
by: Zhang, Guohui, et al.
Published: (2026)
by: Zhang, Guohui, et al.
Published: (2026)
Similar Items
-
QNCD: Quantization Noise Correction for Diffusion Models
by: Chu, Huanpeng, et al.
Published: (2024) -
AdaCorrection: Adaptive Offset Cache Correction for Accurate Diffusion Transformers
by: Liu, Dong, et al.
Published: (2026) -
FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation
by: Liu, Dong, et al.
Published: (2025) -
Rethinking Token-wise Feature Caching: Accelerating Diffusion Transformers with Dual Feature Caching
by: Zou, Chang, et al.
Published: (2024) -
CacheQuant: Comprehensively Accelerated Diffusion Models
by: Liu, Xuewen, et al.
Published: (2025)