Saved in:
| Main Authors: | Wu, Junhao, Yao, Dezhong, Jin, Hai |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.27003 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Collaborative Few-Step Distillation and Low-Bit Quantization for Wan2.2 Dual-Expert Video Diffusion Models
by: Du, Jinyang, et al.
Published: (2026)
by: Du, Jinyang, et al.
Published: (2026)
Fine-Tuning Open Video Generators for Cinematic Scene Synthesis: A Small-Data Pipeline with LoRA and Wan2.1 I2V
by: Akarsu, Meftun, et al.
Published: (2025)
by: Akarsu, Meftun, et al.
Published: (2025)
Q$^2$: Quantization-Aware Gradient Balancing and Attention Alignment for Low-Bit Quantization
by: Wang, Zhaoyang, et al.
Published: (2025)
by: Wang, Zhaoyang, et al.
Published: (2025)
Timestep-Aware Correction for Quantized Diffusion Models
by: Yao, Yuzhe, et al.
Published: (2024)
by: Yao, Yuzhe, et al.
Published: (2024)
TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis
by: Ton, Tri, et al.
Published: (2025)
by: Ton, Tri, et al.
Published: (2025)
Explainable Synthetic Image Detection through Diffusion Timestep Ensembling
by: Wu, Yixin, et al.
Published: (2025)
by: Wu, Yixin, et al.
Published: (2025)
P4Q: Learning to Prompt for Quantization in Visual-language Models
by: Sun, Huixin, et al.
Published: (2024)
by: Sun, Huixin, et al.
Published: (2024)
HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration
by: Sun, Desen, et al.
Published: (2026)
by: Sun, Desen, et al.
Published: (2026)
Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients
by: Xiang, Ziwei, et al.
Published: (2026)
by: Xiang, Ziwei, et al.
Published: (2026)
DuQuant++: Fine-grained Rotation Enhances Microscaling FP4 Quantization
by: Lin, Haokun, et al.
Published: (2026)
by: Lin, Haokun, et al.
Published: (2026)
Q-SAM2: Accurate Quantization for Segment Anything Model 2
by: Farronato, Nicola, et al.
Published: (2025)
by: Farronato, Nicola, et al.
Published: (2025)
Characterizing Motion Encoding in Video Diffusion Timesteps
by: Baherwani, Vatsal, et al.
Published: (2025)
by: Baherwani, Vatsal, et al.
Published: (2025)
The Disappearance of Timestep Embedding in Modern Time-Dependent Neural Networks
by: Kim, Bum Jun, et al.
Published: (2024)
by: Kim, Bum Jun, et al.
Published: (2024)
SegQuant: A Semantics-Aware and Generalizable Quantization Framework for Diffusion Models
by: Zhang, Jiaji, et al.
Published: (2025)
by: Zhang, Jiaji, et al.
Published: (2025)
Accelerating Diffusion-based Video Editing via Heterogeneous Caching: Beyond Full Computing at Sampled Denoising Timestep
by: Liu, Tianyi, et al.
Published: (2026)
by: Liu, Tianyi, et al.
Published: (2026)
Is it safe to cross? Interpretable Risk Assessment with GPT-4V for Safety-Aware Street Crossing
by: Hwang, Hochul, et al.
Published: (2024)
by: Hwang, Hochul, et al.
Published: (2024)
HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization
by: Liu, Wenxuan, et al.
Published: (2024)
by: Liu, Wenxuan, et al.
Published: (2024)
Semantic-Consistent Bidirectional Contrastive Hashing for Noisy Multi-Label Cross-Modal Retrieval
by: Peng, Likang, et al.
Published: (2025)
by: Peng, Likang, et al.
Published: (2025)
SDQ-LLM: Sigma-Delta Quantization for 1-bit LLMs of any size
by: Xia, Junhao, et al.
Published: (2025)
by: Xia, Junhao, et al.
Published: (2025)
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
by: Li, Muyang, et al.
Published: (2024)
by: Li, Muyang, et al.
Published: (2024)
T2I-VeRW: Part-level Fine-grained Perception for Text-to-Image Vehicle Retrieval
by: Wang, Xiao, et al.
Published: (2026)
by: Wang, Xiao, et al.
Published: (2026)
Anti-I2V: Safeguarding your photos from malicious image-to-video generation
by: Vu, Duc, et al.
Published: (2026)
by: Vu, Duc, et al.
Published: (2026)
Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers
by: You, Haoran, et al.
Published: (2024)
by: You, Haoran, et al.
Published: (2024)
Modality-Aware and Anatomical Vector-Quantized Autoencoding for Multimodal Brain MRI
by: Li, Mingjie, et al.
Published: (2026)
by: Li, Mingjie, et al.
Published: (2026)
PTQ4ARVG: Post-Training Quantization for AutoRegressive Visual Generation Models
by: Liu, Xuewen, et al.
Published: (2026)
by: Liu, Xuewen, et al.
Published: (2026)
Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark
by: Wang, Shiao, et al.
Published: (2025)
by: Wang, Shiao, et al.
Published: (2025)
Temporal Aware Pruning for Efficient Diffusion-based Video Generation
by: Li, Sheng, et al.
Published: (2026)
by: Li, Sheng, et al.
Published: (2026)
PointCloud-Text Matching: Benchmark Datasets and a Baseline
by: Feng, Yanglin, et al.
Published: (2024)
by: Feng, Yanglin, et al.
Published: (2024)
QAPruner: Quantization-Aware Vision Token Pruning for Multimodal Large Language Models
by: Wang, Xinhao, et al.
Published: (2026)
by: Wang, Xinhao, et al.
Published: (2026)
Interruption-Aware Cooperative Perception for V2X Communication-Aided Autonomous Driving
by: Ren, Shunli, et al.
Published: (2023)
by: Ren, Shunli, et al.
Published: (2023)
T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text
by: Yin, Aoxiong, et al.
Published: (2024)
by: Yin, Aoxiong, et al.
Published: (2024)
Wan-S2V: Audio-Driven Cinematic Video Generation
by: Gao, Xin, et al.
Published: (2025)
by: Gao, Xin, et al.
Published: (2025)
The Dawn of KAN in Image-to-Image (I2I) Translation: Integrating Kolmogorov-Arnold Networks with GANs for Unpaired I2I Translation
by: Mahara, Arpan, et al.
Published: (2024)
by: Mahara, Arpan, et al.
Published: (2024)
SweetTok: Semantic-Aware Spatial-Temporal Tokenizer for Compact Video Discretization
by: Tan, Zhentao, et al.
Published: (2024)
by: Tan, Zhentao, et al.
Published: (2024)
Q-Sched: Pushing the Boundaries of Few-Step Diffusion Models with Quantization-Aware Scheduling
by: Frumkin, Natalia, et al.
Published: (2025)
by: Frumkin, Natalia, et al.
Published: (2025)
Efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deployment
by: Lu, Haisheng, et al.
Published: (2024)
by: Lu, Haisheng, et al.
Published: (2024)
GAT-NeRF: Geometry-Aware-Transformer Enhanced Neural Radiance Fields for High-Fidelity 4D Facial Avatars
by: Chang, Zhe, et al.
Published: (2026)
by: Chang, Zhe, et al.
Published: (2026)
Echo4DIR: 4D Implicit Heart Reconstruction from 2D Echocardiography Videos
by: Liu, Yanan, et al.
Published: (2026)
by: Liu, Yanan, et al.
Published: (2026)
Hyp2Former: Hierarchy-Aware Hyperbolic Embeddings for Open-Set Panoptic Segmentation
by: Lu, Yao, et al.
Published: (2026)
by: Lu, Yao, et al.
Published: (2026)
Channel-wise Vector Quantization
by: Song, Wei, et al.
Published: (2026)
by: Song, Wei, et al.
Published: (2026)
Similar Items
-
Collaborative Few-Step Distillation and Low-Bit Quantization for Wan2.2 Dual-Expert Video Diffusion Models
by: Du, Jinyang, et al.
Published: (2026) -
Fine-Tuning Open Video Generators for Cinematic Scene Synthesis: A Small-Data Pipeline with LoRA and Wan2.1 I2V
by: Akarsu, Meftun, et al.
Published: (2025) -
Q$^2$: Quantization-Aware Gradient Balancing and Attention Alignment for Low-Bit Quantization
by: Wang, Zhaoyang, et al.
Published: (2025) -
Timestep-Aware Correction for Quantized Diffusion Models
by: Yao, Yuzhe, et al.
Published: (2024) -
TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis
by: Ton, Tri, et al.
Published: (2025)