:: Library Catalog

Bibliographic Details
Main Authors:	Wu, Junhao; Yao, Dezhong; Jin, Hai
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.27003

Similar Items

Collaborative Few-Step Distillation and Low-Bit Quantization for Wan2.2 Dual-Expert Video Diffusion Models
by: Du, Jinyang, et al.
Published: (2026)

Fine-Tuning Open Video Generators for Cinematic Scene Synthesis: A Small-Data Pipeline with LoRA and Wan2.1 I2V
by: Akarsu, Meftun, et al.
Published: (2025)

Q$^2$: Quantization-Aware Gradient Balancing and Attention Alignment for Low-Bit Quantization
by: Wang, Zhaoyang, et al.
Published: (2025)

Timestep-Aware Correction for Quantized Diffusion Models
by: Yao, Yuzhe, et al.
Published: (2024)

TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis
by: Ton, Tri, et al.
Published: (2025)

Explainable Synthetic Image Detection through Diffusion Timestep Ensembling
by: Wu, Yixin, et al.
Published: (2025)

P4Q: Learning to Prompt for Quantization in Visual-language Models
by: Sun, Huixin, et al.
Published: (2024)

HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration
by: Sun, Desen, et al.
Published: (2026)

Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients
by: Xiang, Ziwei, et al.
Published: (2026)

DuQuant++: Fine-grained Rotation Enhances Microscaling FP4 Quantization
by: Lin, Haokun, et al.
Published: (2026)

Q-SAM2: Accurate Quantization for Segment Anything Model 2
by: Farronato, Nicola, et al.
Published: (2025)

Characterizing Motion Encoding in Video Diffusion Timesteps
by: Baherwani, Vatsal, et al.
Published: (2025)

The Disappearance of Timestep Embedding in Modern Time-Dependent Neural Networks
by: Kim, Bum Jun, et al.
Published: (2024)

SegQuant: A Semantics-Aware and Generalizable Quantization Framework for Diffusion Models
by: Zhang, Jiaji, et al.
Published: (2025)

Accelerating Diffusion-based Video Editing via Heterogeneous Caching: Beyond Full Computing at Sampled Denoising Timestep
by: Liu, Tianyi, et al.
Published: (2026)

Is it safe to cross? Interpretable Risk Assessment with GPT-4V for Safety-Aware Street Crossing
by: Hwang, Hochul, et al.
Published: (2024)

HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization
by: Liu, Wenxuan, et al.
Published: (2024)

Semantic-Consistent Bidirectional Contrastive Hashing for Noisy Multi-Label Cross-Modal Retrieval
by: Peng, Likang, et al.
Published: (2025)

SDQ-LLM: Sigma-Delta Quantization for 1-bit LLMs of any size
by: Xia, Junhao, et al.
Published: (2025)

SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
by: Li, Muyang, et al.
Published: (2024)

T2I-VeRW: Part-level Fine-grained Perception for Text-to-Image Vehicle Retrieval
by: Wang, Xiao, et al.
Published: (2026)

Anti-I2V: Safeguarding your photos from malicious image-to-video generation
by: Vu, Duc, et al.
Published: (2026)

Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers
by: You, Haoran, et al.
Published: (2024)

Modality-Aware and Anatomical Vector-Quantized Autoencoding for Multimodal Brain MRI
by: Li, Mingjie, et al.
Published: (2026)

PTQ4ARVG: Post-Training Quantization for AutoRegressive Visual Generation Models
by: Liu, Xuewen, et al.
Published: (2026)

Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark
by: Wang, Shiao, et al.
Published: (2025)

Temporal Aware Pruning for Efficient Diffusion-based Video Generation
by: Li, Sheng, et al.
Published: (2026)

PointCloud-Text Matching: Benchmark Datasets and a Baseline
by: Feng, Yanglin, et al.
Published: (2024)

QAPruner: Quantization-Aware Vision Token Pruning for Multimodal Large Language Models
by: Wang, Xinhao, et al.
Published: (2026)

Interruption-Aware Cooperative Perception for V2X Communication-Aided Autonomous Driving
by: Ren, Shunli, et al.
Published: (2023)

T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text
by: Yin, Aoxiong, et al.
Published: (2024)

Wan-S2V: Audio-Driven Cinematic Video Generation
by: Gao, Xin, et al.
Published: (2025)

The Dawn of KAN in Image-to-Image (I2I) Translation: Integrating Kolmogorov-Arnold Networks with GANs for Unpaired I2I Translation
by: Mahara, Arpan, et al.
Published: (2024)

SweetTok: Semantic-Aware Spatial-Temporal Tokenizer for Compact Video Discretization
by: Tan, Zhentao, et al.
Published: (2024)

Q-Sched: Pushing the Boundaries of Few-Step Diffusion Models with Quantization-Aware Scheduling
by: Frumkin, Natalia, et al.
Published: (2025)

Efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deployment
by: Lu, Haisheng, et al.
Published: (2024)

GAT-NeRF: Geometry-Aware-Transformer Enhanced Neural Radiance Fields for High-Fidelity 4D Facial Avatars
by: Chang, Zhe, et al.
Published: (2026)

Echo4DIR: 4D Implicit Heart Reconstruction from 2D Echocardiography Videos
by: Liu, Yanan, et al.
Published: (2026)

Hyp2Former: Hierarchy-Aware Hyperbolic Embeddings for Open-Set Panoptic Segmentation
by: Lu, Yao, et al.
Published: (2026)

Channel-wise Vector Quantization
by: Song, Wei, et al.
Published: (2026)