:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chen, Leyang, Wu, Junyi, Li, Zhiteng, Zhang, Yulun
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2605.15852
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation
by: Wu, Junyi, et al.
Published: (2025)

FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing
by: Wu, Junyi, et al.
Published: (2025)

Evict3R: Training-Free Token Eviction for Memory-Bounded Streaming Visual Geometry Transformers
by: Mahdi, Soroush, et al.
Published: (2025)

DVD-Quant: Data-free Video Diffusion Transformers Quantization
by: Li, Zhiteng, et al.
Published: (2025)

BinaryHPE: 3D Human Pose and Shape Estimation via Binarization
by: Li, Zhiteng, et al.
Published: (2023)

WinT3R: Window-Based Streaming Reconstruction with Camera Token Pool
by: Li, Zizun, et al.
Published: (2025)

VEQ: Modality-Adaptive Quantization for MoE Vision-Language Models
by: Qin, Guangshuo, et al.
Published: (2026)

HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution
by: Zhang, Xiang, et al.
Published: (2024)

FlashClear: Ultra-Fast Image Content Removal via Efficient Step Distillation and Feature Caching
by: Tang, Yixin, et al.
Published: (2026)

BiMaCoSR: Binary One-Step Diffusion Model Leveraging Flexible Matrix Compression for Real Super-Resolution
by: Liu, Kai, et al.
Published: (2025)

CondiQuant: Condition Number Based Low-Bit Quantization for Image Super-Resolution
by: Liu, Kai, et al.
Published: (2025)

StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams
by: Wu, Zike, et al.
Published: (2025)

DensifyBeforehand: LiDAR-assisted Content-aware Densification for Efficient and Quality 3D Gaussian Splatting
by: Patt, Phurtivilai, et al.
Published: (2025)

Prior-guided Hierarchical Harmonization Network for Efficient Image Dehazing
by: Su, Xiongfei, et al.
Published: (2025)

HorizonStream: Long-Horizon Attention for Streaming 3D Reconstruction
by: Cheng, Chong, et al.
Published: (2026)

DreamCraft3D++: Efficient Hierarchical 3D Generation with Multi-Plane Reconstruction Model
by: Sun, Jingxiang, et al.
Published: (2024)

StreamGS: Online Generalizable Gaussian Splatting Reconstruction for Unposed Image Streams
by: LI, Yang, et al.
Published: (2025)

EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly
by: Pan, Xiaokun, et al.
Published: (2025)

Accelerating Streaming Video Large Language Models via Hierarchical Token Compression
by: Wang, Yiyu, et al.
Published: (2025)

TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos
by: Yao, Linli, et al.
Published: (2025)

Online3R: Online Learning for Consistent Sequential Reconstruction Based on Geometry Foundation Model
by: Zhou, Shunkai, et al.
Published: (2026)

Towards Unified 3D Hair Reconstruction from Single-View Portraits
by: Zheng, Yujian, et al.
Published: (2024)

TokenSeg: Efficient 3D Medical Image Segmentation via Hierarchical Visual Token Compression
by: Zeng, Sen, et al.
Published: (2026)

HRGS: Hierarchical Gaussian Splatting for Memory-Efficient High-Resolution 3D Reconstruction
by: Li, Changbai, et al.
Published: (2025)

Geometric Context Transformer for Streaming 3D Reconstruction
by: Chen, Lin-Zhuo, et al.
Published: (2026)

DenoiseGS: Gaussian Reconstruction Model for Burst Denoising
by: Cheng, Yongsen, et al.
Published: (2025)

4DGCPro: Efficient Hierarchical 4D Gaussian Compression for Progressive Volumetric Video Streaming
by: Zheng, Zihan, et al.
Published: (2025)

StreamingTOM: Streaming Token Compression for Efficient Video Understanding
by: Chen, Xueyi, et al.
Published: (2025)

StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding
by: Jin, Xinqi, et al.
Published: (2025)

Spark3R: Asymmetric Token Reduction Makes Fast Feed-Forward 3D Reconstruction
by: Tang, Zecheng, et al.
Published: (2026)

AdaSVD: Adaptive Singular Value Decomposition for Large Language Models
by: Li, Zhiteng, et al.
Published: (2025)

HCC-3D: Hierarchical Compensatory Compression for 98% 3D Token Reduction in Vision-Language Models
by: Zhang, Liheng, et al.
Published: (2025)

Ray-Aware Pointer Memory with Adaptive Updates for Streaming 3D Reconstruction
by: Li, Feifei, et al.
Published: (2026)

InfVSR: Toward Consistency-Driven Streaming Generative Video Super-Resolution
by: Zhang, Ziqing, et al.
Published: (2025)

LONG3R: Long Sequence Streaming 3D Reconstruction
by: Chen, Zhuoguang, et al.
Published: (2025)

Hierarchical Separable Video Transformer for Snapshot Compressive Imaging
by: Wang, Ping, et al.
Published: (2024)

PlanViz: Evaluating Planning-Oriented Image Generation and Editing for Computer-Use Tasks
by: Li, Junxian, et al.
Published: (2026)

Spherical Geometry Diffusion: Generating High-quality 3D Face Geometry via Sphere-anchored Representations
by: Zhang, Junyi, et al.
Published: (2026)

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction
by: Li, Hao, et al.
Published: (2025)

LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant
by: Li, Wei, et al.
Published: (2025)