| Main Authors: | Chen, Leyang; Wu, Junyi; Li, Zhiteng; Zhang, Yulun |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.15852 |
Similar Items
QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation
by: Wu, Junyi, et al.
Published: (2025)
FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing
by: Wu, Junyi, et al.
Published: (2025)
Evict3R: Training-Free Token Eviction for Memory-Bounded Streaming Visual Geometry Transformers
by: Mahdi, Soroush, et al.
Published: (2025)
DVD-Quant: Data-free Video Diffusion Transformers Quantization
by: Li, Zhiteng, et al.
Published: (2025)
BinaryHPE: 3D Human Pose and Shape Estimation via Binarization
by: Li, Zhiteng, et al.
Published: (2023)
WinT3R: Window-Based Streaming Reconstruction with Camera Token Pool
by: Li, Zizun, et al.
Published: (2025)
VEQ: Modality-Adaptive Quantization for MoE Vision-Language Models
by: Qin, Guangshuo, et al.
Published: (2026)
HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution
by: Zhang, Xiang, et al.
Published: (2024)
FlashClear: Ultra-Fast Image Content Removal via Efficient Step Distillation and Feature Caching
by: Tang, Yixin, et al.
Published: (2026)
BiMaCoSR: Binary One-Step Diffusion Model Leveraging Flexible Matrix Compression for Real Super-Resolution
by: Liu, Kai, et al.
Published: (2025)
CondiQuant: Condition Number Based Low-Bit Quantization for Image Super-Resolution
by: Liu, Kai, et al.
Published: (2025)
StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams
by: Wu, Zike, et al.
Published: (2025)
DensifyBeforehand: LiDAR-assisted Content-aware Densification for Efficient and Quality 3D Gaussian Splatting
by: Patt, Phurtivilai, et al.
Published: (2025)
Prior-guided Hierarchical Harmonization Network for Efficient Image Dehazing
by: Su, Xiongfei, et al.
Published: (2025)
HorizonStream: Long-Horizon Attention for Streaming 3D Reconstruction
by: Cheng, Chong, et al.
Published: (2026)
DreamCraft3D++: Efficient Hierarchical 3D Generation with Multi-Plane Reconstruction Model
by: Sun, Jingxiang, et al.
Published: (2024)
StreamGS: Online Generalizable Gaussian Splatting Reconstruction for Unposed Image Streams
by: LI, Yang, et al.
Published: (2025)
EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly
by: Pan, Xiaokun, et al.
Published: (2025)
Accelerating Streaming Video Large Language Models via Hierarchical Token Compression
by: Wang, Yiyu, et al.
Published: (2025)
TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos
by: Yao, Linli, et al.
Published: (2025)
Online3R: Online Learning for Consistent Sequential Reconstruction Based on Geometry Foundation Model
by: Zhou, Shunkai, et al.
Published: (2026)
Towards Unified 3D Hair Reconstruction from Single-View Portraits
by: Zheng, Yujian, et al.
Published: (2024)
TokenSeg: Efficient 3D Medical Image Segmentation via Hierarchical Visual Token Compression
by: Zeng, Sen, et al.
Published: (2026)
HRGS: Hierarchical Gaussian Splatting for Memory-Efficient High-Resolution 3D Reconstruction
by: Li, Changbai, et al.
Published: (2025)
Geometric Context Transformer for Streaming 3D Reconstruction
by: Chen, Lin-Zhuo, et al.
Published: (2026)
DenoiseGS: Gaussian Reconstruction Model for Burst Denoising
by: Cheng, Yongsen, et al.
Published: (2025)
4DGCPro: Efficient Hierarchical 4D Gaussian Compression for Progressive Volumetric Video Streaming
by: Zheng, Zihan, et al.
Published: (2025)
StreamingTOM: Streaming Token Compression for Efficient Video Understanding
by: Chen, Xueyi, et al.
Published: (2025)
StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding
by: Jin, Xinqi, et al.
Published: (2025)
Spark3R: Asymmetric Token Reduction Makes Fast Feed-Forward 3D Reconstruction
by: Tang, Zecheng, et al.
Published: (2026)
AdaSVD: Adaptive Singular Value Decomposition for Large Language Models
by: Li, Zhiteng, et al.
Published: (2025)
HCC-3D: Hierarchical Compensatory Compression for 98% 3D Token Reduction in Vision-Language Models
by: Zhang, Liheng, et al.
Published: (2025)
Ray-Aware Pointer Memory with Adaptive Updates for Streaming 3D Reconstruction
by: Li, Feifei, et al.
Published: (2026)
InfVSR: Toward Consistency-Driven Streaming Generative Video Super-Resolution
by: Zhang, Ziqing, et al.
Published: (2025)
LONG3R: Long Sequence Streaming 3D Reconstruction
by: Chen, Zhuoguang, et al.
Published: (2025)
Hierarchical Separable Video Transformer for Snapshot Compressive Imaging
by: Wang, Ping, et al.
Published: (2024)
PlanViz: Evaluating Planning-Oriented Image Generation and Editing for Computer-Use Tasks
by: Li, Junxian, et al.
Published: (2026)
Spherical Geometry Diffusion: Generating High-quality 3D Face Geometry via Sphere-anchored Representations
by: Zhang, Junyi, et al.
Published: (2026)
IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction
by: Li, Hao, et al.
Published: (2025)
LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant
by: Li, Wei, et al.
Published: (2025)