:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Tong, Enwei, Bai, Yuanchao, Zhu, Yao, Jiang, Junjun, Liu, Xianming
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.05809
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Semantic Ensemble Loss and Latent Refinement for High-Fidelity Neural Image Compression
by: Li, Daxin, et al.
Published: (2024)

GroupedMixer: An Entropy Model with Group-wise Token-Mixers for Learned Image Compression
by: Li, Daxin, et al.
Published: (2024)

Rethinking Autoregressive Models for Lossless Image Compression via Hierarchical Parallelism and Progressive Adaptation
by: Li, Daxin, et al.
Published: (2025)

PVContext: Hybrid Context Model for Point Cloud Compression
by: Zhang, Guoqing, et al.
Published: (2024)

CALLIC: Content Adaptive Learning for Lossless Image Compression
by: Li, Daxin, et al.
Published: (2024)

Learning Lossless Compression for High Bit-Depth Volumetric Medical Image
by: Wang, Kai, et al.
Published: (2024)

IVC-Prune: Revealing the Implicit Visual Coordinates in LVLMs for Vision Token Pruning
by: Sun, Zhichao, et al.
Published: (2026)

VRS-UIE: Value-Driven Reordering Scanning for Underwater Image Enhancement
by: Jiang, Kui, et al.
Published: (2025)

PruneVid: Visual Token Pruning for Efficient Video Large Language Models
by: Huang, Xiaohu, et al.
Published: (2024)

Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach
by: Wu, Gang, et al.
Published: (2024)

Learning from History: Task-agnostic Model Contrastive Learning for Image Restoration
by: Wu, Gang, et al.
Published: (2023)

Boosting All-in-One Image Restoration via Self-Improved Privilege Learning
by: Wu, Gang, et al.
Published: (2025)

GridPrune: From "Where to Look" to "What to Select" in Visual Token Pruning for MLLMs
by: Duan, Yuxiang, et al.
Published: (2025)

Spatial Annealing for Efficient Few-shot Neural Rendering
by: Xiao, Yuru, et al.
Published: (2024)

UTPTrack: Towards Simple and Unified Token Pruning for Visual Tracking
by: Wu, Hao, et al.
Published: (2026)

UDPNet: Unleashing Depth-based Priors for Robust Image Dehazing
by: Zuo, Zengyuan, et al.
Published: (2026)

Image Deblurring by Exploring In-depth Properties of Transformer
by: Liang, Pengwei, et al.
Published: (2023)

LLV-FSR: Exploiting Large Language-Vision Prior for Face Super-resolution
by: Wang, Chenyang, et al.
Published: (2024)

Factorized Visual Tokenization and Generation
by: Bai, Zechen, et al.
Published: (2024)

Fully $1\times1$ Convolutional Network for Lightweight Image Super-Resolution
by: Wu, Gang, et al.
Published: (2023)

FocusLLaVA: A Coarse-to-Fine Approach for Efficient and Effective Visual Token Compression
by: Zhu, Yuke, et al.
Published: (2024)

HAWK: Head Importance-Aware Visual Token Pruning in Multimodal Models
by: Zhu, Qihui, et al.
Published: (2026)

IDPruner: Harmonizing Importance and Diversity in Visual Token Pruning for MLLMs
by: Tan, Yifan, et al.
Published: (2026)

StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding
by: Jin, Xinqi, et al.
Published: (2025)

Bridging the Semantic-Action Gap in Visual Token Pruning for Efficient VLA Inference
by: Liu, Ziyan, et al.
Published: (2025)

DSwinIR: Rethinking Window-based Attention for Image Restoration
by: Wu, Gang, et al.
Published: (2025)

COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting
by: Zhang, Jiaxin, et al.
Published: (2025)

Beyond Degradation Redundancy: Contrastive Prompt Learning for All-in-One Image Restoration
by: Wu, Gang, et al.
Published: (2025)

When Token Pruning is Worse than Random: Understanding Visual Token Information in VLLMs
by: Wang, Yahong, et al.
Published: (2025)

CROP: Contextual Region-Oriented Visual Token Pruning
by: Guo, Jiawei, et al.
Published: (2025)

HM-Talker: Hybrid Motion Modeling for High-Fidelity Talking Head Synthesis
by: Liu, Shiyu, et al.
Published: (2025)

Improving Domain Generalization in Self-supervised Monocular Depth Estimation via Stabilized Adversarial Training
by: Yao, Yuanqi, et al.
Published: (2024)

Exploiting Self-Supervised Constraints in Image Super-Resolution
by: Wu, Gang, et al.
Published: (2024)

OTPrune: Distribution-Aligned Visual Token Pruning via Optimal Transport
by: Chen, Xiwen, et al.
Published: (2026)

Pear: Pruning and Sharing Adapters in Visual Parameter-Efficient Fine-Tuning
by: Zhong, Yibo, et al.
Published: (2024)

Refining CLIP's Spatial Awareness: A Visual-Centric Perspective
by: Qiu, Congpei, et al.
Published: (2025)

EntropyPrune: Matrix Entropy Guided Visual Token Pruning for Multimodal Large Language Models
by: Wang, Yahong, et al.
Published: (2026)

TrimTokenator: Towards Adaptive Visual Token Pruning for Large Multimodal Models
by: Zhang, Hao, et al.
Published: (2025)

FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion
by: Sun, Pihai, et al.
Published: (2025)

SDGE: Stereo Guided Depth Estimation for 360$^\circ$ Camera Sets
by: Xu, Jialei, et al.
Published: (2024)