Saved in:
| Main Authors: | Tong, Enwei, Bai, Yuanchao, Zhu, Yao, Jiang, Junjun, Liu, Xianming |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.05809 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Semantic Ensemble Loss and Latent Refinement for High-Fidelity Neural Image Compression
by: Li, Daxin, et al.
Published: (2024)
by: Li, Daxin, et al.
Published: (2024)
GroupedMixer: An Entropy Model with Group-wise Token-Mixers for Learned Image Compression
by: Li, Daxin, et al.
Published: (2024)
by: Li, Daxin, et al.
Published: (2024)
Rethinking Autoregressive Models for Lossless Image Compression via Hierarchical Parallelism and Progressive Adaptation
by: Li, Daxin, et al.
Published: (2025)
by: Li, Daxin, et al.
Published: (2025)
PVContext: Hybrid Context Model for Point Cloud Compression
by: Zhang, Guoqing, et al.
Published: (2024)
by: Zhang, Guoqing, et al.
Published: (2024)
CALLIC: Content Adaptive Learning for Lossless Image Compression
by: Li, Daxin, et al.
Published: (2024)
by: Li, Daxin, et al.
Published: (2024)
Learning Lossless Compression for High Bit-Depth Volumetric Medical Image
by: Wang, Kai, et al.
Published: (2024)
by: Wang, Kai, et al.
Published: (2024)
IVC-Prune: Revealing the Implicit Visual Coordinates in LVLMs for Vision Token Pruning
by: Sun, Zhichao, et al.
Published: (2026)
by: Sun, Zhichao, et al.
Published: (2026)
VRS-UIE: Value-Driven Reordering Scanning for Underwater Image Enhancement
by: Jiang, Kui, et al.
Published: (2025)
by: Jiang, Kui, et al.
Published: (2025)
PruneVid: Visual Token Pruning for Efficient Video Large Language Models
by: Huang, Xiaohu, et al.
Published: (2024)
by: Huang, Xiaohu, et al.
Published: (2024)
Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach
by: Wu, Gang, et al.
Published: (2024)
by: Wu, Gang, et al.
Published: (2024)
Learning from History: Task-agnostic Model Contrastive Learning for Image Restoration
by: Wu, Gang, et al.
Published: (2023)
by: Wu, Gang, et al.
Published: (2023)
Boosting All-in-One Image Restoration via Self-Improved Privilege Learning
by: Wu, Gang, et al.
Published: (2025)
by: Wu, Gang, et al.
Published: (2025)
GridPrune: From "Where to Look" to "What to Select" in Visual Token Pruning for MLLMs
by: Duan, Yuxiang, et al.
Published: (2025)
by: Duan, Yuxiang, et al.
Published: (2025)
Spatial Annealing for Efficient Few-shot Neural Rendering
by: Xiao, Yuru, et al.
Published: (2024)
by: Xiao, Yuru, et al.
Published: (2024)
UTPTrack: Towards Simple and Unified Token Pruning for Visual Tracking
by: Wu, Hao, et al.
Published: (2026)
by: Wu, Hao, et al.
Published: (2026)
UDPNet: Unleashing Depth-based Priors for Robust Image Dehazing
by: Zuo, Zengyuan, et al.
Published: (2026)
by: Zuo, Zengyuan, et al.
Published: (2026)
Image Deblurring by Exploring In-depth Properties of Transformer
by: Liang, Pengwei, et al.
Published: (2023)
by: Liang, Pengwei, et al.
Published: (2023)
LLV-FSR: Exploiting Large Language-Vision Prior for Face Super-resolution
by: Wang, Chenyang, et al.
Published: (2024)
by: Wang, Chenyang, et al.
Published: (2024)
Factorized Visual Tokenization and Generation
by: Bai, Zechen, et al.
Published: (2024)
by: Bai, Zechen, et al.
Published: (2024)
Fully $1\times1$ Convolutional Network for Lightweight Image Super-Resolution
by: Wu, Gang, et al.
Published: (2023)
by: Wu, Gang, et al.
Published: (2023)
FocusLLaVA: A Coarse-to-Fine Approach for Efficient and Effective Visual Token Compression
by: Zhu, Yuke, et al.
Published: (2024)
by: Zhu, Yuke, et al.
Published: (2024)
HAWK: Head Importance-Aware Visual Token Pruning in Multimodal Models
by: Zhu, Qihui, et al.
Published: (2026)
by: Zhu, Qihui, et al.
Published: (2026)
IDPruner: Harmonizing Importance and Diversity in Visual Token Pruning for MLLMs
by: Tan, Yifan, et al.
Published: (2026)
by: Tan, Yifan, et al.
Published: (2026)
StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding
by: Jin, Xinqi, et al.
Published: (2025)
by: Jin, Xinqi, et al.
Published: (2025)
Bridging the Semantic-Action Gap in Visual Token Pruning for Efficient VLA Inference
by: Liu, Ziyan, et al.
Published: (2025)
by: Liu, Ziyan, et al.
Published: (2025)
DSwinIR: Rethinking Window-based Attention for Image Restoration
by: Wu, Gang, et al.
Published: (2025)
by: Wu, Gang, et al.
Published: (2025)
COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting
by: Zhang, Jiaxin, et al.
Published: (2025)
by: Zhang, Jiaxin, et al.
Published: (2025)
Beyond Degradation Redundancy: Contrastive Prompt Learning for All-in-One Image Restoration
by: Wu, Gang, et al.
Published: (2025)
by: Wu, Gang, et al.
Published: (2025)
When Token Pruning is Worse than Random: Understanding Visual Token Information in VLLMs
by: Wang, Yahong, et al.
Published: (2025)
by: Wang, Yahong, et al.
Published: (2025)
CROP: Contextual Region-Oriented Visual Token Pruning
by: Guo, Jiawei, et al.
Published: (2025)
by: Guo, Jiawei, et al.
Published: (2025)
HM-Talker: Hybrid Motion Modeling for High-Fidelity Talking Head Synthesis
by: Liu, Shiyu, et al.
Published: (2025)
by: Liu, Shiyu, et al.
Published: (2025)
Improving Domain Generalization in Self-supervised Monocular Depth Estimation via Stabilized Adversarial Training
by: Yao, Yuanqi, et al.
Published: (2024)
by: Yao, Yuanqi, et al.
Published: (2024)
Exploiting Self-Supervised Constraints in Image Super-Resolution
by: Wu, Gang, et al.
Published: (2024)
by: Wu, Gang, et al.
Published: (2024)
OTPrune: Distribution-Aligned Visual Token Pruning via Optimal Transport
by: Chen, Xiwen, et al.
Published: (2026)
by: Chen, Xiwen, et al.
Published: (2026)
Pear: Pruning and Sharing Adapters in Visual Parameter-Efficient Fine-Tuning
by: Zhong, Yibo, et al.
Published: (2024)
by: Zhong, Yibo, et al.
Published: (2024)
Refining CLIP's Spatial Awareness: A Visual-Centric Perspective
by: Qiu, Congpei, et al.
Published: (2025)
by: Qiu, Congpei, et al.
Published: (2025)
EntropyPrune: Matrix Entropy Guided Visual Token Pruning for Multimodal Large Language Models
by: Wang, Yahong, et al.
Published: (2026)
by: Wang, Yahong, et al.
Published: (2026)
TrimTokenator: Towards Adaptive Visual Token Pruning for Large Multimodal Models
by: Zhang, Hao, et al.
Published: (2025)
by: Zhang, Hao, et al.
Published: (2025)
FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion
by: Sun, Pihai, et al.
Published: (2025)
by: Sun, Pihai, et al.
Published: (2025)
SDGE: Stereo Guided Depth Estimation for 360$^\circ$ Camera Sets
by: Xu, Jialei, et al.
Published: (2024)
by: Xu, Jialei, et al.
Published: (2024)
Similar Items
-
Semantic Ensemble Loss and Latent Refinement for High-Fidelity Neural Image Compression
by: Li, Daxin, et al.
Published: (2024) -
GroupedMixer: An Entropy Model with Group-wise Token-Mixers for Learned Image Compression
by: Li, Daxin, et al.
Published: (2024) -
Rethinking Autoregressive Models for Lossless Image Compression via Hierarchical Parallelism and Progressive Adaptation
by: Li, Daxin, et al.
Published: (2025) -
PVContext: Hybrid Context Model for Point Cloud Compression
by: Zhang, Guoqing, et al.
Published: (2024) -
CALLIC: Content Adaptive Learning for Lossless Image Compression
by: Li, Daxin, et al.
Published: (2024)