| Main Authors: | Gao, Yuhan, Li, Xinqing, He, Xin, Li, Bing, Zhu, Xinzhong, Cheng, Ming-Ming, Liu, Yun |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.27661 |
Similar Items
Towards Joint Quantization and Token Pruning of Vision-Language Models
by: Li, Xinqing, et al.
Published: (2026)
CATP: Confidence-Aware Token Pruning for Camouflaged Object Detection
by: Gao, Yuhan, et al.
Published: (2026)
A Comprehensive Survey on World Models for Embodied AI
by: Li, Xinqing, et al.
Published: (2025)
Towards Single-Source Domain Generalized Object Detection via Causal Visual Prompts
by: Li, Chen, et al.
Published: (2025)
Mamba YOLO: A Simple Baseline for Object Detection with State Space Model
by: Wang, Zeyu, et al.
Published: (2024)
Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation
by: Xie, Guohuan, et al.
Published: (2026)
Unifying Heterogeneous Multi-Modal Remote Sensing Detection Via Language-Pivoted Pretraining
by: Li, Yuxuan, et al.
Published: (2026)
RemDet: Rethinking Efficient Model Design for UAV Object Detection
by: Li, Chen, et al.
Published: (2024)
Robust Lane Detection with Wavelet-Enhanced Context Modeling and Adaptive Sampling
by: Li, Kunyang, et al.
Published: (2025)
Towards RAW Object Detection in Diverse Conditions
by: Li, Zhong-Yu, et al.
Published: (2024)
A Comprehensive Survey on Video Scene Parsing: Advances, Challenges, and Prospects
by: Xie, Guohuan, et al.
Published: (2025)
A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models
by: Zeng, Quan-Sheng, et al.
Published: (2025)
Task-Adaptive Saliency Guidance for Exemplar-free Class Incremental Learning
by: Liu, Xialei, et al.
Published: (2022)
Suppressing Gradient Conflict for Generalizable Deepfake Detection
by: Liu, Ming-Hui, et al.
Published: (2025)
Point Cloud Quantization through Multimodal Prompting for 3D Understanding
by: Li, Hongxuan, et al.
Published: (2025)
Block-based Symmetric Pruning and Fusion for Efficient Vision Transformers
by: Hsieh, Yi-Kuan, et al.
Published: (2025)
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection
by: Chen, Yuming, et al.
Published: (2023)
Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection
by: Yuan, Xinbin, et al.
Published: (2025)
Attention Debiasing for Token Pruning in Vision Language Models
by: Zhao, Kai, et al.
Published: (2025)
SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection
by: Li, Yuxuan, et al.
Published: (2024)
Rethinking RGB-D Salient Object Detection: Models, Data Sets, and Large-Scale Benchmarks
by: Fan, Deng-Ping, et al.
Published: (2019)
BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection
by: Liu, Jiaming, et al.
Published: (2022)
NuWa: Deriving Lightweight Task-Specific Vision Transformers for Edge Devices
by: Wei, Ziteng, et al.
Published: (2025)
Balanced Multi-view Clustering
by: Li, Zhenglai, et al.
Published: (2025)
Location-guided Head Pose Estimation for Fisheye Image
by: Li, Bing, et al.
Published: (2024)
Predictive Sample Assignment for Semantically Coherent Out-of-Distribution Detection
by: Peng, Zhimao, et al.
Published: (2025)
AuxDet: Auxiliary Metadata Matters for Omni-Domain Infrared Small Target Detection
by: Shi, Yangting, et al.
Published: (2025)
Multi-Cue Adaptive Visual Token Pruning for Large Vision-Language Models
by: Luan, Bozhi, et al.
Published: (2025)
CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning
by: Zhu, Wei, et al.
Published: (2024)
Learning Real Facial Concepts for Independent Deepfake Detection
by: Liu, Ming-Hui, et al.
Published: (2025)
Multi-view Clustering via Bi-level Decoupling and Consistency Learning
by: Dong, Shihao, et al.
Published: (2025)
RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification
by: Zou, Guangwenjie, et al.
Published: (2024)
When W4A4 Breaks Camouflaged Object Detection: Token-Group Dual-Constraint Activation Quantization
by: Li, Tianqi, et al.
Published: (2026)
FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning
by: Lv, Qingsong, et al.
Published: (2024)
Towards Generalizable Deepfake Detection via Real Distribution Bias Correction
by: Liu, Ming-Hui, et al.
Published: (2026)
AIVD: Adaptive Edge-Cloud Collaboration for Accurate and Efficient Industrial Visual Detection
by: Hu, Yunqing, et al.
Published: (2026)
Multi-Token Enhancing for Vision Representation Learning
by: Li, Zhong-Yu, et al.
Published: (2024)
Rapid Salient Object Detection with Difference Convolutional Neural Networks
by: Su, Zhuo, et al.
Published: (2025)
AdaptInfer: Adaptive Token Pruning for Vision-Language Model Inference with Dynamical Text Guidance
by: Zhang, Weichen, et al.
Published: (2025)
RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark
by: Zhang, Xin, et al.
Published: (2025)