:: Library Catalog

Bibliographic Details
Main Authors:	Liu, Xinqi; Zhou, Li; Zhou, Zikun; Chen, Jianqiu; He, Zhenyu
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2411.15459

Similar Items

ZeroBP: Learning Position-Aware Correspondence for Zero-shot 6D Pose Estimation in Bin-Picking
by: Chen, Jianqiu, et al.
Published: (2025)

Harnessing Vision-Language Pretrained Models with Temporal-Aware Adaptation for Referring Video Object Segmentation
by: Zhou, Zikun, et al.
Published: (2024)

ZeroPose: CAD-Prompted Zero-shot Object 6D Pose Estimation in Cluttered Scenes
by: Chen, Jianqiu, et al.
Published: (2023)

RTracker: Recoverable Tracking via PN Tree Structured Memory
by: Huang, Yuqing, et al.
Published: (2024)

RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation
by: Liu, Jiaming, et al.
Published: (2024)

Evolving Prompt Adaptation for Vision-Language Models
by: Zhang, Enming, et al.
Published: (2026)

VL-Mamba: Exploring State Space Models for Multimodal Learning
by: Qiao, Yanyuan, et al.
Published: (2024)

Pan-Mamba: Effective pan-sharpening with State Space Model
by: He, Xuanhua, et al.
Published: (2024)

Selective Visual Prompting in Vision Mamba
by: Yao, Yifeng, et al.
Published: (2024)

Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model
by: Zhou, Li, et al.
Published: (2024)

MemoryMamba: Memory-Augmented State Space Model for Defect Recognition
by: Wang, Qianning, et al.
Published: (2024)

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation
by: Liao, Bencheng, et al.
Published: (2025)

MambaTrack: A Simple Baseline for Multiple Object Tracking with State Space Model
by: Xiao, Changcheng, et al.
Published: (2024)

MambaLCT: Boosting Tracking via Long-term Context State Space Model
by: Li, Xiaohai, et al.
Published: (2024)

MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking
by: Zhang, Chunhui, et al.
Published: (2024)

Shuffle Mamba: State Space Models with Random Shuffle for Multi-Modal Image Fusion
by: Cao, Ke, et al.
Published: (2024)

VSSD: Vision Mamba with Non-Causal State Space Duality
by: Shi, Yuheng, et al.
Published: (2024)

MambaMOT: State-Space Model as Motion Predictor for Multi-Object Tracking
by: Huang, Hsiang-Wei, et al.
Published: (2024)

SE-VLN: A Self-Evolving Vision-Language Navigation Framework Based on Multimodal Large Language Models
by: Dong, Xiangyu, et al.
Published: (2025)

VideoMamba: State Space Model for Efficient Video Understanding
by: Li, Kunchang, et al.
Published: (2024)

DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM
by: Li, Xuchen, et al.
Published: (2024)

SF-Mamba: Rethinking State Space Model for Vision
by: Yoshimura, Masakazu, et al.
Published: (2026)

OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models
by: Zou, Jialv, et al.
Published: (2025)

MambaMIM: Pre-training Mamba with State Space Token Interpolation and its Application to Medical Image Segmentation
by: Tang, Fenghe, et al.
Published: (2024)

Modality-Decoupled RGB-Thermal Object Detector via Query Fusion
by: Tian, Chao, et al.
Published: (2026)

SportMamba: Adaptive Non-Linear Multi-Object Tracking with State Space Models for Team Sports
by: Khanna, Dheeraj, et al.
Published: (2025)

Point Cloud Mamba: Point Cloud Learning via State Space Model
by: Zhang, Tao, et al.
Published: (2024)

SMTrack: State-Aware Mamba for Efficient Temporal Modeling in Visual Tracking
by: Ma, Yinchao, et al.
Published: (2026)

MambaTrans: Multimodal Fusion Image Translation via Large Language Model Priors for Downstream Visual Tasks
by: Xu, Yushen, et al.
Published: (2025)

RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing
by: Zhou, Huiling, et al.
Published: (2024)

InsectMamba: Insect Pest Classification with State Space Model
by: Wang, Qianning, et al.
Published: (2024)

Mamba-FETrack: Frame-Event Tracking via State Space Model
by: Huang, Ju, et al.
Published: (2024)

DefMamba: Deformable Visual State Space Model
by: Liu, Leiye, et al.
Published: (2025)

MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection
by: He, Haoyang, et al.
Published: (2024)

RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining
by: Wu, Hongtao, et al.
Published: (2024)

Multimodal Instruction Tuning with Hybrid State Space Models
by: Zhou, Jianing, et al.
Published: (2024)

GlobalMamba: Global Image Serialization for Vision Mamba
by: Wang, Chengkun, et al.
Published: (2024)

CardiacMamba: A Multimodal RGB-RF Fusion Framework with State Space Models for Remote Physiological Measurement
by: Wu, Zheng, et al.
Published: (2025)

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
by: Zhu, Lianghui, et al.
Published: (2024)

Mamba-CAD: State Space Model For 3D Computer-Aided Design Generative Modeling
by: Li, Xueyang, et al.
Published: (2026)