| Main Authors: | Liu, Xinqi, Zhou, Li, Zhou, Zikun, Chen, Jianqiu, He, Zhenyu |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.15459 |
Similar Items
ZeroBP: Learning Position-Aware Correspondence for Zero-shot 6D Pose Estimation in Bin-Picking
by: Chen, Jianqiu, et al.
Published: (2025)
Harnessing Vision-Language Pretrained Models with Temporal-Aware Adaptation for Referring Video Object Segmentation
by: Zhou, Zikun, et al.
Published: (2024)
ZeroPose: CAD-Prompted Zero-shot Object 6D Pose Estimation in Cluttered Scenes
by: Chen, Jianqiu, et al.
Published: (2023)
RTracker: Recoverable Tracking via PN Tree Structured Memory
by: Huang, Yuqing, et al.
Published: (2024)
RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation
by: Liu, Jiaming, et al.
Published: (2024)
Evolving Prompt Adaptation for Vision-Language Models
by: Zhang, Enming, et al.
Published: (2026)
VL-Mamba: Exploring State Space Models for Multimodal Learning
by: Qiao, Yanyuan, et al.
Published: (2024)
Pan-Mamba: Effective pan-sharpening with State Space Model
by: He, Xuanhua, et al.
Published: (2024)
Selective Visual Prompting in Vision Mamba
by: Yao, Yifeng, et al.
Published: (2024)
Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model
by: Zhou, Li, et al.
Published: (2024)
MemoryMamba: Memory-Augmented State Space Model for Defect Recognition
by: Wang, Qianning, et al.
Published: (2024)
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation
by: Liao, Bencheng, et al.
Published: (2025)
MambaTrack: A Simple Baseline for Multiple Object Tracking with State Space Model
by: Xiao, Changcheng, et al.
Published: (2024)
MambaLCT: Boosting Tracking via Long-term Context State Space Model
by: Li, Xiaohai, et al.
Published: (2024)
MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking
by: Zhang, Chunhui, et al.
Published: (2024)
Shuffle Mamba: State Space Models with Random Shuffle for Multi-Modal Image Fusion
by: Cao, Ke, et al.
Published: (2024)
VSSD: Vision Mamba with Non-Causal State Space Duality
by: Shi, Yuheng, et al.
Published: (2024)
MambaMOT: State-Space Model as Motion Predictor for Multi-Object Tracking
by: Huang, Hsiang-Wei, et al.
Published: (2024)
SE-VLN: A Self-Evolving Vision-Language Navigation Framework Based on Multimodal Large Language Models
by: Dong, Xiangyu, et al.
Published: (2025)
VideoMamba: State Space Model for Efficient Video Understanding
by: Li, Kunchang, et al.
Published: (2024)
DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM
by: Li, Xuchen, et al.
Published: (2024)
SF-Mamba: Rethinking State Space Model for Vision
by: Yoshimura, Masakazu, et al.
Published: (2026)
OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models
by: Zou, Jialv, et al.
Published: (2025)
MambaMIM: Pre-training Mamba with State Space Token Interpolation and its Application to Medical Image Segmentation
by: Tang, Fenghe, et al.
Published: (2024)
Modality-Decoupled RGB-Thermal Object Detector via Query Fusion
by: Tian, Chao, et al.
Published: (2026)
SportMamba: Adaptive Non-Linear Multi-Object Tracking with State Space Models for Team Sports
by: Khanna, Dheeraj, et al.
Published: (2025)
Point Cloud Mamba: Point Cloud Learning via State Space Model
by: Zhang, Tao, et al.
Published: (2024)
SMTrack: State-Aware Mamba for Efficient Temporal Modeling in Visual Tracking
by: Ma, Yinchao, et al.
Published: (2026)
MambaTrans: Multimodal Fusion Image Translation via Large Language Model Priors for Downstream Visual Tasks
by: Xu, Yushen, et al.
Published: (2025)
RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing
by: Zhou, Huiling, et al.
Published: (2024)
InsectMamba: Insect Pest Classification with State Space Model
by: Wang, Qianning, et al.
Published: (2024)
Mamba-FETrack: Frame-Event Tracking via State Space Model
by: Huang, Ju, et al.
Published: (2024)
DefMamba: Deformable Visual State Space Model
by: Liu, Leiye, et al.
Published: (2025)
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection
by: He, Haoyang, et al.
Published: (2024)
RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining
by: Wu, Hongtao, et al.
Published: (2024)
Multimodal Instruction Tuning with Hybrid State Space Models
by: Zhou, Jianing, et al.
Published: (2024)
GlobalMamba: Global Image Serialization for Vision Mamba
by: Wang, Chengkun, et al.
Published: (2024)
CardiacMamba: A Multimodal RGB-RF Fusion Framework with State Space Models for Remote Physiological Measurement
by: Wu, Zheng, et al.
Published: (2025)
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
by: Zhu, Lianghui, et al.
Published: (2024)
Mamba-CAD: State Space Model For 3D Computer-Aided Design Generative Modeling
by: Li, Xueyang, et al.
Published: (2026)