:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhu, Kai, Cui, Zhenyu, Zang, Zehua, Zhou, Jiahuan
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2603.24295
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

State Space Prompting via Gathering and Spreading Spatio-Temporal Information for Video Understanding
by: Zhou, Jiahuan, et al.
Published: (2025)

LaSSM: Efficient Semantic-Spatial Query Decoding via Local Aggregation and State Space Models for 3D Instance Segmentation
by: Yao, Lei, et al.
Published: (2026)

Bi-C2R: Bidirectional Continual Compatible Representation for Re-indexing Free Lifelong Person Re-identification
by: Cui, Zhenyu, et al.
Published: (2025)

CKDA: Cross-modality Knowledge Disentanglement and Alignment for Visible-Infrared Lifelong Person Re-identification
by: Cui, Zhenyu, et al.
Published: (2025)

VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory
by: Yu, Yifei, et al.
Published: (2025)

Test-Time Perturbation Learning with Delayed Feedback for Vision-Language-Action Models
by: Zang, Zehua, et al.
Published: (2026)

RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation
by: Ma, Xianping, et al.
Published: (2024)

UPP: Unified Point-Level Prompting for Robust Point Cloud Analysis
by: Ai, Zixiang, et al.
Published: (2025)

TrackSSM: A General Motion Predictor by State-Space Model
by: Hu, Bin, et al.
Published: (2024)

UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling
by: Li, Peiming, et al.
Published: (2025)

SSM Meets Video Diffusion Models: Efficient Long-Term Video Generation with Structured State Spaces
by: Oshima, Yuta, et al.
Published: (2024)

Class-aware Domain Knowledge Fusion and Fission for Continual Test-Time Adaptation
by: Zhou, Jiahuan, et al.
Published: (2025)

GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model
by: Ai, Zixiang, et al.
Published: (2025)

Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model
by: Zhu, Qinfeng, et al.
Published: (2024)

TCP-SSM: Efficient Vision State Space Models with Token-Conditioned Poles
by: Shoouri, Sara, et al.
Published: (2026)

TRUST: Test-Time Refinement using Uncertainty-Guided SSM Traverses
by: Dastani, Sahar, et al.
Published: (2025)

RSGMamba: Reliability-Aware Self-Gated State Space Model for Multimodal Semantic Segmentation
by: Xu, Guoan, et al.
Published: (2026)

Selective Visual Prompting in Vision Mamba
by: Yao, Yifeng, et al.
Published: (2024)

Vision Graph Prompting via Semantic Low-Rank Decomposition
by: Ai, Zixiang, et al.
Published: (2025)

VFIMamba: Video Frame Interpolation with State Space Models
by: Zhang, Guozhen, et al.
Published: (2024)

MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation
by: Chharia, Aviral, et al.
Published: (2025)

Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation
by: Du, Ye, et al.
Published: (2024)

CAPrompt: Cyclic Prompt Aggregation for Pre-Trained Model Based Class Incremental Learning
by: Li, Qiwei, et al.
Published: (2024)

MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking
by: Liu, Xinqi, et al.
Published: (2024)

Cortical-SSM: A Deep State Space Model for EEG and ECoG Motor Imagery Decoding
by: Suzuki, Shuntaro, et al.
Published: (2025)

Generative Refinement Networks for Visual Synthesis
by: Han, Jian, et al.
Published: (2026)

Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation
by: Zhang, Peng, et al.
Published: (2024)

Harnessing Vision-Language Pretrained Models with Temporal-Aware Adaptation for Referring Video Object Segmentation
by: Zhou, Zikun, et al.
Published: (2024)

PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation
by: Hu, Yin, et al.
Published: (2024)

Sparse Refinement for Efficient High-Resolution Semantic Segmentation
by: Liu, Zhijian, et al.
Published: (2024)

MambaVF: State Space Model for Efficient Video Fusion
by: Zhao, Zixiang, et al.
Published: (2026)

Cross-modal State Space Modeling for Real-time RGB-thermal Wild Scene Semantic Segmentation
by: Guo, Xiaodong, et al.
Published: (2025)

Learning Spatial-Semantic Features for Robust Video Object Segmentation
by: Li, Xin, et al.
Published: (2024)

RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining
by: Wu, Hongtao, et al.
Published: (2024)

SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
by: Fu, Yunxiang, et al.
Published: (2024)

RefineAnything: Multimodal Region-Specific Refinement for Perfect Local Details
by: Zhou, Dewei, et al.
Published: (2026)

Exploring State Space Model in Wavelet Domain: An Infrared and Visible Image Fusion Network via Wavelet Transform and State Space Model
by: Zhang, Tianpei, et al.
Published: (2025)

AFRDA: Attentive Feature Refinement for Domain Adaptive Semantic Segmentation
by: Khan, Md. Al-Masrur, et al.
Published: (2025)

Planner-Refiner: Dynamic Space-Time Refinement for Vision-Language Alignment in Videos
by: Tran, Tuyen, et al.
Published: (2025)

SemanticGen: Video Generation in Semantic Space
by: Bai, Jianhong, et al.
Published: (2025)