Saved in:
| Main Authors: | Zhu, Kai, Cui, Zhenyu, Zang, Zehua, Zhou, Jiahuan |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.24295 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
State Space Prompting via Gathering and Spreading Spatio-Temporal Information for Video Understanding
by: Zhou, Jiahuan, et al.
Published: (2025)
by: Zhou, Jiahuan, et al.
Published: (2025)
LaSSM: Efficient Semantic-Spatial Query Decoding via Local Aggregation and State Space Models for 3D Instance Segmentation
by: Yao, Lei, et al.
Published: (2026)
by: Yao, Lei, et al.
Published: (2026)
Bi-C2R: Bidirectional Continual Compatible Representation for Re-indexing Free Lifelong Person Re-identification
by: Cui, Zhenyu, et al.
Published: (2025)
by: Cui, Zhenyu, et al.
Published: (2025)
CKDA: Cross-modality Knowledge Disentanglement and Alignment for Visible-Infrared Lifelong Person Re-identification
by: Cui, Zhenyu, et al.
Published: (2025)
by: Cui, Zhenyu, et al.
Published: (2025)
VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory
by: Yu, Yifei, et al.
Published: (2025)
by: Yu, Yifei, et al.
Published: (2025)
Test-Time Perturbation Learning with Delayed Feedback for Vision-Language-Action Models
by: Zang, Zehua, et al.
Published: (2026)
by: Zang, Zehua, et al.
Published: (2026)
RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation
by: Ma, Xianping, et al.
Published: (2024)
by: Ma, Xianping, et al.
Published: (2024)
UPP: Unified Point-Level Prompting for Robust Point Cloud Analysis
by: Ai, Zixiang, et al.
Published: (2025)
by: Ai, Zixiang, et al.
Published: (2025)
TrackSSM: A General Motion Predictor by State-Space Model
by: Hu, Bin, et al.
Published: (2024)
by: Hu, Bin, et al.
Published: (2024)
UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling
by: Li, Peiming, et al.
Published: (2025)
by: Li, Peiming, et al.
Published: (2025)
SSM Meets Video Diffusion Models: Efficient Long-Term Video Generation with Structured State Spaces
by: Oshima, Yuta, et al.
Published: (2024)
by: Oshima, Yuta, et al.
Published: (2024)
Class-aware Domain Knowledge Fusion and Fission for Continual Test-Time Adaptation
by: Zhou, Jiahuan, et al.
Published: (2025)
by: Zhou, Jiahuan, et al.
Published: (2025)
GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model
by: Ai, Zixiang, et al.
Published: (2025)
by: Ai, Zixiang, et al.
Published: (2025)
Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model
by: Zhu, Qinfeng, et al.
Published: (2024)
by: Zhu, Qinfeng, et al.
Published: (2024)
TCP-SSM: Efficient Vision State Space Models with Token-Conditioned Poles
by: Shoouri, Sara, et al.
Published: (2026)
by: Shoouri, Sara, et al.
Published: (2026)
TRUST: Test-Time Refinement using Uncertainty-Guided SSM Traverses
by: Dastani, Sahar, et al.
Published: (2025)
by: Dastani, Sahar, et al.
Published: (2025)
RSGMamba: Reliability-Aware Self-Gated State Space Model for Multimodal Semantic Segmentation
by: Xu, Guoan, et al.
Published: (2026)
by: Xu, Guoan, et al.
Published: (2026)
Selective Visual Prompting in Vision Mamba
by: Yao, Yifeng, et al.
Published: (2024)
by: Yao, Yifeng, et al.
Published: (2024)
Vision Graph Prompting via Semantic Low-Rank Decomposition
by: Ai, Zixiang, et al.
Published: (2025)
by: Ai, Zixiang, et al.
Published: (2025)
VFIMamba: Video Frame Interpolation with State Space Models
by: Zhang, Guozhen, et al.
Published: (2024)
by: Zhang, Guozhen, et al.
Published: (2024)
MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation
by: Chharia, Aviral, et al.
Published: (2025)
by: Chharia, Aviral, et al.
Published: (2025)
Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation
by: Du, Ye, et al.
Published: (2024)
by: Du, Ye, et al.
Published: (2024)
CAPrompt: Cyclic Prompt Aggregation for Pre-Trained Model Based Class Incremental Learning
by: Li, Qiwei, et al.
Published: (2024)
by: Li, Qiwei, et al.
Published: (2024)
MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking
by: Liu, Xinqi, et al.
Published: (2024)
by: Liu, Xinqi, et al.
Published: (2024)
Cortical-SSM: A Deep State Space Model for EEG and ECoG Motor Imagery Decoding
by: Suzuki, Shuntaro, et al.
Published: (2025)
by: Suzuki, Shuntaro, et al.
Published: (2025)
Generative Refinement Networks for Visual Synthesis
by: Han, Jian, et al.
Published: (2026)
by: Han, Jian, et al.
Published: (2026)
Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation
by: Zhang, Peng, et al.
Published: (2024)
by: Zhang, Peng, et al.
Published: (2024)
Harnessing Vision-Language Pretrained Models with Temporal-Aware Adaptation for Referring Video Object Segmentation
by: Zhou, Zikun, et al.
Published: (2024)
by: Zhou, Zikun, et al.
Published: (2024)
PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation
by: Hu, Yin, et al.
Published: (2024)
by: Hu, Yin, et al.
Published: (2024)
Sparse Refinement for Efficient High-Resolution Semantic Segmentation
by: Liu, Zhijian, et al.
Published: (2024)
by: Liu, Zhijian, et al.
Published: (2024)
MambaVF: State Space Model for Efficient Video Fusion
by: Zhao, Zixiang, et al.
Published: (2026)
by: Zhao, Zixiang, et al.
Published: (2026)
Cross-modal State Space Modeling for Real-time RGB-thermal Wild Scene Semantic Segmentation
by: Guo, Xiaodong, et al.
Published: (2025)
by: Guo, Xiaodong, et al.
Published: (2025)
Learning Spatial-Semantic Features for Robust Video Object Segmentation
by: Li, Xin, et al.
Published: (2024)
by: Li, Xin, et al.
Published: (2024)
RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining
by: Wu, Hongtao, et al.
Published: (2024)
by: Wu, Hongtao, et al.
Published: (2024)
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
by: Fu, Yunxiang, et al.
Published: (2024)
by: Fu, Yunxiang, et al.
Published: (2024)
RefineAnything: Multimodal Region-Specific Refinement for Perfect Local Details
by: Zhou, Dewei, et al.
Published: (2026)
by: Zhou, Dewei, et al.
Published: (2026)
Exploring State Space Model in Wavelet Domain: An Infrared and Visible Image Fusion Network via Wavelet Transform and State Space Model
by: Zhang, Tianpei, et al.
Published: (2025)
by: Zhang, Tianpei, et al.
Published: (2025)
AFRDA: Attentive Feature Refinement for Domain Adaptive Semantic Segmentation
by: Khan, Md. Al-Masrur, et al.
Published: (2025)
by: Khan, Md. Al-Masrur, et al.
Published: (2025)
Planner-Refiner: Dynamic Space-Time Refinement for Vision-Language Alignment in Videos
by: Tran, Tuyen, et al.
Published: (2025)
by: Tran, Tuyen, et al.
Published: (2025)
SemanticGen: Video Generation in Semantic Space
by: Bai, Jianhong, et al.
Published: (2025)
by: Bai, Jianhong, et al.
Published: (2025)
Similar Items
-
State Space Prompting via Gathering and Spreading Spatio-Temporal Information for Video Understanding
by: Zhou, Jiahuan, et al.
Published: (2025) -
LaSSM: Efficient Semantic-Spatial Query Decoding via Local Aggregation and State Space Models for 3D Instance Segmentation
by: Yao, Lei, et al.
Published: (2026) -
Bi-C2R: Bidirectional Continual Compatible Representation for Re-indexing Free Lifelong Person Re-identification
by: Cui, Zhenyu, et al.
Published: (2025) -
CKDA: Cross-modality Knowledge Disentanglement and Alignment for Visible-Infrared Lifelong Person Re-identification
by: Cui, Zhenyu, et al.
Published: (2025) -
VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory
by: Yu, Yifei, et al.
Published: (2025)