Saved in:
| Main Authors: | Yu, Wei, Qian, Yunhang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.08073 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
QuarterMap: Efficient Post-Training Token Pruning for Visual State Space Models
by: Chi, Tien-Yu, et al.
Published: (2025)
by: Chi, Tien-Yu, et al.
Published: (2025)
MambaEVT: Event Stream based Visual Object Tracking using State Space Model
by: Wang, Xiao, et al.
Published: (2024)
by: Wang, Xiao, et al.
Published: (2024)
LocalMamba: Visual State Space Model with Windowed Selective Scan
by: Huang, Tao, et al.
Published: (2024)
by: Huang, Tao, et al.
Published: (2024)
Event USKT : U-State Space Model in Knowledge Transfer for Event Cameras
by: Lin, Yuhui, et al.
Published: (2024)
by: Lin, Yuhui, et al.
Published: (2024)
Merging Context Clustering with Visual State Space Models for Medical Image Segmentation
by: Zhu, Yun, et al.
Published: (2025)
by: Zhu, Yun, et al.
Published: (2025)
GenIR: Generative Visual Feedback for Mental Image Retrieval
by: Yang, Diji, et al.
Published: (2025)
by: Yang, Diji, et al.
Published: (2025)
Mamba-FETrack: Frame-Event Tracking via State Space Model
by: Huang, Ju, et al.
Published: (2024)
by: Huang, Ju, et al.
Published: (2024)
StyleMamba : State Space Model for Efficient Text-driven Image Style Transfer
by: Wang, Zijia, et al.
Published: (2024)
by: Wang, Zijia, et al.
Published: (2024)
Unlocking Visual Secrets: Inverting Features with Diffusion Priors for Image Reconstruction
by: Zhang, Sai Qian, et al.
Published: (2024)
by: Zhang, Sai Qian, et al.
Published: (2024)
Informative Text-Image Alignment for Visual Affordance Learning with Foundation Models
by: Zhang, Qian, et al.
Published: (2025)
by: Zhang, Qian, et al.
Published: (2025)
MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow Estimation
by: Luo, Jiehao, et al.
Published: (2025)
by: Luo, Jiehao, et al.
Published: (2025)
Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs
by: Han, Kai, et al.
Published: (2024)
by: Han, Kai, et al.
Published: (2024)
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
by: Lyu, Yueming, et al.
Published: (2023)
by: Lyu, Yueming, et al.
Published: (2023)
V"Mean"ba: Visual State Space Models only need 1 hidden dimension
by: Chi, Tien-Yu, et al.
Published: (2024)
by: Chi, Tien-Yu, et al.
Published: (2024)
DGFamba: Learning Flow Factorized State Space for Visual Domain Generalization
by: Bi, Qi, et al.
Published: (2025)
by: Bi, Qi, et al.
Published: (2025)
AdaIR: Exploiting Underlying Similarities of Image Restoration Tasks with Adapters
by: Chen, Hao-Wei, et al.
Published: (2024)
by: Chen, Hao-Wei, et al.
Published: (2024)
Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking
by: Wang, Shiao, et al.
Published: (2025)
by: Wang, Shiao, et al.
Published: (2025)
Unleashing Diffusion and State Space Models for Medical Image Segmentation
by: Wu, Rong, et al.
Published: (2025)
by: Wu, Rong, et al.
Published: (2025)
Unified Medical Image Segmentation with State Space Modeling Snake
by: Zhang, Ruicheng, et al.
Published: (2025)
by: Zhang, Ruicheng, et al.
Published: (2025)
VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models
by: Zhou, Chenyu, et al.
Published: (2024)
by: Zhou, Chenyu, et al.
Published: (2024)
Gather-Scatter Mamba: Accelerating Propagation with Efficient State Space Model
by: Ko, Hyun-kyu, et al.
Published: (2025)
by: Ko, Hyun-kyu, et al.
Published: (2025)
MambaLoc: Efficient Camera Localisation via State Space Model
by: Wang, Jialu, et al.
Published: (2024)
by: Wang, Jialu, et al.
Published: (2024)
KineST: A Kinematics-guided Spatiotemporal State Space Model for Human Motion Tracking from Sparse Signals
by: Zhao, Shuting, et al.
Published: (2025)
by: Zhao, Shuting, et al.
Published: (2025)
SF-Mamba: Rethinking State Space Model for Vision
by: Yoshimura, Masakazu, et al.
Published: (2026)
by: Yoshimura, Masakazu, et al.
Published: (2026)
TCP-SSM: Efficient Vision State Space Models with Token-Conditioned Poles
by: Shoouri, Sara, et al.
Published: (2026)
by: Shoouri, Sara, et al.
Published: (2026)
Depth-guided Texture Diffusion for Image Semantic Segmentation
by: Sun, Wei, et al.
Published: (2024)
by: Sun, Wei, et al.
Published: (2024)
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better
by: Wang, Dianyi, et al.
Published: (2025)
by: Wang, Dianyi, et al.
Published: (2025)
SegMaFormer: A Hybrid State-Space and Transformer Model for Efficient Segmentation
by: Nguyen, Duy D., et al.
Published: (2026)
by: Nguyen, Duy D., et al.
Published: (2026)
Monet: Reasoning in Latent Visual Space Beyond Images and Language
by: Wang, Qixun, et al.
Published: (2025)
by: Wang, Qixun, et al.
Published: (2025)
SSMamba: A Self-Supervised Hybrid State Space Model for Pathological Image Classification
by: Chai, Enhui, et al.
Published: (2026)
by: Chai, Enhui, et al.
Published: (2026)
S$^2$Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification
by: Wang, Guanchun, et al.
Published: (2024)
by: Wang, Guanchun, et al.
Published: (2024)
Mutual Information guided Visual Contrastive Learning
by: Chen, Hanyang, et al.
Published: (2025)
by: Chen, Hanyang, et al.
Published: (2025)
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
by: Wang, Jingchao, et al.
Published: (2025)
by: Wang, Jingchao, et al.
Published: (2025)
LSSGen: Leveraging Latent Space Scaling in Flow and Diffusion for Efficient Text to Image Generation
by: Tang, Jyun-Ze, et al.
Published: (2025)
by: Tang, Jyun-Ze, et al.
Published: (2025)
HieraTok: Multi-Scale Visual Tokenizer Improves Image Reconstruction and Generation
by: Chen, Cong, et al.
Published: (2025)
by: Chen, Cong, et al.
Published: (2025)
Surface Vision Mamba: Leveraging Bidirectional State Space Model for Efficient Spherical Manifold Representation
by: He, Rongzhao, et al.
Published: (2025)
by: He, Rongzhao, et al.
Published: (2025)
Geometry-Aware State Space Model: A New Paradigm for Whole-Slide Image Representation
by: Chai, Enhui, et al.
Published: (2026)
by: Chai, Enhui, et al.
Published: (2026)
Scalable Cloud-Native Pipeline for Efficient 3D Model Reconstruction from Monocular Smartphone Images
by: Aghilar, Potito, et al.
Published: (2024)
by: Aghilar, Potito, et al.
Published: (2024)
Mamba-CAD: State Space Model For 3D Computer-Aided Design Generative Modeling
by: Li, Xueyang, et al.
Published: (2026)
by: Li, Xueyang, et al.
Published: (2026)
EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models
by: Lee, Eungbean, et al.
Published: (2024)
by: Lee, Eungbean, et al.
Published: (2024)
Similar Items
-
QuarterMap: Efficient Post-Training Token Pruning for Visual State Space Models
by: Chi, Tien-Yu, et al.
Published: (2025) -
MambaEVT: Event Stream based Visual Object Tracking using State Space Model
by: Wang, Xiao, et al.
Published: (2024) -
LocalMamba: Visual State Space Model with Windowed Selective Scan
by: Huang, Tao, et al.
Published: (2024) -
Event USKT : U-State Space Model in Knowledge Transfer for Event Cameras
by: Lin, Yuhui, et al.
Published: (2024) -
Merging Context Clustering with Visual State Space Models for Medical Image Segmentation
by: Zhu, Yun, et al.
Published: (2025)