:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yu, Wei, Qian, Yunhang
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.08073
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

QuarterMap: Efficient Post-Training Token Pruning for Visual State Space Models
by: Chi, Tien-Yu, et al.
Published: (2025)

MambaEVT: Event Stream based Visual Object Tracking using State Space Model
by: Wang, Xiao, et al.
Published: (2024)

LocalMamba: Visual State Space Model with Windowed Selective Scan
by: Huang, Tao, et al.
Published: (2024)

Event USKT : U-State Space Model in Knowledge Transfer for Event Cameras
by: Lin, Yuhui, et al.
Published: (2024)

Merging Context Clustering with Visual State Space Models for Medical Image Segmentation
by: Zhu, Yun, et al.
Published: (2025)

GenIR: Generative Visual Feedback for Mental Image Retrieval
by: Yang, Diji, et al.
Published: (2025)

Mamba-FETrack: Frame-Event Tracking via State Space Model
by: Huang, Ju, et al.
Published: (2024)

StyleMamba : State Space Model for Efficient Text-driven Image Style Transfer
by: Wang, Zijia, et al.
Published: (2024)

Unlocking Visual Secrets: Inverting Features with Diffusion Priors for Image Reconstruction
by: Zhang, Sai Qian, et al.
Published: (2024)

Informative Text-Image Alignment for Visual Affordance Learning with Foundation Models
by: Zhang, Qian, et al.
Published: (2025)

MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow Estimation
by: Luo, Jiehao, et al.
Published: (2025)

Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs
by: Han, Kai, et al.
Published: (2024)

DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
by: Lyu, Yueming, et al.
Published: (2023)

V"Mean"ba: Visual State Space Models only need 1 hidden dimension
by: Chi, Tien-Yu, et al.
Published: (2024)

DGFamba: Learning Flow Factorized State Space for Visual Domain Generalization
by: Bi, Qi, et al.
Published: (2025)

AdaIR: Exploiting Underlying Similarities of Image Restoration Tasks with Adapters
by: Chen, Hao-Wei, et al.
Published: (2024)

Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking
by: Wang, Shiao, et al.
Published: (2025)

Unleashing Diffusion and State Space Models for Medical Image Segmentation
by: Wu, Rong, et al.
Published: (2025)

Unified Medical Image Segmentation with State Space Modeling Snake
by: Zhang, Ruicheng, et al.
Published: (2025)

VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models
by: Zhou, Chenyu, et al.
Published: (2024)

Gather-Scatter Mamba: Accelerating Propagation with Efficient State Space Model
by: Ko, Hyun-kyu, et al.
Published: (2025)

MambaLoc: Efficient Camera Localisation via State Space Model
by: Wang, Jialu, et al.
Published: (2024)

KineST: A Kinematics-guided Spatiotemporal State Space Model for Human Motion Tracking from Sparse Signals
by: Zhao, Shuting, et al.
Published: (2025)

SF-Mamba: Rethinking State Space Model for Vision
by: Yoshimura, Masakazu, et al.
Published: (2026)

TCP-SSM: Efficient Vision State Space Models with Token-Conditioned Poles
by: Shoouri, Sara, et al.
Published: (2026)

Depth-guided Texture Diffusion for Image Semantic Segmentation
by: Sun, Wei, et al.
Published: (2024)

Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better
by: Wang, Dianyi, et al.
Published: (2025)

SegMaFormer: A Hybrid State-Space and Transformer Model for Efficient Segmentation
by: Nguyen, Duy D., et al.
Published: (2026)

Monet: Reasoning in Latent Visual Space Beyond Images and Language
by: Wang, Qixun, et al.
Published: (2025)

SSMamba: A Self-Supervised Hybrid State Space Model for Pathological Image Classification
by: Chai, Enhui, et al.
Published: (2026)

S$^2$Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification
by: Wang, Guanchun, et al.
Published: (2024)

Mutual Information guided Visual Contrastive Learning
by: Chen, Hanyang, et al.
Published: (2025)

Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
by: Wang, Jingchao, et al.
Published: (2025)

LSSGen: Leveraging Latent Space Scaling in Flow and Diffusion for Efficient Text to Image Generation
by: Tang, Jyun-Ze, et al.
Published: (2025)

HieraTok: Multi-Scale Visual Tokenizer Improves Image Reconstruction and Generation
by: Chen, Cong, et al.
Published: (2025)

Surface Vision Mamba: Leveraging Bidirectional State Space Model for Efficient Spherical Manifold Representation
by: He, Rongzhao, et al.
Published: (2025)

Geometry-Aware State Space Model: A New Paradigm for Whole-Slide Image Representation
by: Chai, Enhui, et al.
Published: (2026)

Scalable Cloud-Native Pipeline for Efficient 3D Model Reconstruction from Monocular Smartphone Images
by: Aghilar, Potito, et al.
Published: (2024)

Mamba-CAD: State Space Model For 3D Computer-Aided Design Generative Modeling
by: Li, Xueyang, et al.
Published: (2026)

EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models
by: Lee, Eungbean, et al.
Published: (2024)