:: Library Catalog

Bibliographic Details
Main Author:	Chen, Changdao
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.08435

Similar Items

Surface Vision Mamba: Leveraging Bidirectional State Space Model for Efficient Spherical Manifold Representation
by: He, Rongzhao, et al.
Published: (2025)

Video Quality Assessment for Online Processing: From Spatial to Temporal Sampling
by: Yan, Jiebin, et al.
Published: (2025)

HGNet: High-Order Spatial Awareness Hypergraph and Multi-Scale Context Attention Network for Colorectal Polyp Detection
by: Liu, Xiaofang, et al.
Published: (2025)

UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling
by: Li, Peiming, et al.
Published: (2025)

S$^2$Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification
by: Wang, Guanchun, et al.
Published: (2024)

Temporal Test-Time Adaptation with State-Space Models
by: Schirmer, Mona, et al.
Published: (2024)

MambaTAD: When State-Space Models Meet Long-Range Temporal Action Detection
by: Lu, Hui, et al.
Published: (2025)

Bidirectional Diffusion Bridge Models
by: Kieu, Duc, et al.
Published: (2025)

Spatial Traces: Enhancing VLA Models with Spatial-Temporal Understanding
by: Patratskiy, Maxim A., et al.
Published: (2025)

STSA: Spatial-Temporal Semantic Alignment for Visual Dubbing
by: Ding, Zijun, et al.
Published: (2025)

Unified Spatial-Temporal Edge-Enhanced Graph Networks for Pedestrian Trajectory Prediction
by: Li, Ruochen, et al.
Published: (2025)

LiteFat: Lightweight Spatio-Temporal Graph Learning for Real-Time Driver Fatigue Detection
by: Ren, Jing, et al.
Published: (2025)

BiDepth: A Bidirectional-Depth Neural Network for Spatio-Temporal Prediction
by: Ehsani, Sina, et al.
Published: (2025)

DeformTrace: A Deformable State Space Model with Relay Tokens for Temporal Forgery Localization
by: Zhu, Xiaodong, et al.
Published: (2026)

Handling Spatial-Temporal Data Heterogeneity for Federated Continual Learning via Tail Anchor
by: Yu, Hao, et al.
Published: (2024)

SwinSF: Image Reconstruction from Spatial-Temporal Spike Streams
by: Jiang, Liangyan, et al.
Published: (2024)

7DGS: Unified Spatial-Temporal-Angular Gaussian Splatting
by: Gao, Zhongpai, et al.
Published: (2025)

Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting
by: Ruan, Weilin, et al.
Published: (2024)

Unleashing Diffusion and State Space Models for Medical Image Segmentation
by: Wu, Rong, et al.
Published: (2025)

$Δ$t-Mamba3D: A Time-Aware Spatio-Temporal State-Space Model for Breast Cancer Risk Prediction
by: Zhou, Zhengbo, et al.
Published: (2025)

Efficient Event-Based Object Detection: A Hybrid Neural Network with Spatial and Temporal Attention
by: Ahmed, Soikat Hasan, et al.
Published: (2024)

SpaceMind: Camera-Guided Modality Fusion for Spatial Reasoning in Vision-Language Models
by: Zhao, Ruosen, et al.
Published: (2025)

Deformba: Vision State Space Model with Adaptive State Fusion
by: Ke, Hongyu, et al.
Published: (2026)

Focal Modulation and Bidirectional Feature Fusion Network for Medical Image Segmentation
by: Safdar, Moin, et al.
Published: (2025)

A Lightweight Group Multiscale Bidirectional Interactive Network for Real-Time Steel Surface Defect Detection
by: Zhang, Yong, et al.
Published: (2025)

Enhancing Visual Token Representations for Video Large Language Models via Training-Free Spatial-Temporal Pooling and Gridding
by: Luo, Bingjun, et al.
Published: (2026)

LocalMamba: Visual State Space Model with Windowed Selective Scan
by: Huang, Tao, et al.
Published: (2024)

MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State Space
by: Weng, Jiangwei, et al.
Published: (2024)

SEDMamba: Enhancing Selective State Space Modelling with Bottleneck Mechanism and Fine-to-Coarse Temporal Fusion for Efficient Error Detection in Robot-Assisted Surgery
by: Xu, Jialang, et al.
Published: (2024)

BiPVL-Seg: Bidirectional Progressive Vision-Language Fusion with Global-Local Alignment for Medical Image Segmentation
by: Sultan, Rafi Ibn, et al.
Published: (2025)

RASLF: Representation-Aware State Space Model for Light Field Super-Resolution
by: Wei, Zeqiang, et al.
Published: (2026)

STARFlow: Spatial Temporal Feature Re-embedding with Attentive Learning for Real-world Scene Flow
by: Lu, Zhiyang, et al.
Published: (2024)

VidLaDA: Bidirectional Diffusion Large Language Models for Efficient Video Understanding
by: He, Zhihao, et al.
Published: (2026)

SF-Mamba: Rethinking State Space Model for Vision
by: Yoshimura, Masakazu, et al.
Published: (2026)

VFIMamba: Video Frame Interpolation with State Space Models
by: Zhang, Guozhen, et al.
Published: (2024)

ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling
by: Ju, Shaobo, et al.
Published: (2026)

SweetTok: Semantic-Aware Spatial-Temporal Tokenizer for Compact Video Discretization
by: Tan, Zhentao, et al.
Published: (2024)

BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection
by: Song, Yang, et al.
Published: (2024)

SSMamba: A Self-Supervised Hybrid State Space Model for Pathological Image Classification
by: Chai, Enhui, et al.
Published: (2026)

SSRFlow: Semantic-aware Fusion with Spatial Temporal Re-embedding for Real-world Scene Flow
by: Lu, Zhiyang, et al.
Published: (2024)