| Main Author: | Chen, Changdao |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2604.08435 |
Similar Items
Surface Vision Mamba: Leveraging Bidirectional State Space Model for Efficient Spherical Manifold Representation
by: He, Rongzhao, et al.
Published: (2025)
Video Quality Assessment for Online Processing: From Spatial to Temporal Sampling
by: Yan, Jiebin, et al.
Published: (2025)
HGNet: High-Order Spatial Awareness Hypergraph and Multi-Scale Context Attention Network for Colorectal Polyp Detection
by: Liu, Xiaofang, et al.
Published: (2025)
UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling
by: Li, Peiming, et al.
Published: (2025)
S$^2$Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification
by: Wang, Guanchun, et al.
Published: (2024)
Temporal Test-Time Adaptation with State-Space Models
by: Schirmer, Mona, et al.
Published: (2024)
MambaTAD: When State-Space Models Meet Long-Range Temporal Action Detection
by: Lu, Hui, et al.
Published: (2025)
Bidirectional Diffusion Bridge Models
by: Kieu, Duc, et al.
Published: (2025)
Spatial Traces: Enhancing VLA Models with Spatial-Temporal Understanding
by: Patratskiy, Maxim A., et al.
Published: (2025)
STSA: Spatial-Temporal Semantic Alignment for Visual Dubbing
by: Ding, Zijun, et al.
Published: (2025)
Unified Spatial-Temporal Edge-Enhanced Graph Networks for Pedestrian Trajectory Prediction
by: Li, Ruochen, et al.
Published: (2025)
LiteFat: Lightweight Spatio-Temporal Graph Learning for Real-Time Driver Fatigue Detection
by: Ren, Jing, et al.
Published: (2025)
BiDepth: A Bidirectional-Depth Neural Network for Spatio-Temporal Prediction
by: Ehsani, Sina, et al.
Published: (2025)
DeformTrace: A Deformable State Space Model with Relay Tokens for Temporal Forgery Localization
by: Zhu, Xiaodong, et al.
Published: (2026)
Handling Spatial-Temporal Data Heterogeneity for Federated Continual Learning via Tail Anchor
by: Yu, Hao, et al.
Published: (2024)
SwinSF: Image Reconstruction from Spatial-Temporal Spike Streams
by: Jiang, Liangyan, et al.
Published: (2024)
7DGS: Unified Spatial-Temporal-Angular Gaussian Splatting
by: Gao, Zhongpai, et al.
Published: (2025)
Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting
by: Ruan, Weilin, et al.
Published: (2024)
Unleashing Diffusion and State Space Models for Medical Image Segmentation
by: Wu, Rong, et al.
Published: (2025)
$Δ$t-Mamba3D: A Time-Aware Spatio-Temporal State-Space Model for Breast Cancer Risk Prediction
by: Zhou, Zhengbo, et al.
Published: (2025)
Efficient Event-Based Object Detection: A Hybrid Neural Network with Spatial and Temporal Attention
by: Ahmed, Soikat Hasan, et al.
Published: (2024)
SpaceMind: Camera-Guided Modality Fusion for Spatial Reasoning in Vision-Language Models
by: Zhao, Ruosen, et al.
Published: (2025)
Deformba: Vision State Space Model with Adaptive State Fusion
by: Ke, Hongyu, et al.
Published: (2026)
Focal Modulation and Bidirectional Feature Fusion Network for Medical Image Segmentation
by: Safdar, Moin, et al.
Published: (2025)
A Lightweight Group Multiscale Bidirectional Interactive Network for Real-Time Steel Surface Defect Detection
by: Zhang, Yong, et al.
Published: (2025)
Enhancing Visual Token Representations for Video Large Language Models via Training-Free Spatial-Temporal Pooling and Gridding
by: Luo, Bingjun, et al.
Published: (2026)
LocalMamba: Visual State Space Model with Windowed Selective Scan
by: Huang, Tao, et al.
Published: (2024)
MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State Space
by: Weng, Jiangwei, et al.
Published: (2024)
SEDMamba: Enhancing Selective State Space Modelling with Bottleneck Mechanism and Fine-to-Coarse Temporal Fusion for Efficient Error Detection in Robot-Assisted Surgery
by: Xu, Jialang, et al.
Published: (2024)
BiPVL-Seg: Bidirectional Progressive Vision-Language Fusion with Global-Local Alignment for Medical Image Segmentation
by: Sultan, Rafi Ibn, et al.
Published: (2025)
RASLF: Representation-Aware State Space Model for Light Field Super-Resolution
by: Wei, Zeqiang, et al.
Published: (2026)
STARFlow: Spatial Temporal Feature Re-embedding with Attentive Learning for Real-world Scene Flow
by: Lu, Zhiyang, et al.
Published: (2024)
VidLaDA: Bidirectional Diffusion Large Language Models for Efficient Video Understanding
by: He, Zhihao, et al.
Published: (2026)
SF-Mamba: Rethinking State Space Model for Vision
by: Yoshimura, Masakazu, et al.
Published: (2026)
VFIMamba: Video Frame Interpolation with State Space Models
by: Zhang, Guozhen, et al.
Published: (2024)
ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling
by: Ju, Shaobo, et al.
Published: (2026)
SweetTok: Semantic-Aware Spatial-Temporal Tokenizer for Compact Video Discretization
by: Tan, Zhentao, et al.
Published: (2024)
BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection
by: Song, Yang, et al.
Published: (2024)
SSMamba: A Self-Supervised Hybrid State Space Model for Pathological Image Classification
by: Chai, Enhui, et al.
Published: (2026)
SSRFlow: Semantic-aware Fusion with Spatial Temporal Re-embedding for Real-world Scene Flow
by: Lu, Zhiyang, et al.
Published: (2024)