Saved in:
| Main Authors: | Qu, Kehua, Ding, Rui, Tang, Jin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.04151 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Relation Learning and Aggregate-attention for Multi-person Motion Prediction
by: Qu, Kehua, et al.
Published: (2024)
by: Qu, Kehua, et al.
Published: (2024)
ChronoForge-RL: Chronological Forging through Reinforcement Learning for Enhanced Video Understanding
by: Chen, Kehua
Published: (2025)
by: Chen, Kehua
Published: (2025)
Spatio-temporal Graph Learning on Adaptive Mined Key Frames for High-performance Multi-Object Tracking
by: Wang, Futian, et al.
Published: (2025)
by: Wang, Futian, et al.
Published: (2025)
Semore: VLM-guided Enhanced Semantic Motion Representations for Visual Reinforcement Learning
by: Wang, Wentao, et al.
Published: (2025)
by: Wang, Wentao, et al.
Published: (2025)
MOT FCG++: Enhanced Representation of Spatio-temporal Motion and Appearance Features
by: Fang, Yanzhao
Published: (2024)
by: Fang, Yanzhao
Published: (2024)
Attention-based Multi-modal Deep Learning Model of Spatio-temporal Crop Yield Prediction with Satellite, Soil and Climate Data
by: Shyam, Gopal Krishna, et al.
Published: (2026)
by: Shyam, Gopal Krishna, et al.
Published: (2026)
SemiHMER: Semi-supervised Handwritten Mathematical Expression Recognition using pseudo-labels
by: Chen, Kehua, et al.
Published: (2025)
by: Chen, Kehua, et al.
Published: (2025)
ASMa: Asymmetric Spatio-temporal Masking for Skeleton Action Representation Learning
by: Anand, Aman, et al.
Published: (2026)
by: Anand, Aman, et al.
Published: (2026)
Context-based Interpretable Spatio-Temporal Graph Convolutional Network for Human Motion Forecasting
by: Medina, Edgar, et al.
Published: (2024)
by: Medina, Edgar, et al.
Published: (2024)
Risk-aware Trajectory Prediction by Incorporating Spatio-temporal Traffic Interaction Analysis
by: Thuremella, Divya, et al.
Published: (2024)
by: Thuremella, Divya, et al.
Published: (2024)
A Spatio-temporal Graph Network Allowing Incomplete Trajectory Input for Pedestrian Trajectory Prediction
by: Long, Juncen, et al.
Published: (2025)
by: Long, Juncen, et al.
Published: (2025)
UniPINN: A Unified PINN Framework for Multi-task Learning of Diverse Navier-Stokes Equations
by: Sun, Dengdi, et al.
Published: (2026)
by: Sun, Dengdi, et al.
Published: (2026)
Video-SwinUNet: Spatio-temporal Deep Learning Framework for VFSS Instance Segmentation
by: Zeng, Chengxi, et al.
Published: (2023)
by: Zeng, Chengxi, et al.
Published: (2023)
Multi-scale Spatio-temporal Transformer-based Imbalanced Longitudinal Learning for Glaucoma Forecasting from Irregular Time Series Images
by: Yang, Xikai, et al.
Published: (2024)
by: Yang, Xikai, et al.
Published: (2024)
Deformable Dynamic Convolution for Accurate yet Efficient Spatio-Temporal Traffic Prediction
by: Jin, Hyeonseok, et al.
Published: (2025)
by: Jin, Hyeonseok, et al.
Published: (2025)
Temporal Continual Learning with Prior Compensation for Human Motion Prediction
by: Tang, Jianwei, et al.
Published: (2025)
by: Tang, Jianwei, et al.
Published: (2025)
FireSentry: A Multi-Modal Spatio-temporal Benchmark Dataset for Fine-Grained Wildfire Spread Forecasting
by: Zhou, Nan, et al.
Published: (2025)
by: Zhou, Nan, et al.
Published: (2025)
Beyond Pixels: Introducing Geometric-Semantic World Priors for Video-based Embodied Models via Spatio-temporal Alignment
by: Tang, Jinzhou, et al.
Published: (2025)
by: Tang, Jinzhou, et al.
Published: (2025)
Spatio-temporal neural distance fields for conditional generative modeling of the heart
by: Sørensen, Kristine, et al.
Published: (2024)
by: Sørensen, Kristine, et al.
Published: (2024)
Multi-Scale Spatio-Temporal Graph Convolutional Network for Facial Expression Spotting
by: Deng, Yicheng, et al.
Published: (2024)
by: Deng, Yicheng, et al.
Published: (2024)
E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving
by: Tang, Yihong, et al.
Published: (2025)
by: Tang, Yihong, et al.
Published: (2025)
MS-Net: A Multi-Path Sparse Model for Motion Prediction in Multi-Scenes
by: Tang, Xiaqiang, et al.
Published: (2024)
by: Tang, Xiaqiang, et al.
Published: (2024)
Mode-as-Sequence: Translating Multimodal Motion Prediction into Unified Sequential Mode Modeling
by: Zhou, Zikang, et al.
Published: (2026)
by: Zhou, Zikang, et al.
Published: (2026)
Multi-modal Spatio-Temporal Transformer for High-resolution Land Subsidence Prediction
by: Yao, Wendong, et al.
Published: (2025)
by: Yao, Wendong, et al.
Published: (2025)
Few-Shot Precise Event Spotting via Unified Multi-Entity Graph and Distillation
by: Liu, Zhaoyu, et al.
Published: (2025)
by: Liu, Zhaoyu, et al.
Published: (2025)
X-VORTEX: Spatio-Temporal Contrastive Learning for Wake Vortex Trajectory Forecasting
by: Qu, Zhan, et al.
Published: (2026)
by: Qu, Zhan, et al.
Published: (2026)
Adversarially-Refined VQ-GAN with Dense Motion Tokenization for Spatio-Temporal Heatmaps
by: Maldonado, Gabriel, et al.
Published: (2025)
by: Maldonado, Gabriel, et al.
Published: (2025)
Capturing More: Learning Multi-Domain Representations for Robust Online Handwriting Verification
by: Zhang, Peirong, et al.
Published: (2025)
by: Zhang, Peirong, et al.
Published: (2025)
Mechanistic Learning with Guided Diffusion Models to Predict Spatio-Temporal Brain Tumor Growth
by: Laslo, Daria, et al.
Published: (2025)
by: Laslo, Daria, et al.
Published: (2025)
Dynamic-Aware Spatio-temporal Representation Learning for Dynamic MRI Reconstruction
by: Baik, Dayoung, et al.
Published: (2025)
by: Baik, Dayoung, et al.
Published: (2025)
LiteFat: Lightweight Spatio-Temporal Graph Learning for Real-Time Driver Fatigue Detection
by: Ren, Jing, et al.
Published: (2025)
by: Ren, Jing, et al.
Published: (2025)
UniMotion: A Unified Framework for Motion-Text-Vision Understanding and Generation
by: Wang, Ziyi, et al.
Published: (2026)
by: Wang, Ziyi, et al.
Published: (2026)
Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic
by: Tang, Jianwei, et al.
Published: (2025)
by: Tang, Jianwei, et al.
Published: (2025)
A Unified Model for Longitudinal Multi-Modal Multi-View Prediction with Missingness
by: Chen, Boqi, et al.
Published: (2024)
by: Chen, Boqi, et al.
Published: (2024)
Co-Fusion4D: Spatio-temporal Collaborative Fusion for Robust 3D Object Detection
by: Li, Wenxuan, et al.
Published: (2026)
by: Li, Wenxuan, et al.
Published: (2026)
ChemVA: Advancing Large Language Models on Chemical Reaction Diagrams Understanding
by: Rao, Mingyang, et al.
Published: (2026)
by: Rao, Mingyang, et al.
Published: (2026)
Unified Spatial-Temporal Edge-Enhanced Graph Networks for Pedestrian Trajectory Prediction
by: Li, Ruochen, et al.
Published: (2025)
by: Li, Ruochen, et al.
Published: (2025)
TrajFlow: Multi-modal Motion Prediction via Flow Matching
by: Yan, Qi, et al.
Published: (2025)
by: Yan, Qi, et al.
Published: (2025)
SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognition
by: Huang, Wenbo, et al.
Published: (2024)
by: Huang, Wenbo, et al.
Published: (2024)
CNN-based Multi-In-Multi-Out Model for Efficient Spatiotemporal Prediction
by: Jin, Hyeonseok
Published: (2026)
by: Jin, Hyeonseok
Published: (2026)
Similar Items
-
Relation Learning and Aggregate-attention for Multi-person Motion Prediction
by: Qu, Kehua, et al.
Published: (2024) -
ChronoForge-RL: Chronological Forging through Reinforcement Learning for Enhanced Video Understanding
by: Chen, Kehua
Published: (2025) -
Spatio-temporal Graph Learning on Adaptive Mined Key Frames for High-performance Multi-Object Tracking
by: Wang, Futian, et al.
Published: (2025) -
Semore: VLM-guided Enhanced Semantic Motion Representations for Visual Reinforcement Learning
by: Wang, Wentao, et al.
Published: (2025) -
MOT FCG++: Enhanced Representation of Spatio-temporal Motion and Appearance Features
by: Fang, Yanzhao
Published: (2024)