:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Qu, Kehua, Ding, Rui, Tang, Jin
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2411.04151
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Relation Learning and Aggregate-attention for Multi-person Motion Prediction
by: Qu, Kehua, et al.
Published: (2024)

ChronoForge-RL: Chronological Forging through Reinforcement Learning for Enhanced Video Understanding
by: Chen, Kehua
Published: (2025)

Spatio-temporal Graph Learning on Adaptive Mined Key Frames for High-performance Multi-Object Tracking
by: Wang, Futian, et al.
Published: (2025)

Semore: VLM-guided Enhanced Semantic Motion Representations for Visual Reinforcement Learning
by: Wang, Wentao, et al.
Published: (2025)

MOT FCG++: Enhanced Representation of Spatio-temporal Motion and Appearance Features
by: Fang, Yanzhao
Published: (2024)

Attention-based Multi-modal Deep Learning Model of Spatio-temporal Crop Yield Prediction with Satellite, Soil and Climate Data
by: Shyam, Gopal Krishna, et al.
Published: (2026)

SemiHMER: Semi-supervised Handwritten Mathematical Expression Recognition using pseudo-labels
by: Chen, Kehua, et al.
Published: (2025)

ASMa: Asymmetric Spatio-temporal Masking for Skeleton Action Representation Learning
by: Anand, Aman, et al.
Published: (2026)

Context-based Interpretable Spatio-Temporal Graph Convolutional Network for Human Motion Forecasting
by: Medina, Edgar, et al.
Published: (2024)

Risk-aware Trajectory Prediction by Incorporating Spatio-temporal Traffic Interaction Analysis
by: Thuremella, Divya, et al.
Published: (2024)

A Spatio-temporal Graph Network Allowing Incomplete Trajectory Input for Pedestrian Trajectory Prediction
by: Long, Juncen, et al.
Published: (2025)

UniPINN: A Unified PINN Framework for Multi-task Learning of Diverse Navier-Stokes Equations
by: Sun, Dengdi, et al.
Published: (2026)

Video-SwinUNet: Spatio-temporal Deep Learning Framework for VFSS Instance Segmentation
by: Zeng, Chengxi, et al.
Published: (2023)

Multi-scale Spatio-temporal Transformer-based Imbalanced Longitudinal Learning for Glaucoma Forecasting from Irregular Time Series Images
by: Yang, Xikai, et al.
Published: (2024)

Deformable Dynamic Convolution for Accurate yet Efficient Spatio-Temporal Traffic Prediction
by: Jin, Hyeonseok, et al.
Published: (2025)

Temporal Continual Learning with Prior Compensation for Human Motion Prediction
by: Tang, Jianwei, et al.
Published: (2025)

FireSentry: A Multi-Modal Spatio-temporal Benchmark Dataset for Fine-Grained Wildfire Spread Forecasting
by: Zhou, Nan, et al.
Published: (2025)

Beyond Pixels: Introducing Geometric-Semantic World Priors for Video-based Embodied Models via Spatio-temporal Alignment
by: Tang, Jinzhou, et al.
Published: (2025)

Spatio-temporal neural distance fields for conditional generative modeling of the heart
by: Sørensen, Kristine, et al.
Published: (2024)

Multi-Scale Spatio-Temporal Graph Convolutional Network for Facial Expression Spotting
by: Deng, Yicheng, et al.
Published: (2024)

E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving
by: Tang, Yihong, et al.
Published: (2025)

MS-Net: A Multi-Path Sparse Model for Motion Prediction in Multi-Scenes
by: Tang, Xiaqiang, et al.
Published: (2024)

Mode-as-Sequence: Translating Multimodal Motion Prediction into Unified Sequential Mode Modeling
by: Zhou, Zikang, et al.
Published: (2026)

Multi-modal Spatio-Temporal Transformer for High-resolution Land Subsidence Prediction
by: Yao, Wendong, et al.
Published: (2025)

Few-Shot Precise Event Spotting via Unified Multi-Entity Graph and Distillation
by: Liu, Zhaoyu, et al.
Published: (2025)

X-VORTEX: Spatio-Temporal Contrastive Learning for Wake Vortex Trajectory Forecasting
by: Qu, Zhan, et al.
Published: (2026)

Adversarially-Refined VQ-GAN with Dense Motion Tokenization for Spatio-Temporal Heatmaps
by: Maldonado, Gabriel, et al.
Published: (2025)

Capturing More: Learning Multi-Domain Representations for Robust Online Handwriting Verification
by: Zhang, Peirong, et al.
Published: (2025)

Mechanistic Learning with Guided Diffusion Models to Predict Spatio-Temporal Brain Tumor Growth
by: Laslo, Daria, et al.
Published: (2025)

Dynamic-Aware Spatio-temporal Representation Learning for Dynamic MRI Reconstruction
by: Baik, Dayoung, et al.
Published: (2025)

LiteFat: Lightweight Spatio-Temporal Graph Learning for Real-Time Driver Fatigue Detection
by: Ren, Jing, et al.
Published: (2025)

UniMotion: A Unified Framework for Motion-Text-Vision Understanding and Generation
by: Wang, Ziyi, et al.
Published: (2026)

Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic
by: Tang, Jianwei, et al.
Published: (2025)

A Unified Model for Longitudinal Multi-Modal Multi-View Prediction with Missingness
by: Chen, Boqi, et al.
Published: (2024)

Co-Fusion4D: Spatio-temporal Collaborative Fusion for Robust 3D Object Detection
by: Li, Wenxuan, et al.
Published: (2026)

ChemVA: Advancing Large Language Models on Chemical Reaction Diagrams Understanding
by: Rao, Mingyang, et al.
Published: (2026)

Unified Spatial-Temporal Edge-Enhanced Graph Networks for Pedestrian Trajectory Prediction
by: Li, Ruochen, et al.
Published: (2025)

TrajFlow: Multi-modal Motion Prediction via Flow Matching
by: Yan, Qi, et al.
Published: (2025)

SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognition
by: Huang, Wenbo, et al.
Published: (2024)

CNN-based Multi-In-Multi-Out Model for Efficient Spatiotemporal Prediction
by: Jin, Hyeonseok
Published: (2026)