Saved in:
Bibliographic Details
Main Authors: Wang, Weizheng, Yang, Baijian, Hong, Sungeun, Sun, Wenhai, Min, Byung-Cheol
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2401.06344
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908901558976512
author Wang, Weizheng
Yang, Baijian
Hong, Sungeun
Sun, Wenhai
Min, Byung-Cheol
author_facet Wang, Weizheng
Yang, Baijian
Hong, Sungeun
Sun, Wenhai
Min, Byung-Cheol
contents Predicting crowd intentions and trajectories is critical for a range of real-world applications, involving social robotics and autonomous driving. Accurately modeling such behavior remains challenging due to the complexity of pairwise spatial-temporal interactions and the heterogeneous influence of groupwise dynamics. To address these challenges, we propose Hyper-STTN, a Hypergraph-based Spatial-Temporal Transformer Network for crowd trajectory prediction. Hyper-STTN constructs multiscale hypergraphs of varying group sizes to model groupwise correlations, captured through spectral hypergraph convolution based on random-walk probabilities. In parallel, a spatial-temporal transformer is employed to learn pedestrians' pairwise latent interactions across spatial and temporal dimensions. These heterogeneous groupwise and pairwise features are subsequently fused and aligned via a multimodal transformer. Extensive experiments on public pedestrian motion datasets demonstrate that Hyper-STTN consistently outperforms state-of-the-art baselines and ablation models.
format Preprint
id arxiv_https___arxiv_org_abs_2401_06344
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Hyper-STTN: Hypergraph Augmented Spatial-Temporal Transformer Network for Trajectory Prediction
Wang, Weizheng
Yang, Baijian
Hong, Sungeun
Sun, Wenhai
Min, Byung-Cheol
Computer Vision and Pattern Recognition
Machine Learning
Predicting crowd intentions and trajectories is critical for a range of real-world applications, involving social robotics and autonomous driving. Accurately modeling such behavior remains challenging due to the complexity of pairwise spatial-temporal interactions and the heterogeneous influence of groupwise dynamics. To address these challenges, we propose Hyper-STTN, a Hypergraph-based Spatial-Temporal Transformer Network for crowd trajectory prediction. Hyper-STTN constructs multiscale hypergraphs of varying group sizes to model groupwise correlations, captured through spectral hypergraph convolution based on random-walk probabilities. In parallel, a spatial-temporal transformer is employed to learn pedestrians' pairwise latent interactions across spatial and temporal dimensions. These heterogeneous groupwise and pairwise features are subsequently fused and aligned via a multimodal transformer. Extensive experiments on public pedestrian motion datasets demonstrate that Hyper-STTN consistently outperforms state-of-the-art baselines and ablation models.
title Hyper-STTN: Hypergraph Augmented Spatial-Temporal Transformer Network for Trajectory Prediction
topic Computer Vision and Pattern Recognition
Machine Learning
url https://arxiv.org/abs/2401.06344