Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Jiang, Yuqin, Popov, Andrey A., Duan, Tianle, Li, Qingchun
Format: Preprint
Veröffentlicht: 2025
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2509.21703
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866918148302700544
author Jiang, Yuqin
Popov, Andrey A.
Duan, Tianle
Li, Qingchun
author_facet Jiang, Yuqin
Popov, Andrey A.
Duan, Tianle
Li, Qingchun
contents Understanding urban human mobility patterns at various spatial levels is essential for social science. This study presents a machine learning framework to downscale origin-destination (OD) taxi trips flows in New York City from a larger spatial unit to a smaller spatial unit. First, correlations between OD trips and demographic, socioeconomic, and commuting characteristics are developed using four models: Linear Regression (LR), Random Forest (RF), Support Vector Machine (SVM), and Neural Networks (NN). Second, a perturbation-based sensitivity analysis is applied to interpret variable importance for nonlinear models. The results show that the linear regression model failed to capture the complex variable interactions. While NN performs best with the training and testing datasets, SVM shows the best generalization ability in downscaling performance. The methodology presented in this study provides both analytical advancement and practical applications to improve transportation services and urban development.
format Preprint
id arxiv_https___arxiv_org_abs_2509_21703
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Downscaling human mobility data based on demographic socioeconomic and commuting characteristics using interpretable machine learning methods
Jiang, Yuqin
Popov, Andrey A.
Duan, Tianle
Li, Qingchun
Machine Learning
Understanding urban human mobility patterns at various spatial levels is essential for social science. This study presents a machine learning framework to downscale origin-destination (OD) taxi trips flows in New York City from a larger spatial unit to a smaller spatial unit. First, correlations between OD trips and demographic, socioeconomic, and commuting characteristics are developed using four models: Linear Regression (LR), Random Forest (RF), Support Vector Machine (SVM), and Neural Networks (NN). Second, a perturbation-based sensitivity analysis is applied to interpret variable importance for nonlinear models. The results show that the linear regression model failed to capture the complex variable interactions. While NN performs best with the training and testing datasets, SVM shows the best generalization ability in downscaling performance. The methodology presented in this study provides both analytical advancement and practical applications to improve transportation services and urban development.
title Downscaling human mobility data based on demographic socioeconomic and commuting characteristics using interpretable machine learning methods
topic Machine Learning
url https://arxiv.org/abs/2509.21703