Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wang, Shenhao, Mo, Baichuan, Zheng, Yunhan, Hess, Stephane, Zhao, Jinhua
Format:	Preprint
Published:	2021
Subjects:	Machine Learning Econometrics
Online Access:	https://arxiv.org/abs/2102.01130
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910860157386752
author	Wang, Shenhao Mo, Baichuan Zheng, Yunhan Hess, Stephane Zhao, Jinhua
author_facet	Wang, Shenhao Mo, Baichuan Zheng, Yunhan Hess, Stephane Zhao, Jinhua
contents	Numerous studies have compared machine learning (ML) and discrete choice models (DCMs) in predicting travel demand. However, these studies often lack generalizability as they compare models deterministically without considering contextual variations. To address this limitation, our study develops an empirical benchmark by designing a tournament model, thus efficiently summarizing a large number of experiments, quantifying the randomness in model comparisons, and using formal statistical tests to differentiate between the model and contextual effects. This benchmark study compares two large-scale data sources: a database compiled from literature review summarizing 136 experiments from 35 studies, and our own experiment data, encompassing a total of 6,970 experiments from 105 models and 12 model families. This benchmark study yields two key findings. Firstly, many ML models, particularly the ensemble methods and deep learning, statistically outperform the DCM family (i.e., multinomial, nested, and mixed logit models). However, this study also highlights the crucial role of the contextual factors (i.e., data sources, inputs and choice categories), which can explain models' predictive performance more effectively than the differences in model types alone. Model performance varies significantly with data sources, improving with larger sample sizes and lower dimensional alternative sets. After controlling all the model and contextual factors, significant randomness still remains, implying inherent uncertainty in such model comparisons. Overall, we suggest that future researchers shift more focus from context-specific model comparisons towards examining model transferability across contexts and characterizing the inherent uncertainty in ML, thus creating more robust and generalizable next-generation travel demand models.
format	Preprint
id	arxiv_https___arxiv_org_abs_2102_01130
institution	arXiv
publishDate	2021
record_format	arxiv
spellingShingle	Comparing hundreds of machine learning classifiers and discrete choice models in predicting travel behavior: an empirical benchmark Wang, Shenhao Mo, Baichuan Zheng, Yunhan Hess, Stephane Zhao, Jinhua Machine Learning Econometrics Numerous studies have compared machine learning (ML) and discrete choice models (DCMs) in predicting travel demand. However, these studies often lack generalizability as they compare models deterministically without considering contextual variations. To address this limitation, our study develops an empirical benchmark by designing a tournament model, thus efficiently summarizing a large number of experiments, quantifying the randomness in model comparisons, and using formal statistical tests to differentiate between the model and contextual effects. This benchmark study compares two large-scale data sources: a database compiled from literature review summarizing 136 experiments from 35 studies, and our own experiment data, encompassing a total of 6,970 experiments from 105 models and 12 model families. This benchmark study yields two key findings. Firstly, many ML models, particularly the ensemble methods and deep learning, statistically outperform the DCM family (i.e., multinomial, nested, and mixed logit models). However, this study also highlights the crucial role of the contextual factors (i.e., data sources, inputs and choice categories), which can explain models' predictive performance more effectively than the differences in model types alone. Model performance varies significantly with data sources, improving with larger sample sizes and lower dimensional alternative sets. After controlling all the model and contextual factors, significant randomness still remains, implying inherent uncertainty in such model comparisons. Overall, we suggest that future researchers shift more focus from context-specific model comparisons towards examining model transferability across contexts and characterizing the inherent uncertainty in ML, thus creating more robust and generalizable next-generation travel demand models.
title	Comparing hundreds of machine learning classifiers and discrete choice models in predicting travel behavior: an empirical benchmark
topic	Machine Learning Econometrics
url	https://arxiv.org/abs/2102.01130

Similar Items