TY - CPAPER AU - Alina Lazar AU - Ling Jin AU - Caitlin Brown AU - C Anna Spurlock AU - Alex Sim AU - Kesheng Wu AB -
Logistic regression has long been the gold standard for choice modeling in the transportation field. Despite the rising popularity of machine learning (ML), few is applied to predicting the household vehicle transactions. To address the research gap, this paper presents a first use case of ML application to predicting household vehicle transaction decisions by leveraging a newly processed national panel data set. Model performances are reported for four ML models and the traditional multinomial logit model (MNL). Instead of treating the gold standard and ML models as competitors, this paper tries to use ML tools to inform the MNL model building process. We find the two gradient boosting based methods, CatBoost and LightGBM, are the best performing ML models; and improving logistic models with SHAP interpretation tools can achieve similar performance levels to the best performing ML methods.
BT - 2021 IEEE International Conference on Big Data (Big Data) CY - Orlando, FL, USA DA - 01/2022 DO - 10.1109/BigData52589.2021.9671286 LA - eng N2 -Logistic regression has long been the gold standard for choice modeling in the transportation field. Despite the rising popularity of machine learning (ML), few is applied to predicting the household vehicle transactions. To address the research gap, this paper presents a first use case of ML application to predicting household vehicle transaction decisions by leveraging a newly processed national panel data set. Model performances are reported for four ML models and the traditional multinomial logit model (MNL). Instead of treating the gold standard and ML models as competitors, this paper tries to use ML tools to inform the MNL model building process. We find the two gradient boosting based methods, CatBoost and LightGBM, are the best performing ML models; and improving logistic models with SHAP interpretation tools can achieve similar performance levels to the best performing ML methods.
PB - IEEE PP - Orlando, FL, USA PY - 2021 SN - 978-1-6654-3902-2 T2 - 2021 IEEE International Conference on Big Data (Big Data) T3 - 2021 IEEE International Conference on Big Data (Big Data) TI - Performance of the Gold Standard and Machine Learning in Predicting Vehicle Transactions UR - https://ieeexplore.ieee.org/document/9671286/ ER -