4.7 Article

Prediction and analysis of train arrival delay based on XGBoost and Bayesian optimization

期刊

APPLIED SOFT COMPUTING
卷 109, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.asoc.2021.107538

关键词

High-speed railway; Train arrival delay prediction; Extreme gradient boosting machine; Bayesian optimization; Abnormal events

资金

  1. Fundamental Re-search Funds for the Central Universities, China [2019JBM077]

向作者/读者索取更多资源

Accurate train arrival delay prediction is critical for real-time train dispatching and service improvement. This study proposes a data-driven method that combines XGBoost and BO algorithm to predict train arrival delays, which outperforms other benchmark models in terms of performance metrics. Case studies on two high-speed railway lines demonstrate the efficacy of the proposed method for predicting delays at different intervals and stations.
Accurate train arrival delay prediction is critical for real-time train dispatching and for the improvement of the transportation service. This study proposes a data-driven method that combines eXtreme Gradient Boosting (XGBoost) and a Bayesian optimization (BO) algorithm to predict train arrival delays. First, eleven characteristics that may affect the train arrival time at the next scheduled station are identified as independent variables. Second, an XGBoost prediction model that captures the relation between the train arrival delays and various railway system characteristics is established. Third, the BO algorithm is applied to the hyperparameter optimization of the XGBoost model to improve the prediction accuracy. Subsequently, case studies using data from two high-speed railway (HSR) lines in China are performed to analyze the prediction efficiency and accuracy of the proposed model for different delay bins and at different stations. The results on two HSR lines demonstrate that the proposed method outperforms other benchmark models regarding the performance metrics of the determination coefficient (0.9889/0.9905), root-mean-squared error (2.686/1.887), and mean absolute error (0.896/ 0.802). In addition, the statistical test is carried out using Friedman Test (FT) and Wilcoxon Signed Rank Test (WSRT) to validate the efficacy of the proposed method. Furthermore, the train arrival delays at different abnormal events can also be accurately forecasted using the proposed method; the results indicate that the proposed method outperforms other benchmark methods, especially in the prediction of long delays caused by specific abnormal events. (C) 2021 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据