4.6 Article

A hybrid modelling framework of machine learning and extreme value theory for crash risk estimation using traffic conflicts

Journal

ANALYTIC METHODS IN ACCIDENT RESEARCH
Volume 36, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.amar.2022.100248

Keywords

Extreme Value Theory; Machine learning; Traffic conflicts; Anomaly detection; Hybrid framework

Funding

  1. Queensland University of Technology
  2. iMOVE CRC
  3. Australian Government

Ask authors/readers for more resources

This study proposes a hybrid modeling framework that combines machine learning and extreme value theory to estimate crash risk from traffic conflicts. An efficient sampling technique is used to identify extremes, and machine learning methods replace traditional sampling techniques for anomaly detection. The results show that the proposed hybrid models outperform traditional extreme value models in estimating crash risks.
Extreme value theory is the state-of-the-art modelling technique for estimating crash risk from traffic conflicts, with two different sampling techniques, i.e. block maxima and peak -over-threshold, at its core. However, the uncertainty associated with the estimates obtained by these sampling techniques has been too large to enable its widespread practi-cal use. A fundamental reason for this issue is the improper selection of extreme values and a lack of a suitable and efficient sampling mechanism. This study proposes a hybrid mod-elling framework of machine learning and extreme value theory to estimate crash risk from traffic conflicts with an efficient sampling technique for identifying extremes. More specif-ically, a machine learning approach replaces the conventional sampling techniques with anomaly detection techniques since an anomaly is a data point that does not conform with the rest of the data, making it very similar to the definition of an extreme value. Six repre-sentative machine learning-based unsupervised anomaly detection algorithms have been tested in this study. They include iforest, minimum covariance determinant, one-class support vector machine, k-nearest neighbours, local outlier factor, and connectivity-based outlier factor. The extremes identified by these algorithms are then fitted to extreme value distributions for both univariate and bivariate frameworks. These algorithms were tested on a large set of traffic conflict data collected for four weekdays (6 am to 6 pm) from three four-legged intersections in Brisbane, Australia. Results indicate that the proposed hybrid models con-sistently outperform the conventional extreme value models, which use block maxima and peak-over-threshold as the underlying sampling technique. Among the sampling algo-rithms, iforest has been found to perform better than other algorithms in estimating crash risks from traffic conflicts. The proposed hybrid modelling framework represents a methodological advancement in traffic conflict-based crash estimation models and opens new avenues for exploring the possibility of utilising machine learning techniques within the existing traffic conflict techniques.(c) 2022 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available