4.5 Article

Hybrid XGboost model with various Bayesian hyperparameter optimization algorithms for flood hazard susceptibility modeling

Journal

GEOCARTO INTERNATIONAL
Volume 37, Issue 25, Pages 8273-8292

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/10106049.2021.1996641

Keywords

Flood hazard; Bayesian hyperparameter algorithms; Extreme Gradient Boosting; s Kan watershed

Ask authors/readers for more resources

The study developed an optimal model for flood susceptibility mapping in the Kan watershed, Tehran, Iran using three Bayesian optimization hyper-parameter algorithms and evaluated model performance using ROC curves. The evaluation results showed that Bayesian hyperparameter optimization methods significantly increased the efficiency of the XGB model for predicting flood hazard.
The purpose of this investigation is to develop an optimal model to flood susceptibility mapping in the Kan watershed, Tehran, Iran. Therefore, in this study, three Bayesian optimization hyper-parameter algorithms including Upper confidence bound (UCB), Probability of improvement (PI) and Expected improvement (EI) in order to Extreme Gradient Boosting (XGB) machine learning model optimization and Extreme randomize tree (ERT) model for modeling flood hazard were used. In order to perform flood susceptibility mapping, 118 historic flood locations were identified and analyzed using 17 geo-environmental explanatory variables to predict flooding susceptibility. Flood locations data were divided into 70% for training and 30% for testing of models developed. The receiver operating characteristic (ROC) curve parameters were used to evaluate the performance of the models. The evaluation results based on the criterion area under curve (AUC) in the testing stage showed that the ERT and XGB models have efficiencies of 91.37% and 91.95%, respectively. The evaluation of the efficiency of Bayesian hyperparameters optimization methods on the XGB model also showed that these methods increase the efficiency of the XGB model, so that the model efficiency using these methods EI-XGB, POI-XGB and UCB-XGB based on the AUC in the testing stage were 95.89%, 96.87% and 96.38%, respectively. The results of the relative importance of the five models shows that the variables of elevation and distance from the river are the significant compared to other variables in predicting flood hazard in the Kan watershed.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available