Article

Stacking- and voting-based ensemble deep learning models (SEDL and VEDL) and active learning (AL) for mapping land subsidence

Journal

ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH
Volume 30, Issue 10, Pages 26580-26595

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s11356-022-24065-7

Keywords

Land subsidence; Stacking ensemble deep learning (SEDL) model; Model's interpretability; Active learning; Uncertainty; Southern Iran

This study presents a novel methodology for predicting land subsidence hazard and rate, and the associated uncertainty, in southern Iran. The methodology combines feature selection, ensemble deep learning models, and an active learning approach. The study finds that groundwater drawdown is the most important feature controlling land subsidence hazard. The developed models show high accuracy in predicting land subsidence rate.
This contribution presents a novel methodology based on feature selection, ensemble deep learning (EDL) models, and an active learning (AL) approach for predicting land subsidence (LS) hazard and rate, and the associated uncertainty, in an area covering two important plains, the Minab and Shamil-Nian plains, in Hormozgan province, southern Iran. The important features controlling LS hazard were identified by ridge regression. Then, two EDL models were constructed by stacking (SEDL) and voting (VEDL) five dense deep learning (DL) models (model 1 to model 5) for mapping LS hazard. Thereafter, predictive model performance was assessed with a precision-recall curve and a Kolmogorov-Smirnov (KS) plot. A partial dependence plot (PDP), individual conditional expectation plots (ICEP), game theory, and a sensitivity analysis were used to interpret the predictive DL model. According to SEDL, the more accurate of the two models, 34% (1624 km²), 14.7% (698 km²), and 19.2% (912 km²) of the total area were classified as very low, low, and moderate hazard, whereas 17.7% (845 km²) and 14.4% (683 km²) were classified as high and very high hazard, respectively. Based on all interpretability techniques, aquifer loss (groundwater drawdown) is the most important feature controlling LS hazard, and it has the greatest impact on the SEDL model output. Based on a Taylor diagram and R² as model performance indicators, SEDL-AL (with R² > 95% for the training and test datasets) performed better than SEDL for quantifying the LS rate, which ranged between 0 and 48.1 cm. The highest rate of LS occurred in the Minab plain, an area located downstream of the Minab Esteghlal dam. SEDL-AL was also used to quantify the uncertainty associated with the LS rate. The observed values fell within the predictions provided by SEDL-AL, which indicates a high accuracy of our predictive model.
Overall, our newly developed modeling techniques are helpful tools for the spatial mapping of LS susceptibility and rate, and its uncertainty.
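
The difference between the voting and stacking ensembles named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: small logistic regressions trained on different feature subsets stand in for the five dense DL models, the data are synthetic, and every name (`fit_logistic`, `voting_predict`, `stacking_predict`, and so on) is hypothetical. Voting here means averaging the base models' predicted probabilities; stacking means training a meta-learner on those probabilities using a separate hold-out split.

```python
import math
import random

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_logistic(X, y, lr=0.5, epochs=500):
    """Batch gradient-descent logistic regression; returns (weights, bias)."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                gw[j] += err * xi[j]
            gb += err
        w = [wj - lr * gj / n for wj, gj in zip(w, gw)]
        b -= lr * gb / n
    return w, b

def predict_proba(model, X):
    w, b = model
    return [sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) for xi in X]

def make_data(n, rng):
    # Synthetic stand-in for hazard data: label is 1 when x0 + x1 > 0.
    X = [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(n)]
    y = [1 if x[0] + x[1] > 0 else 0 for x in X]
    return X, y

def select(X, cols):
    return [[x[c] for c in cols] for x in X]

rng = random.Random(0)
X_base, y_base = make_data(150, rng)   # trains the base models
X_meta, y_meta = make_data(150, rng)   # hold-out split for the meta-learner
X_test, y_test = make_data(100, rng)

# Assumption: three base learners that differ only in the features they see.
feature_subsets = [[0], [1], [0, 1]]
base_models = [fit_logistic(select(X_base, c), y_base) for c in feature_subsets]

def base_outputs(X):
    # One row per sample, one column per base model's probability.
    per_model = [predict_proba(m, select(X, c))
                 for m, c in zip(base_models, feature_subsets)]
    return [list(row) for row in zip(*per_model)]

def voting_predict(X):
    # Soft voting: unweighted mean of the base probabilities.
    return [sum(row) / len(row) for row in base_outputs(X)]

# Stacking: the meta-learner is fitted on base outputs from the hold-out split.
meta_model = fit_logistic(base_outputs(X_meta), y_meta)

def stacking_predict(X):
    return predict_proba(meta_model, base_outputs(X))

def accuracy(probs, y):
    return sum((p > 0.5) == (t == 1) for p, t in zip(probs, y)) / len(y)

vote_acc = accuracy(voting_predict(X_test), y_test)
stack_acc = accuracy(stacking_predict(X_test), y_test)
print(f"voting accuracy:   {vote_acc:.2f}")
print(f"stacking accuracy: {stack_acc:.2f}")
```

Fitting the meta-learner on a split the base models never saw is a common way to keep the stacked model from simply rewarding overfitted base learners; whether the paper used a hold-out split or cross-validated predictions is not stated in the abstract.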
