4.6 Article

An Integrated Statistical-Machine Learning Approach for Runoff Prediction

Journal

SUSTAINABILITY
Volume 14, Issue 13, Pages -

Publisher

MDPI
DOI: 10.3390/su14138209

Keywords

MARS; SVM; RF; rainfall; runoff; rainfall-runoff modeling

Funding

  1. Department of Soil and Water Conservation Engineering, G.B. Pant University of Agriculture and Technology, Pantnagar, Uttarakhand, India
  2. Portuguese Foundation for Science and Technology (FCT)

Ask authors/readers for more resources

This study uses data-driven models for rainfall-runoff prediction in the Gola watershed and evaluates the performance of four heuristic methods. The results show that the RF model outperforms the other models and has higher accuracy.
Nowadays, great attention has been attributed to the study of runoff and its fluctuation over space and time. There is a crucial need for a good soil and water management system to overcome the challenges of water scarcity and other natural adverse events like floods and landslides, among others. Rainfall-runoff (R-R) modeling is an appropriate approach for runoff prediction, making it possible to take preventive measures to avoid damage caused by natural hazards such as floods. In the present study, several data-driven models, namely, multiple linear regression (MLR), multiple adaptive regression splines (MARS), support vector machine (SVM), and random forest (RF), were used for rainfall-runoff prediction of the Gola watershed, located in the south-eastern part of the Uttarakhand. The rainfall-runoff model analysis was conducted using daily rainfall and runoff data for 12 years (2009 to 2020) of the Gola watershed. The first 80% of the complete data was used to train the model, and the remaining 20% was used for the testing period. The performance of the models was evaluated based on the coefficient of determination (R-2), root mean square error (RMSE), Nash-Sutcliffe efficiency (NSE), and percent bias (PBAIS) indices. In addition to the numerical comparison, the models were evaluated. Their performances were evaluated based on graphical plotting, i.e., time-series line diagram, scatter plot, violin plot, relative error plot, and Taylor diagram (TD). The comparison results revealed that the four heuristic methods gave higher accuracy than the MLR model. Among the machine learning models, the RF (RMSE (m(3)/s), R-2, NSE, and PBIAS (%) = 6.31, 0.96, 0.94, and -0.20 during the training period, respectively, and 5.53, 0.95, 0.92, and -0.20 during the testing period, respectively) surpassed the MARS, SVM, and the MLR models in forecasting daily runoff for all cases studied. The RF model outperformed in all four models' training and testing periods. It can be summarized that the RF model is best-in-class and delivers a strong potential for the runoff prediction of the Gola watershed.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available