4.6 Article

An Alternative to Laboratory Testing: Random Forest-Based Water Quality Prediction Framework for Inland and Nearshore Water Bodies

Journal

WATER
Volume 13, Issue 22, Pages -

Publisher

MDPI
DOI: 10.3390/w13223262

Keywords

water quality prediction; machine learning; total nitrogen; random forest; google earth engine

Funding

  1. 2020 Li Ka Shing Foundation Cross-Disciplinary Research Grant [2020LKSFG08D]
  2. Shantou University Scientific Research Start-up Fund Project [NTF18024]
  3. Guangdong province special fund for science and technology (major special projects + task list) project [2019ST043, 210715156881689]

Ask authors/readers for more resources

Water quality monitoring is essential for water environment management, and in this study, a framework utilizing water quality variables to predict total nitrogen concentrations was designed, with random forest algorithm performing the best among different machine learning models tested.
Water quality monitoring plays a vital role in the water environment management, while efficient monitoring provides direction and verification of the effectiveness of water management. Traditional water quality monitoring for a variety of water parameters requires the placement of multiple sensors, and some water quality data (e.g., total nitrogen (TN)) requires testing instruments or laboratory analysis to obtain results, which takes longer than the sensors. In this paper, we designed a water quality prediction framework, which uses available water quality variables (e.g., temperature, pH, conductivity, etc.) to predict total nitrogen concentrations in inland water bodies. The framework was also used to predict nearshore seawater salinity and temperature using remote sensing bands. We conducted experiments on real water quality datasets and random forest was chosen to be the core algorithm of the framework by comparing and analyzing the performance of different machine learning algorithms. The results show that among all tested machine learning models, random forest performs the best. The data prediction error rate of the random forest model in predicting the total nitrogen concentration in inland rivers was 4.9%. Moreover, to explore the prediction effect of random forest algorithm when the independent variable is non-water quality data, we took the reflectance of remote sensing bands as the independent variables and successfully inverted the salinity distribution of Shenzhen Bay in the Google Earth Engine (GEE) platform. According to the experimental results, the random forest-based water quality prediction framework can achieve 92.94% accuracy in predicting the salinity of nearshore waters.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available