4.7 Article

Simulation of dew point temperature in different time scales based on grasshopper algorithm optimized extreme gradient boosting

Journal

JOURNAL OF HYDROLOGY
Volume 606, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.jhydrol.2022.127452

Keywords

Dew point temperature; Grasshopper optimization algorithm; Extreme gradient boosting; Time scale; Cross-validation

Funding

  1. Program of the National Natural Science Foundation of China (NSFC) [51879196, 51790533]

Ask authors/readers for more resources

This study evaluated the ability of a new machine learning model to estimate dew point temperature and compared it with two other models. The results showed that the new model performed best in terms of accuracy and stability. The actual vapor pressure was found to be the most practical input. Additionally, the study found that the models performed similarly at hourly and daily scales, but the hourly scale was better for evaluating simulations in extreme situations.
Dew point temperature (Tdew) plays an important role in hydrology, meteorology, and other related research. This study evaluated the ability of a new machine learning model (hybrid extreme gradient boosting with grasshopper optimization algorithm (GOA-XGBoost)) to estimate Tdew and compared it with two other tree-based models (XGBoost and random forest (RF)). We collected meteorological data namely actual vapor pressure (ea), maximum air temperature (Tmax), minimum air temperature (Tmin), maximum relative humidity (RHmax), minimum relative humidity (RHmin), atmospheric pressure (Pa), 2 m high wind speed (Ud), during 2016-2019 on daily and hourly time scales from the Sijiqinglin station in China to train, test, and validate each model. The results showed that the GOA-XGBoost model performed best, and the RF model had severe over-fitting problems during the validation phase at daily time scale. The models showed the best accuracy and stability when the input was ea (on average R2 =1.000, RMSE = 0.296 degrees C, MBE = 0.001 degrees C, MAE = 0.167 degrees C, and KGE = 0.991). The models had more significant errors when the inputs were Tmax, Tmin (on average R2 = 0.721, RMSE = 6.756 degrees C, MBE = -0.101 degrees C, MAE = 5.071 degrees C, and KGE = 0.771). The estimation loss exhibited by the models were similar for the hourly and daily scale patterns. T and RH were the most basic meteorological factors and adding extraneous factors would affect the estimation accuracy of the model. The variability of meteorological data varied less on an hourly scale than on a daily scale. Therefore, the accuracy of the models was higher, but the data set and the volume of operations became larger. This led to a possible reduction in model stability, but the hourly scales are better suited for assessing the effects of simulations in extreme situations. Taking accuracy and stability into account, the GOA-XGBoost model was the best model and the most practical input for both time scales was ea. Therefore, in subsequent studies, the GOA-XGBoost model can be combined with the input ea to estimate Tdew accurately.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available