4.7 Article

Performance of linear mixed models and random forests for spatial prediction of soil pH

Journal

GEODERMA
Volume 397, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.geoderma.2021.115079

Keywords

Linear mixed models; REML-EBLUP; Random forests; Spatial prediction of soil pH

Categories

Funding

  1. Zambian Ministry of Higher Education
  2. Commonwealth Scholarships Commission - UK government

Ask authors/readers for more resources

The study aimed to map the spatial variation of soil pH across Zambia and found that the empirical best linear unbiased predictor performed better than other methods, with random forests algorithm prone to selecting important spatially correlated variables.
Digital soil maps describe the spatial variation of soil and provide important information on spatial variation of soil properties which provides policy makers with a synoptic view of the state of the soil. This paper presents a study to tackle the task of how to map the spatial variation of soil pH across Zambia. This was part of a project to assess suitability for rice production across the country. Legacy data on the target variable were available along with additional exhaustive environmental covariates as potential predictor variables. We had the option of undertaking spatial prediction by geostatistical or machine learning methods. We set out to compare the approaches from the selection of predictor variables through to model validation, and to test the predictors on a set of validation observations. We also addressed the problem of how to robustly validate models from legacy data when these have, as is often the case, a strongly clustered spatial distribution. The validation statistics results showed that the empirical best linear unbiased predictor (EBLUP) with the only fixed effect a constant mean (ordinary kriging) performed better than the other methods. Random forests had the largest model-based estimates of the expected squared errors. We also noticed that the random forest algorithm was prone to select as important spatially correlated random variables which we had simulated.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available