4.7 Article

Spatiotemporal ozone pollution LUR models: Suitable statistical algorithms and time scales for a megacity scale

期刊

ATMOSPHERIC ENVIRONMENT
卷 237, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.atmosenv.2020.117671

关键词

Ozone; Land use regression; General additive mixed model; Random forest; Epidemiological study; Spatiotemporal

向作者/读者索取更多资源

Ambient air ozone (O-3), a secondary photochemical pollutant, is seriously harmful to human health. Accurate estimation of O-3 exposure requires the ability to monitor O-3 surface concentration with a high spatiotemporal resolution. Several spatiotemporal land use regression (LUR) models have integrated meteorological factors based on different statistical algorithms to support such epidemiological studies. From among such various existing statistical algorithms, we aim to identify a high-efficiency modeling method, as well as the most suitable lengths of the modeling period (time scale). Three types of typical spatiotemporal LUR models based on parametric, semi-parametric, and non-parametric statistic methods, respectively, are considered to predict daily ground-level O-3 in the megacity of Tianjin, China. Based on monthly, seasonal (cold and warm), and annual time scales, these models include: a series of monthly hybrid LUR (Two-stage) models consisting of two sub-models based on the multiple linear regression (MLR) algorithm, general additive mixed models (GAMMs), and land use random forest (LURF) models. Leave-one-out cross-validation was performed to evaluate the temporal and spatial predictive accuracy of each model using the adjusted coefficient of determination (adjR(CV)(2)) and root mean square error (RMSECV). In the GAMMs and LURF models, models using a shorter time scale (monthly models) outperformed those using a longer one. In monthly models, the GAMMs performed the best, with the highest average adjR(CV)(2) (0.747) and the lowest average RMSECV (15.721 mu g/m(3)), followed by the LURF models (average adjR(CV)(2) = 0.695, average RMSECV = 16.405), and the Two-stage models (average adjR(CV)(2) = 0.466, average RMSECV = 23.934). Thus, the modeling format consisting of a shorter time scale and the GAMM algorithm performs relatively well in predicting daily O-3 pollution on a megacity scale. These findings can be used to select appropriate modeling methods for epidemiological research of O-3 pollution.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据