4.5 Article

Nonparametric algorithm for identification of outliers in environmental data

期刊

JOURNAL OF CHEMOMETRICS
卷 32, 期 5, 页码 -

出版社

WILEY
DOI: 10.1002/cem.2997

关键词

change point analysis; data validation; kernel regression; local bandwidth; outliers

资金

  1. University of Defence [PASVRII - DZRO K110]
  2. Institute of Analytical Chemistry of the CAS, v. v. i [RVO: 68081715]
  3. Slovenian Research Agency [L7-5459, P1-0297]

向作者/读者索取更多资源

Outliers that can significantly affect data analysis are frequently present in environmental data sets. Most methods suggested for the detection of outliers impose restrictions on the distribution of analysed variables. However, in many environmental areas, the observed variable is influenced by a lot of different factors and its distribution is often difficult to find or cannot be estimated. Therefore, an approach for the identification of outliers in environmental time series based on nonparametric statistical techniques is presented. The core principle of the algorithm is to smoothen the data using nonparametric regression with variable bandwidth and subsequently analyse the residuals by nonparametric statistical methods. In the case that the distribution of the analysed variable is normal an efficient statistical method based on normality assumptions is presented as well. The proposed procedure is applied for the identification of outliers in hourly concentrations of particulate matter and verified by simulations. The simulation examples have shown that the presented method is suitable for effective detection of outliers that are deviated at least 7 standard deviations from the mean value of the neighbouring observations. The value of the proposed method is that it reduces the number of observations for manual evaluation and saves the time spent on data control.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据