4.6 Article

Detecting influential observations by cluster analysis and Monte Carlo cross-validation

期刊

ANALYST
卷 135, 期 11, 页码 2841-2847

出版社

ROYAL SOC CHEMISTRY
DOI: 10.1039/c0an00345j

关键词

-

资金

  1. National Natural Science Foundation of China [20835002]
  2. Ministry of Science and Technology (MOST) of China [2008 DFA32250]
  3. British Columbia Innovation Council
  4. Natural Sciences and Engineering Research Council of Canada

向作者/读者索取更多资源

The detection of influential observations is an essential step for building high performance models and has been recognized as an important and challenging task in many industrial and laboratorial applications. A new approach for detecting influential observations is developed based on their effect on partial least squares (PLS) modeling. In this method, we build a large number of PLS models by using Monte Carlo cross-validation (MCCV), and then perform principal component analysis (PCA) on the regression coefficients of these models. Because a model with influential observations is different from the one without influential observation, the series of PLS models cluster into different groups in principal component (PC) spaces, based on the different number of influential observations they contain. The influential observations can be therefore recognized according to the frequency number of each sample in each group. By three examples quantitatively modeling near-infrared (NIR) and Raman spectra, it was shown that the method can detect the influential observations intuitively and veraciously.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据