4.7 Article

Adaptive weighted least square support vector machine regression integrated with outlier detection and its application in QSAR

期刊

出版社

ELSEVIER
DOI: 10.1016/j.chemolab.2009.05.008

关键词

Outlier; Robust 3 sigma principle; Weight; Least square support vector machine regression; Quantitative structure-activity relationships

资金

  1. National Natural Science Foundation of China [20776042]
  2. National High-Tech Research and Development Program of China [2007AA04Z164]

向作者/读者索取更多资源

In order to eliminate the influence of unavoidable outliers in training sample on a model's performance, a novel least square support vector machine regression, which combines outlier detection approach and adaptive weight value for the training sample. is proposed and named as adaptive weighted least square support vector machine regression (AWLS-SVM). Firstly, the effective robust 3 sigma principle is used to detect marked outliers for the training sample. Secondly, based on the training sample without marked outliers, least square support vector machine regression is employed to develop the model and the fitting error of each sample data is obtained. Thirdly, according to the fitting error of each sample data, the initial weight is calculated. The bigger the fitting error of sample data is, the smaller the weight value of the sample data. Thus, the potential outliers, which are not detected by the robust 3 sigma principle and have bigger fitting errors, have smaller weight values to reduce the influence of the potential outliers on the performance of model. Then, LS-SVM is applied for the weighted sample to develop the model again. Finally, via the proposed weight value iterative method, the weight values of the training sample are converged, and the model with good predicting performance is obtained. To illustrate the performance of AWLS-SVM. simulation experiment is designed to produce the training sample with marked outlier and some nonmarked outliers. AWLS-SVM, AWLS-SVM without the robust 3 sigma principle, LS-SVM with the robust 3 sigma principle, LS-SVM, and radial basis function network are applied to develop the model based on the designed sample. The results show that the influence of marked and un-marked outliers on the model's performance is eliminated by AWLS-SVM, and that the predicting performance of AWLS-SVM is the best. Furthermore, the AWLS-SVM method was applied to develop the quantitative structure-activity relationships (QSAR) model of HIV-1 protease inhibitors, and the satisfactory result was obtained. (c) 2009 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
Article Automation & Control Systems

Multi-modal hybrid modeling strategy based on Gaussian Mixture Variational Autoencoder and spatial-temporal attention: Application to industrial process prediction

Haifei Peng, Jian Long, Cheng Huang, Shibo Wei, Zhencheng Ye

Summary: This paper proposes a novel multi-modal hybrid modeling strategy (GMVAE-STA) that can effectively extract deep multi-modal representations and complex spatial and temporal relationships, and applies it to industrial process prediction.

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2024)