4.4 Article

Quantitative Regression Models for the Prediction of Chemical Properties by an Efficient Workflow

Journal

MOLECULAR INFORMATICS
Volume 34, Issue 10, Pages 679-688

Publisher

WILEY-V C H VERLAG GMBH
DOI: 10.1002/minf.201400119

Keywords

Quantitative regression models; KNIME workflow; Toxicity prediction; Aqueous solubility

Funding

  1. 863 Project [2012AA020308]
  2. National Natural Science Foundation of China [s81273438, 81373329]
  3. Innovation Program of Shanghai Municipal Education Commission [13ZZ044]

Ask authors/readers for more resources

Rapid safety assessment is more and more needed for the increasing chemicals both in chemical industries and regulators around the world. The traditional experimental methods couldn't meet the current demand any more. With the development of the information technology and the growth of experimental data, in silico modeling has become a practical and rapid alternative for the assessment of chemical properties, especially for the toxicity prediction of organic chemicals. In this study, a quantitative regression workflow was built by KNIME to predict chemical properties. With this regression workflow, quanti-tative values of chemical properties can be obtained, which is different from the binary-classification model or multiclassification models that can only give qualitative results. To illustrate the usage of the workflow, two predictive models were constructed based on datasets of Tetrahymena pyriformis toxicity and Aqueous solubility. The qcv 2 and qtest 2 of 5-fold cross validation and external validation for both types of models were greater than 0.7, which implies that our models are robust and reliable, and the workflow is very convenient and efficient in prediction of various chemical properties.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available