4.6 Article

Prediction of Antimalarial Drug-Decorated Nanoparticle Delivery Systems with Random Forest Models

Journal

BIOLOGY-BASEL
Volume 9, Issue 8, Pages -

Publisher

MDPI
DOI: 10.3390/biology9080198

Keywords

decorated nanoparticles; drug delivery; antimalarial compounds; big data; Perturbation Theory; Machine Learning; ChEMBL database

Categories

Funding

  1. Consolidation and Structuring of Competitive Research Units-Competitive Reference Groups - Ministry of Education, University and Vocational Training of Xunta de Galicia [ED431C 2018/49]
  2. EU FEDER funds

Ask authors/readers for more resources

Drug-decorated nanoparticles (DDNPs) have important medical applications. The current work combined Perturbation Theory with Machine Learning and Information Fusion (PTMLIF). Thus, PTMLIF models were proposed to predict the probability of nanoparticle-compound/drug complexes having antimalarial activity (against Plasmodium). The aim is to save experimental resources and time by using a virtual screening for DDNPs. The raw data was obtained by the fusion of experimental data for nanoparticles with compound chemical assays from the ChEMBL database. The inputs for the eight Machine Learning classifiers were transformed features of drugs/compounds and nanoparticles as perturbations of molecular descriptors in specific experimental conditions (experiment-centered features). The resulting dataset contains 107 input features and 249,992 examples. The best classification model was provided by Random Forest, with 27 selected features of drugs/compounds and nanoparticles in all experimental conditions considered. The high performance of the model was demonstrated by the mean Area Under the Receiver Operating Characteristics (AUC) in a test subset with a value of 0.9921 +/- 0.000244 (10-fold cross-validation). The results demonstrated the power of information fusion of the experimental-centered features of drugs/compounds and nanoparticles for the prediction of nanoparticle-compound antimalarial activity. The scripts and dataset for this project are available in the open GitHub repository.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available