☆ 4.6 Article

Benchmarks for interpretation of QSAR models

JOURNAL OF CHEMINFORMATICS (2021)

Journal

JOURNAL OF CHEMINFORMATICS

Volume 13, Issue 1, Pages -

Publisher

BMC

DOI: 10.1186/s13321-021-00519-x

Keywords

QSAR model interpretation; Benchmark data set; Synthetic data set; Interpretability metrics; Atom contributions; Graph convolutional neural networks

Categories

Chemistry, Multidisciplinary; Computer Science, Information Systems; Computer Science, Interdisciplinary Applications

Funding

European Regional Development Fund-Project ENOCH [CZ.02.1.01/0.0/0.0/16_019/0000868]
ELIXIR CZ research infrastructure project (MEYS) [LM2018131]
Technology Agency of the Czech Republic [TN01000013]

Summary

Interpreting QSAR models is crucial for understanding complex processes and for guiding model validation. This study develops benchmark data sets of differing complexity for evaluating interpretation methods and proposes quantitative metrics for assessing interpretation performance. The benchmarks are applied to conventional models and neural networks, and they aid the evaluation of interpretation methods and the investigation of decision making in complex black-box models.

Abstract

Interpretation of QSAR models helps to understand the complex nature of biological or physicochemical processes, guide structural optimization, or perform knowledge-based validation of QSAR models. Highly predictive models are usually complex and their interpretation is non-trivial. This is particularly true for modern neural networks. Various approaches to the interpretation of these models exist. However, it is difficult to evaluate and compare the performance and applicability of these ever-emerging methods. Herein, we developed several benchmark data sets with end-points determined by pre-defined patterns. These data sets are intended for evaluating how well interpretation approaches retrieve these patterns. They represent tasks with different complexity levels: from simple atom-based additive properties to pharmacophore hypotheses. We proposed several quantitative metrics of interpretation performance. The applicability of the benchmarks and metrics was demonstrated on a set of conventional models and end-to-end graph convolutional neural networks, interpreted by the previously suggested universal ML-agnostic approach for structural interpretation. We anticipate that these benchmarks will be useful for evaluating new interpretation approaches and investigating the decision making of complex black-box models.
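To make the idea of an end-point determined by a pre-defined pattern concrete, the following is a minimal sketch in plain Python. It assumes the simplest case named in the abstract, an atom-based additive property, and takes the count of nitrogen atoms as a hypothetical end-point. The toy molecule, the estimated contributions, and the function names `true_contributions`, `endpoint`, `top_n_score` and `rmse` are illustrative assumptions, and the two scores are plausible examples of quantitative interpretation metrics, not necessarily the definitions used in the paper.

```python
import math
from typing import List, Sequence


def true_contributions(atoms: Sequence[str], target: str = "N") -> List[float]:
    """Ground-truth atom contributions for an additive end-point:
    each target atom contributes 1, every other atom contributes 0."""
    return [1.0 if atom == target else 0.0 for atom in atoms]


def endpoint(atoms: Sequence[str], target: str = "N") -> float:
    """Synthetic end-point: the sum of the ground-truth atom contributions."""
    return sum(true_contributions(atoms, target))


def top_n_score(truth: Sequence[float], estimated: Sequence[float]) -> float:
    """Fraction of truly contributing atoms found among the n atoms with the
    highest estimated contributions, where n is the number of true atoms."""
    n = sum(1 for t in truth if t > 0)
    if n == 0:
        raise ValueError("molecule has no contributing atoms")
    ranked = sorted(range(len(estimated)), key=lambda i: estimated[i], reverse=True)
    hits = sum(1 for i in ranked[:n] if truth[i] > 0)
    return hits / n


def rmse(truth: Sequence[float], estimated: Sequence[float]) -> float:
    """Root-mean-square error between true and estimated contributions."""
    squared = [(t - e) ** 2 for t, e in zip(truth, estimated)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical toy molecule, given as a list of heavy-atom element symbols.
atoms = ["C", "C", "N", "O", "N", "C"]
truth = true_contributions(atoms)

# Hypothetical atom contributions returned by some interpretation method.
estimated = [0.1, 0.0, 0.9, 0.6, 0.4, 0.05]

print("end-point:", endpoint(atoms))
print("top-n score:", top_n_score(truth, estimated))
print("RMSE:", round(rmse(truth, estimated), 3))
```

Because the ground-truth contributions are known by construction, any interpretation method can be scored against them without relying on expert judgment. In this toy case the method ranks the oxygen above one of the nitrogens, so only one of the two true atoms is retrieved.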
