☆ 4.8 Article

The creation and characterisation of a National Compound Collection: the Royal Society of Chemistry pilot

CHEMICAL SCIENCE (2016)

期刊

CHEMICAL SCIENCE

卷 7, 期 6, 页码 3869-3878

出版社

ROYAL SOC CHEMISTRY

DOI: 10.1039/c6sc00264a

关键词

类别

Chemistry, Multidisciplinary

资金

University of Bristol
Research and Enterprise Development group
Elizabeth Blackwell University Research Institute
Royal Society of Chemistry

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

We present a summary of the National Compound Collection (NCC) pilot; which harvested chemical structure data from 746 publicly-available PhD theses to create an enhanced database of diverse and interesting (largely organic) molecular entities. The database comprised similar to 75 000 structure entries, of which 70% were new to ChemSpider at the time of upload. The dataset was evaluated for structural uniqueness by twelve external drug discovery groups from the pharmaceutical, biotech, academic and not-for-profit sectors. These partners generated data reported here comparing the NCC pilot with their in-house compound collections. The proportion of NCC structures considered to be useful for drug discovery ranged from 5-80% depending on the strictness of the filters used; most interestingly from a drug discovery standpoint similar to 13k NCC compounds (18% of the NCC) passed the filters and were of good diversity. These compounds are quite different from those that are already present in the screening collections but not so different that they are no longer considered to be drug-like. In general, the drug discovery teams would consider these compounds to be high value molecules for inclusion in their screening collections. This pilot addressed the potential value of unpublished data and explored the practicalities of large-scale data extraction, to inform both retrospective and prospective extraction of chemical data from theses.

The creation and characterisation of a National Compound Collection: the Royal Society of Chemistry pilot

期刊

CHEMICAL SCIENCE

出版社

ROYAL SOC CHEMISTRY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

The creation and characterisation of a National Compound Collection: the Royal Society of Chemistry pilot

期刊

CHEMICAL SCIENCE

出版社

ROYAL SOC CHEMISTRY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文