Article

InChIKey collision resistance: an experimental testing

Journal

JOURNAL OF CHEMINFORMATICS
Volume 4, Issue -, Pages -

Publisher

BIOMED CENTRAL LTD
DOI: 10.1186/1758-2946-4-39


InChIKey is a 27-character compacted (hashed) version of InChI, intended for Internet and database searching and indexing, and is based on an SHA-256 hash of the InChI character string. The first block of InChIKey encodes the molecular skeleton, while the second block represents various kinds of isomerism (stereo, tautomeric, etc.). InChIKey is designed to be a nearly unique substitute for the parent InChI. However, a single InChIKey may occasionally map to two or more InChI strings (a collision). The occurrence of a collision does not in itself compromise the signature, since collision-free hashing is impossible; the only viable approach is to set and maintain a level of collision resistance that is sufficient for typical applications. We tested, in computational experiments, how well the real-life collision resistance of InChIKey corresponds to the theoretical estimates expected by design. For this purpose, we analyzed the statistical characteristics of InChIKey for datasets of variable size and compared them with the theoretical statistical frequencies. For the relatively short second block, we performed exhaustive direct testing: we computed the numbers of collisions for the stereoisomers of Spongistatin I (using the whole set of 67,108,864 isomers and its subsets) and compared them with theory. For the longer first block, we used custom-made software to generate InChIKeys for more than 3 × 10^10 chemical structures. The statistical behavior of this block was tested by comparing experimental and theoretical frequencies for the various four-letter sequences that may appear in the first block body. From the results of our computational experiments, we conclude that the observed characteristics of InChIKey collision resistance are in good agreement with theoretical expectations.
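To give a sense of the kind of theoretical estimate the abstract refers to, the following minimal sketch applies the standard birthday-problem approximation to the second block. It assumes an ideal, uniformly distributed hash and a second-block width of 37 bits; that width comes from the InChI technical documentation rather than from this abstract, and the function names are hypothetical. It is not the authors' own software or procedure.

```python
from math import expm1


def expected_colliding_pairs(n_items: int, hash_bits: int) -> float:
    """Expected number of item pairs sharing a hash value.

    Assumes an ideal hash that spreads items uniformly over
    2**hash_bits possible values (birthday-problem estimate).
    """
    space = 2 ** hash_bits
    return n_items * (n_items - 1) / (2 * space)


def probability_of_any_collision(n_items: int, hash_bits: int) -> float:
    """Approximate chance of at least one collision: 1 - exp(-pairs)."""
    return -expm1(-expected_colliding_pairs(n_items, hash_bits))


# Assumption: the second InChIKey block carries 37 bits of the hash
# (figure taken from the InChI technical documentation, not this abstract).
SECOND_BLOCK_BITS = 37

# Number of Spongistatin I stereoisomers quoted in the abstract (2**26).
N_ISOMERS = 67_108_864

pairs = expected_colliding_pairs(N_ISOMERS, SECOND_BLOCK_BITS)
p_any = probability_of_any_collision(N_ISOMERS, SECOND_BLOCK_BITS)

print(f"expected colliding pairs: {pairs:.1f}")
print(f"probability of at least one collision: {p_any:.6f}")
```

Under these assumptions, the full isomer set is expected to yield roughly 2^14 colliding pairs in the second block alone, which illustrates why that block could be tested exhaustively while the longer first block required a statistical approach.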


Recommended

Article Chemistry, Multidisciplinary

The method of appearing solvent: Extraction of metal ions from aqueous solutions into in situ forming ionic liquid

S. V. Smirnova, T. O. Samarina, D. V. Il'in, I. V. Pletnev, Yu. A. Zolotov

DOKLADY CHEMISTRY (2016)

Article Chemistry, Medicinal

Algorithmic Analysis of Cahn-Ingold-Prelog Rules of Stereochemistry: Proposals for Revised Rules and a Guide for Machine Implementation

Robert M. Hanson, Sophia Musacchi, John W. Mayfield, Mikko J. Vainio, Andrey Yerin, Dmitry Redkin

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2018)

Article Electrochemistry

Highly selective solid-state sensor for iodide based on the combined use of platinum (IV) phthalocyanine and solidified pyridinium ionic liquid

Natalya V. Shvedene, Mikhail N. Abashev, Suren A. Arakelyan, Katerina N. Otkidach, Larisa G. Tomilova, Igor V. Pletnev

JOURNAL OF SOLID STATE ELECTROCHEMISTRY (2019)

Review Chemistry, Analytical

New Directions in Using Ionic Liquids in Analytical Chemistry. 1: Liquid-Liquid Extraction

I. V. Pletnev, S. V. Smirnova, N. V. Shvedene

JOURNAL OF ANALYTICAL CHEMISTRY (2019)

Review Chemistry, Analytical

New Directions in Using Ionic Liquids in Analytical Chemistry. 2: Electrochemical Methods

I. V. Pletnev, S. V. Smirnova, N. V. Shvedene

JOURNAL OF ANALYTICAL CHEMISTRY (2019)

Article Chemistry, Multidisciplinary

Nomenclature for boranes and related species (IUPAC Recommendations 2019)

Michael A. Beckett, Bernd Brellochs, Igor T. Chizhevsky, Ture Damhus, Karl-Heinz Hellwich, John D. Kennedy, Risto Laitinen, Warren H. Powell, Daniel Rabinovich, Clara Vinas, Andrey Yerin

PURE AND APPLIED CHEMISTRY (2020)

Article Chemistry, Multidisciplinary

Brief guide to the nomenclature of organic chemistry (IUPAC Technical Report)

Karl-Heinz Hellwich, Richard M. Hartshorn, Andrey Yerin, Ture Damhus, Alan T. Hutton

PURE AND APPLIED CHEMISTRY (2020)

Article Chemistry, Analytical

Extraction and ICP-OES determination of heavy metals using tetrabutylammonium bromide aqueous biphasic system and oleophilic collector

Svetlana Smirnova, Dmitry Ilin, Igor Pletnev

Summary: This study reported the preconcentration of Cd(II), Co(II), Cu(II), Ni(II), Pb(II), and Zn(II) in aqueous biphasic system using TBAB - H2O - (NH4)2SO4, followed by ICP-OES determination for the first time. The method allowed for high preconcentration factor and decreased detection limits for ICP-OES determination of heavy metals.

TALANTA (2021)

Article Chemistry, Analytical

Extraction and determination of synthetic food dyes using tetraalkylammonium based liquid-liquid extraction

Svetlana Smirnova, Kristina A. Lyskovtseva, Igor Pletnev

Summary: Tetraalkylammonium based liquid-liquid biphasic systems were utilized for the extraction and spectrophotometric determination of synthetic food dyes. The study examined various factors affecting the extraction process and established optimal conditions, leading to nearly 100% recovery rates and low detection limits for all analytes under study.

MICROCHEMICAL JOURNAL (2021)

Article Chemistry, Multidisciplinary

InChI version 1.06: now more than 99.99% reliable

Jonathan M. Goodman, Igor Pletnev, Paul Thiessen, Evan Bolton, Stephen R. Heller

Summary: The software for the IUPAC Chemical Identifier, InChI, is highly reliable and has been upgraded to version 1.06 with significant new features including support for pseudo-element atoms and improved description of polymers. Research results show that the accuracy of version 1.05 was 99.996% and version 1.06 represents a step closer to perfection, with few applications needing changes as a result of the upgrade.

JOURNAL OF CHEMINFORMATICS (2021)

Article Chemistry, Multidisciplinary

Terminology and the naming of conjugates based on polymers or other substrates (IUPAC Recommendations 2021)

Michel Vert, Jiazhong Chen, Andrey Yerin, Karl-Heinz Hellwich, Roger C. Hiorns, Richard Jones, Graeme Moad, Gerard P. Moss

Summary: The new IUPAC naming system provides detailed rules for unambiguous and easy naming of any conjugate, primarily applicable to polymer conjugates but also suitable for naming conjugates with other substrates. It should be used when recognition of the substrate and active substance is essential, and when constraints of name length make the otherwise preferred IUPAC nomenclatures untenable.

PURE AND APPLIED CHEMISTRY (2022)

Review Chemistry, Multidisciplinary

New generation extraction solvents: from ionic liquids and aqueous biphasic systems to deep eutectic solvents

Igor Pletnev, Svetlana Smirnova, Andrei Sharov, Yury A. Zolotov

Summary: This review focuses on new generation extraction solvents, including ionic liquids, aqueous biphasic systems, and eutectic solvents. These solvents based on organic ionic components offer environmental safety and green character, but face challenges in uniform terminology and interpreting system behavior.

RUSSIAN CHEMICAL REVIEWS (2021)

Review Chemistry, Inorganic & Nuclear

The past, present, and future in the nomenclature and structure representation of inorganic compounds

Richard M. Hartshorn, Andrey Yerin

DALTON TRANSACTIONS (2019)

Review Chemistry, Analytical

New Ionic Liquids for Extraction Preconcentration

S. V. Smirnova, I. V. Pletnev

JOURNAL OF ANALYTICAL CHEMISTRY (2019)
