4.7 Article

AutoGraph: Autonomous Graph-Based Clustering of Small-Molecule Conformations

Journal

JOURNAL OF CHEMICAL INFORMATION AND MODELING
Volume 61, Issue 4, Pages 1647-1656

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.jcim.0c01492

Keywords

-

Funding

  1. NIH [1U2CES030167-01]

Ask authors/readers for more resources

The study introduces an automated conformational clustering algorithm that reduces predefined cluster numbers or thresholds, while preserving geometric/energetic correlations. Automating conformational clustering may alleviate human biases and allow flexibility.
While accurately modeling the conformational ensemble is required for predicting properties of flexible molecules, the optimal method of obtaining the conformational ensemble appears as varied as their applications. Ensemble structures have been modeled by generation, refinement, and clustering of conformations with a sufficient number of samples. We present a conformational clustering algorithm intended to automate the conformational clustering step through the Louvain algorithm, which requires minimal hyperparameters and importantly no predefined number of clusters or threshold values. The conformational graphs produced by this method for O-succinyl-L-homoserine, oxidized nicotinamide adenine dinucleotide, and 200 representative metabolites each preserved the geometric/energetic correlation expected for points on the potential energy surface. Clustering based on these graphs provides partitions informed by the potential energy surface. Automating conformational clustering in a workflow with AutoGraph may mitigate human biases introduced by guess and check over hyperparameter selection while allowing flexibility to the result by not imposing predefined criteria other than optimizing the model's loss function. Associated codes are available at littps://github.corn/TanemuraKiyoto/AutoGraph.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available