4.8 Article

Graph Convolutional Networks for Improved Prediction and Interpretability of Chromatographic Retention Data

Journal

ANALYTICAL CHEMISTRY
Volume 93, Issue 47, Pages 15633-15641

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.analchem.1c02988

Keywords

-

Funding

  1. Research Foundation Flanders (FWO)
  2. Walloon Fund for Scientific Research (FNRS) [30897864]
  3. VLAIO O&O project Amedes [AIO/HBC.2017.0996/AMEDES]
  4. VLAIO [HBC.2020.2205]
  5. Janssen Pharmaceuticals

Ask authors/readers for more resources

This study utilized graph convolutional networks (GCNs) to optimize molecular representations and predict the retention times of molecules in different chromatographic data sets. The performance of GCNs was found to be superior, significantly outperforming other benchmarks, or performing similarly to them. Saliency maps revealed significant differences in important molecular sub-structures for predictions in different chromatographic data sets.
Machine learning is a popular technique to predict the retention times of molecules based on descriptors. Descriptors and associated labels (e.g., retention times) of a set of molecules can be used to train a machine learning algorithm. However, descriptors are fixed molecular features which are not necessarily optimized for the given machine learning problem (e.g., to predict retention times). Recent advances in molecular machine learning make use of so-called graph convolutional networks (GCNs) to learn molecular representations from atoms and their bonds to adjacent atoms to optimize the molecular representation for the given problem. In this study, two GCNs were implemented to predict the retention times of molecules for three different chromatographic data sets and compared to seven benchmarks (including two state-of-the art machine learning models). Additionally, saliency maps were computed from trained GCNs to better interpret the importance of certain molecular sub-structures in the data sets. Based on the overall observations of this study, the GCNs performed better than all benchmarks, either significantly outperforming them (5-25% lower mean absolute error) or performing similar to them (<5% difference). Saliency maps revealed a significant difference in molecular sub-structures that are important for predictions of different chromatographic data sets (reversed-phase liquid chromatography vs hydrophilic interaction liquid chromatography).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available