4.8 Article

Application of Bayesian Multilevel Modeling in the Quantitative Structure-Retention Relationship Studies of Heterogeneous Compounds

Journal

ANALYTICAL CHEMISTRY
Volume 93, Issue 18, Pages 6961-6971

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.analchem.0c05227

Keywords

-

Funding

  1. National Science Centre, Poland [2015/18/E/ST4/00449]

Ask authors/readers for more resources

A Bayesian multilevel model was proposed to characterize isocratic retention time data for 1026 heterogeneous analytes, considering the effects of molecular mass and functional groups. This model helps in understanding chromatographic data, quantifying the effect of functional groups on retention, and predicting analyte retention based on various types of preliminary data. Visualizing uncertainty in predictions and discussing the usefulness in decision making is also highlighted in the study.
Quantitative structure-retention relationships (QSRRs) are used in the field of chromatography to model the relationship between an analyte structure and chromatographic retention. Such models are typically difficult to build and validate for heterogeneous compounds because of their many descriptors and relatively limited analyte-specific data. In this study, a Bayesian multilevel model is proposed to characterize the isocratic retention time data collected for 1026 heterogeneous analytes. The QSRR considers the effects of the molecular mass and 100 functional groups (substituents) on analyte-specific chromatographic parameters of the Neue model (i.e., the retention factor in water, the retention factor in acetonitrile, and the curvature coefficient). A Bayesian multilevel regression model was used to smooth noisy parameter estimates with too few data and to consider the uncertainties in the model parameters. We discuss the benefits of the Bayesian multilevel model (i) to understand chromatographic data, (ii) to quantify the effect of functional groups on chromatographic retention, and (iii) to predict analyte retention based on various types of preliminary data. The uncertainty of isocratic and gradient predictions was visualized using uncertainty chromatograms and discussed in terms of usefulness in decision making. We think that this method will provide the most benefit in providing a unified scheme for analyzing large chromatographic databases and assessing the impact of functional groups and other descriptors on analyte retention.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available