4.4 Article

A quantitative structure-activity relationship (QSAR) study on glycan array data to determine the specificities of glycan-binding proteins

Journal

GLYCOBIOLOGY
Volume 22, Issue 4, Pages 552-560

Publisher

OXFORD UNIV PRESS INC
DOI: 10.1093/glycob/cwr163

Keywords

glycan array; PLS; QSAR

Funding

  1. Institute for Modeling and Simulation Applications at Clemson University
  2. NSF [0960586, EPS-0903787, RC1AI086830]
  3. Office Of The Director
  4. EPSCoR [903787] Funding Source: National Science Foundation

Ask authors/readers for more resources

Advances in glycan array technology have provided opportunities to automatically and systematically characterize the binding specificities of glycan-binding proteins. However, there is still a lack of robust methods for such analyses. In this study, we developed a novel quantitative structure-activity relationship (QSAR) method to analyze glycan array data. We first decomposed glycan chains into mono-, di-, tri- or tetrasaccharide subtrees. The bond information was incorporated into subtrees to help distinguish glycan chain structures. Then, we performed partial least-squares (PLS) regression on glycan array data using the subtrees as features. The application of QSAR to the glycan array data of different glycan-binding proteins demonstrated that PLS regression using subtree features can obtain higher R-2 values and a higher percentage of variance explained in glycan array intensities. Based on the regression coefficients of PLS, we were able to effectively identify subtrees that indicate the binding specificities of a glycan-binding protein. Our approach will facilitate the glycan-binding specificity analysis using the glycan array. A user-friendly web tool of the QSAR method is available at http://bci.clemson.edu/tools/glycan_array.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available