Journal
GLYCOBIOLOGY
Volume 22, Issue 4, Pages 552-560Publisher
OXFORD UNIV PRESS INC
DOI: 10.1093/glycob/cwr163
Keywords
glycan array; PLS; QSAR
Categories
Funding
- Institute for Modeling and Simulation Applications at Clemson University
- NSF [0960586, EPS-0903787, RC1AI086830]
- Office Of The Director
- EPSCoR [903787] Funding Source: National Science Foundation
Ask authors/readers for more resources
Advances in glycan array technology have provided opportunities to automatically and systematically characterize the binding specificities of glycan-binding proteins. However, there is still a lack of robust methods for such analyses. In this study, we developed a novel quantitative structure-activity relationship (QSAR) method to analyze glycan array data. We first decomposed glycan chains into mono-, di-, tri- or tetrasaccharide subtrees. The bond information was incorporated into subtrees to help distinguish glycan chain structures. Then, we performed partial least-squares (PLS) regression on glycan array data using the subtrees as features. The application of QSAR to the glycan array data of different glycan-binding proteins demonstrated that PLS regression using subtree features can obtain higher R-2 values and a higher percentage of variance explained in glycan array intensities. Based on the regression coefficients of PLS, we were able to effectively identify subtrees that indicate the binding specificities of a glycan-binding protein. Our approach will facilitate the glycan-binding specificity analysis using the glycan array. A user-friendly web tool of the QSAR method is available at http://bci.clemson.edu/tools/glycan_array.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available