4.6 Article

Multivariate analysis of flow cytometric data using decision trees

Journal

FRONTIERS IN MICROBIOLOGY
Volume 3, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA
DOI: 10.3389/fmicb.2012.00114

Keywords

flow cytometry; cytokines; machine learning; induction of decision trees; imbalanced data; multidimensionality

Categories

Funding

  1. Deutsche Forschungsgemeinschaft (DIG) Priority Program 1160 Colonisation and infection by human-pathogenic fungi.
  2. Deutsche Forschungsgemeinschaft (DIG) Priority Program 1468 Immunobone [KA 755-6/1]

Ask authors/readers for more resources

Characterization of the response of the host immune system is important in understanding the bidirectional interactions between the host and microbial pathogens. For research on the host site, flow cytometry has become one of the major tools in immunology. Advances in technology and reagents allow now the simultaneous assessment of multiple markers on a single cell level generating multidimensional data sets that require multivariate statistical analysis. We explored the explanatory power of the supervised machine learning method called induction of decision trees in flow cytometric data. In order to examine whether the production of a certain cytokine is depended on other cytokines, datasets from intracellular staining for six cytokines with complex patterns of co-expression were analyzed by induction of decision trees. After weighting the data according to their class probabilities, we created a total of 13,392 different decision trees for each given cytokine with different parameter settings. For a more realistic estimation of the decision trees' quality, we used stratified fivefold cross validation and chose the best tree according to a combination of different quality criteria. While some of the decision trees reflected previously known co-expression patterns, we found that the expression of some cytokines was not only dependent on the co-expression of others per se, but was also dependent on the intensity of expression. Thus, for the first time we successfully used induction of decision trees for the analysis of high dimensional flow cytometric data and demonstrated the feasibility of this method to reveal structural patterns in such data sets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available