Multiple factor analysis and clustering of a mixture of quantitative, categorical and frequency data

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2008)

Journal

COMPUTATIONAL STATISTICS & DATA ANALYSIS

Volume 52, Issue 6, Pages 3255-3268

Publisher

ELSEVIER SCIENCE BV

DOI: 10.1016/j.csda.2007.09.023

Keywords

mixed data; textual data; distance; multiple factor analysis; multiple factor analysis for contingency tables; clustering; survey

Categories

Computer Science, Interdisciplinary Applications; Statistics & Probability

Abstract

Analysing and clustering units described by a mixture of sets of quantitative, categorical and frequency variables is a relevant challenge. Multiple factor analysis is extended to include these three types of variables in order to balance the influence of the different sets when a global distance between units is computed. Suitable coding is adopted to keep as close as possible to the approach offered by principal axes methods, that is, principal component analysis for quantitative sets, multiple correspondence analysis for categorical sets and correspondence analysis for frequency sets. In addition, the presence of frequency sets poses the problem of selecting the unit weighting, since this is fixed by the user (usually uniform) in principal component analysis and multiple correspondence analysis, but imposed by the table margin in correspondence analysis. The method's main steps are presented and illustrated by an example extracted from a survey that aimed to cluster respondents to a questionnaire that included both closed and open-ended questions. (c) 2007 Elsevier B.V. All rights reserved.
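The balancing idea mentioned in the abstract can be illustrated with a minimal sketch, restricted to quantitative sets only. It relies on the standard multiple factor analysis weighting, in which each set is divided by the square root of the first eigenvalue of its own principal component analysis, and it assumes uniform unit weights. The function names `mfa_weighted_table` and `global_distances` are hypothetical, and the coding of categorical and frequency sets described in the paper is not reproduced here.

```python
import numpy as np


def mfa_weighted_table(groups):
    """Concatenate quantitative column sets after MFA-style balancing.

    Each set is centred and scaled to unit variance, then divided by the
    square root of the first eigenvalue of its own PCA, so that the leading
    axis of every set carries inertia 1 in the global table.
    Assumes uniform unit weights (1/n) and no constant columns.
    """
    blocks = []
    for X in groups:
        X = np.asarray(X, dtype=float)
        Z = (X - X.mean(axis=0)) / X.std(axis=0)
        # Squared singular values of Z / sqrt(n) are the PCA eigenvalues.
        s = np.linalg.svd(Z / np.sqrt(len(Z)), compute_uv=False)
        lambda1 = s[0] ** 2
        blocks.append(Z / np.sqrt(lambda1))
    return np.hstack(blocks)


def global_distances(groups):
    """Euclidean distances between units in the balanced global table."""
    T = mfa_weighted_table(groups)
    diff = T[:, None, :] - T[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


# Illustrative data only: two sets with very different numbers of columns.
rng = np.random.default_rng(0)
A = rng.normal(size=(6, 2))
B = rng.normal(size=(6, 5))
T = mfa_weighted_table([A, B])
D = global_distances([A, B])
```

Without the division by the first eigenvalue, the five-column set would tend to dominate the distances simply because it contributes more columns; after weighting, neither set can generate the first global axis on its own.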
