☆ 4.2 Article

Fuzzy rule based classification systems for big data with MapReduce: granularity analysis

ADVANCES IN DATA ANALYSIS AND CLASSIFICATION (2017)

期刊

ADVANCES IN DATA ANALYSIS AND CLASSIFICATION

卷 11, 期 4, 页码 711-730

出版社

SPRINGER HEIDELBERG

DOI: 10.1007/s11634-016-0260-z

关键词

Big data; Fuzzy rule based classification systems; Granularity; MapReduce; Hadoop

类别

Statistics & Probability

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Due to the vast amount of information available nowadays, and the advantages related to the processing of this data, the topics of big data and data science have acquired a great importance in the current research. Big data applications are mainly about scalability, which can be achieved via the MapReduce programming model.It is designed to divide the data into several chunks or groups that are processed in parallel, and whose result is assembled to provide a single solution. Among different classification paradigms adapted to this new framework, fuzzy rule based classification systems have shown interesting results with a MapReduce approach for big data. It is well known that the performance of these types of systems has a strong dependence on the selection of a good granularity level for the Data Base. However, in the context of MapReduce this parameter is even harder to determine as it can be also related with the number of Maps chosen for the processing stage. In this paper, we aim at analyzing the interrelation between the number of labels of the fuzzy variables and the scarcity of the data due to the data sampling in MapReduce. Specifically, we consider that as the partitioning of the initial instance set grows, the level of granularity necessary to achieve a good performance also becomes higher. The experimental results, carried out for several Big Data problems, and using the Chi-FRBCS-BigData algorithms, support our claims.

Fuzzy rule based classification systems for big data with MapReduce: granularity analysis

期刊

ADVANCES IN DATA ANALYSIS AND CLASSIFICATION

出版社

SPRINGER HEIDELBERG

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Fuzzy rule based classification systems for big data with MapReduce: granularity analysis

期刊

ADVANCES IN DATA ANALYSIS AND CLASSIFICATION

出版社

SPRINGER HEIDELBERG

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文