Article

Sample-Based Attribute Selective AnDE for Large Data

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2017)

Journal

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

Volume 29, Issue 1, Pages 172-185

Publisher

IEEE COMPUTER SOC

DOI: 10.1109/TKDE.2016.2608881

Keywords

Bayesian network classifiers; large data; classification learning; attribute selection; averaged n-dependence estimators (AnDE); leave-one-out cross validation

Categories

Computer Science, Artificial Intelligence; Computer Science, Information Systems; Engineering, Electrical & Electronic

Funding

Australian Research Council [DP140100087]
Asian Office of Aerospace Research and Development, Air Force Office of Scientific Research [FA2386-15-1-4007]
National Natural Science Foundation of China [61202135]
Natural Science Foundation of Jiangsu, China [BK20130735]
Natural Science Foundation of Jiangsu Higher Education Institutions of China [14KJB520019, 13KJB520011, 13KJB520013]
Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University
Priority Academic Program Development of Jiangsu Higher Education Institutions
Monash e-Research Center
eSolutions-Research Support Services
Australian Commonwealth Government

Abstract

More and more applications have come with large data sets in the past decade. However, existing algorithms are not guaranteed to scale well on large data. Averaged n-Dependence Estimators (AnDE) allows for flexible learning from out-of-core data by varying the value of n (the number of super parents). Hence, AnDE is especially appropriate for learning from large data. In this paper, we propose a sample-based attribute selection technique for AnDE. It needs one more pass through the training data, in which a multitude of approximate AnDE models are built and efficiently assessed by leave-one-out cross validation. The use of a sample reduces the training time. Experiments on 15 large data sets demonstrate that the proposed technique significantly reduces AnDE's error at the cost of a modest increase in training time. This efficient and scalable out-of-core approach delivers superior or comparable performance to typical in-core Bayesian network classifiers.

Averaged tree-augmented one-dependence estimators

He Kong, Xiaohu Shi, Limin Wang, Yang Liu, Musa Mammadov, Gaojie Wang

Summary: The paper proposes a novel approach, averaged tree-augmented one-dependence estimators (ATODE), which relaxes the independence assumption of AODE by exploring higher-order conditional dependencies between attributes. Experimental results on 36 datasets demonstrate that the proposed approach can achieve competitive or better classification performance compared to state-of-the-art learners.

APPLIED INTELLIGENCE (2021)