4.7 Article

Semiparametric Bayesian networks

Journal

INFORMATION SCIENCES
Volume 584, Issue -, Pages 564-582

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2021.10.074

Keywords

Bayesian networks; Kernel density estimation; Semiparametric model; Continuous data

Funding

  1. Spanish Ministry of Education, Culture and Sport [FPU16/00921]
  2. Spanish Ministry of Science and Innovation [PID2019-109247GB-I00, RTC2019-006871-7]
  3. BBVA Foundation
  4. BB

Ask authors/readers for more resources

Semiparametric Bayesian networks combine parametric and nonparametric conditional probability distributions to incorporate the advantages of both components. By considering different types of conditional probability distributions and modifying learning algorithms, the proposed approach achieves comparable performance to state-of-the-art methods.
We introduce semiparametric Bayesian networks that combine parametric and nonparametric conditional probability distributions. Their aim is to incorporate the advantages of both components: the bounded complexity of parametric models and the flexibility of nonparametric ones. We demonstrate that semiparametric Bayesian networks generalize two well-known types of Bayesian networks: Gaussian Bayesian networks and kernel density estimation Bayesian networks. For this purpose, we consider two different conditional probability distributions required in a semiparametric Bayesian network. In addition, we present modifications of two well-known algorithms (greedy hill-climbing and PC) to learn the structure of a semiparametric Bayesian network from data. To realize this, we employ a score function based on cross-validation. In addition, using a validation dataset, we apply an early-stopping criterion to avoid overfitting. To evaluate the applicability of the proposed algorithm, we conduct an exhaustive experiment on synthetic data sampled by mixing linear and nonlinear functions, multivariate normal data sampled from Gaussian Bayesian networks, real data from the UCI repository, and bearings degradation data. As a result of this experiment, we conclude that the proposed algorithm accurately learns the combination of parametric and nonparametric components, while achieving a performance comparable with those provided by state-of-the-art methods. (c) 2021 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available