Semiparametric Bayesian networks

Atienza González, David, Bielza Lozoya, María Concepción ORCID: https://orcid.org/0000-0001-7109-2668 and Larrañaga Múgica, Pedro María ORCID: https://orcid.org/0000-0002-1885-4501 (2021). Semiparametric Bayesian networks. "Information Sciences", v. 584 ; pp. 564-582. ISSN 0020-0255. https://doi.org/10.1016/j.ins.2021.10.074.

Description

Title: Semiparametric Bayesian networks
Author/s:
Item Type: Article
Título de Revista/Publicación: Information Sciences
Date: November 2021
ISSN: 0020-0255
Volume: 584
Subjects:
Freetext Keywords: Bayesian networks, Kernel density estimation, Semiparametric model, Continuous data
Faculty: E.T.S. de Ingenieros Informáticos (UPM)
Department: Inteligencia Artificial
Creative Commons Licenses: Recognition - No derivative works - Non commercial

Full text

[thumbnail of LARRANAGA_2022_11.pdf] PDF - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (1MB)

Abstract

We introduce semiparametric Bayesian networks that combine parametric and nonparametric conditional probability distributions. Their aim is to incorporate the advantages of both components: the bounded complexity of parametric models and the flexibility of nonparametric ones. We demonstrate that semiparametric Bayesian networks generalize two well-known types of Bayesian networks: Gaussian Bayesian networks and kernel density estimation Bayesian networks. For this purpose, we consider two different conditional probability distributions required in a semiparametric Bayesian network. In addition, we present modifications of two well-known algorithms (greedy hill-climbing and PC) to learn the structure of a semiparametric Bayesian network from data. To realize this, we employ as core function based on cross-validation. In addition, using a validation dataset, we apply an early-stopping criterion to avoid overfitting. To evaluate the applicability of the proposed algorithm, we conduct an exhaustive experiment on synthetic data sampled by mixing linear and nonlinear functions, multivariate normal data sampled from Gaussian Bayesian networks, real data from the UCI repository, and bearings degradation data. Asa result of this experiment, we conclude that the proposed algorithm accurately learns the combination of parametric and nonparametric components, while achieving a performance comparable with those provided by state-of-the-art methods._ 2021 The Author(s). Published by Elsevier Inc. This is an open access article under the CCBY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Funding Projects

Type
Code
Acronym
Leader
Title
Government of Spain
PID2019-109247GB-I00
Unspecified
Unspecified
Unspecified
Government of Spain
RTC2019-006871-7
Unspecified
Unspecified
Unspecified

More information

Item ID: 72639
DC Identifier: https://oa.upm.es/72639/
OAI Identifier: oai:oa.upm.es:72639
DOI: 10.1016/j.ins.2021.10.074
Official URL: https://www.sciencedirect.com/science/article/pii/...
Deposited by: Biblioteca Facultad de Informatica
Deposited on: 15 Feb 2023 07:19
Last Modified: 15 Feb 2023 11:05
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM