Full text
|
PDF
- Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (987kB) | Preview |
Benjumeda Barquita, Marco Alberto and Luengo Sánchez, Sergio and Larrañaga Múgica, Pedro María and Bielza Lozoya, María Concepción (2019). Tractable learning of Bayesian networks from partially observed data. "Pattern Recognition", v. 91 ; pp. 190-199. ISSN 0031-3203. https://doi.org/10.1016/j.patcog.2019.02.025.
Title: | Tractable learning of Bayesian networks from partially observed data |
---|---|
Author/s: |
|
Item Type: | Article |
Título de Revista/Publicación: | Pattern Recognition |
Date: | July 2019 |
ISSN: | 0031-3203 |
Volume: | 91 |
Subjects: | |
Freetext Keywords: | Structural expectation-maximization; Bayesian network; Incomplete data; Inference complexity; Structure learning |
Faculty: | E.T.S. de Ingenieros Informáticos (UPM) |
Department: | Inteligencia Artificial |
Creative Commons Licenses: | Recognition - No derivative works - Non commercial |
|
PDF
- Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (987kB) | Preview |
The majority of real-world problems require addressing incomplete data. The use of the structural expectation-maximization algorithm is the most common approach toward learning Bayesian networks from incomplete datasets. However, its main limitation is its demanding computational cost, caused mainly by the need to make an inference at each iteration of the algorithm. In this paper, we propose a new method with the purpose of guaranteeing the efficiency of the learning process while improving the performance of the structural expectation-maximization algorithm. We address the first objective by applying an upper bound to the treewidth of the models to limit the complexity of the inference. To achieve this, we use an efficient heuristic to search the space of the elimination orders. For the second objective, we study the advantages of directly computing the score with respect to the observed data rather than an expectation of the score, and provide a strategy to efficiently perform these computations in the proposed method. We perform exhaustive experiments on synthetic and real-world datasets of varied dimensionalities, including datasets with thousands of variables and hundreds of thousands of instances. The experimental results support our claims empirically.
Type | Code | Acronym | Leader | Title |
---|---|---|---|---|
Government of Spain | C080020-09 | Unspecified | Unspecified | The Cajal Blue Brain |
Government of Spain | TIN2016-79684-P | Unspecified | Universidad Politécnica de Madrid | Avances en clasificación multidimensional y detección de anomalías con redes bayesianas |
Madrid Regional Government | S2013/ICE-2845 | CASI-CAM | Unspecified | Conceptos y aplicaciones de los sistemas inteligentes |
Horizon 2020 | 785907 | HBP SGA2 | École Polytechnique Fédérale de Lausanne | Human Brain Project Specific Grant Agreement 2 |
Government of Spain | BES-2014-068637 | Unspecified | Universidad Politécnica de Madrid | Predoctoral contract for the formation of doctors |
Item ID: | 63470 |
---|---|
DC Identifier: | https://oa.upm.es/63470/ |
OAI Identifier: | oai:oa.upm.es:63470 |
DOI: | 10.1016/j.patcog.2019.02.025 |
Official URL: | https://www.sciencedirect.com/science/article/abs/pii/S0031320319300974?via%3Dihub |
Deposited by: | Memoria Investigacion |
Deposited on: | 27 Oct 2020 07:59 |
Last Modified: | 27 Oct 2020 07:59 |