Multipartition clustering of mixed data with Bayesian networks

Rodríguez Sánchez, Fernando, Bielza Lozoya, María Concepción ORCID: https://orcid.org/0000-0001-7109-2668 and Larrañaga Múgica, Pedro María ORCID: https://orcid.org/0000-0003-0652-9872 (2022). Multipartition clustering of mixed data with Bayesian networks. "International Journal of Intelligent Systems", v. 37 (n. 2); pp. 2188-2218. ISSN 1098-111X. https://doi.org/10.1002/int.22770.

Descripción

Título: Multipartition clustering of mixed data with Bayesian networks
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: International Journal of Intelligent Systems
Fecha: Marzo 2022
ISSN: 1098-111X
Volumen: 37
Número: 2
Materias:
ODS:
Palabras Clave Informales: Bayesian networks, Mixed data, Model‐based clustering, Multipartition clustering
Escuela: E.T.S. de Ingenieros Informáticos (UPM)
Departamento: Inteligencia Artificial
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[thumbnail of LARRANAGA_2022_12_B.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (9MB)

Resumen

Real‐world applications often involve multifaceted data with several reasonable interpretations. To cluster this data, we need methods that are able to produce multiple clustering solutions. To this purpose, it is interesting to learn a finite mixture model with multiple latent variables, where each latent variable represents a unique way to partition the data. However, although there is an extensive literature on multipartition clustering methods for categorical data and for continuous data, there is a lack of work for mixed data. In this paper, we propose a multipartition clustering method that is able to efficiently deal with mixed data by exploiting the Bayesian network factorization and the variational Bayes framework. We show the flexibility and applicability of the proposed method by solving clustering, density estimation, and missing data imputation tasks in real‐world data sets. For reproducibility, all code, data, and results can be found in the following public repository: https://github.com/ferjorosa/mpc‐mixed.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Gobierno de España
PID2019‐109247GB‐I00
Sin especificar
Sin especificar
Sin especificar
Horizonte 2020
945539
Sin especificar
Sin especificar
Human Brain Project SGA3

Más información

ID de Registro: 72625
Identificador DC: https://oa.upm.es/72625/
Identificador OAI: oai:oa.upm.es:72625
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/9742317
Identificador DOI: 10.1002/int.22770
URL Oficial: https://onlinelibrary.wiley.com/doi/full/10.1002/i...
Depositado por: Biblioteca Facultad de Informatica
Depositado el: 15 Feb 2023 07:08
Ultima Modificación: 03 Mar 2025 09:59