A comparison of clustering quality indices using outliers and noise

Guerra Velasco, Luis Pelayo, Robles Forcada, Víctor ORCID: https://orcid.org/0000-0003-3937-2269, Bielza Lozoya, María Concepción ORCID: https://orcid.org/0000-0001-7109-2668 and Larrañaga Múgica, Pedro María ORCID: https://orcid.org/0000-0003-0652-9872 (2012). A comparison of clustering quality indices using outliers and noise. "Intelligent Data Analysis", v. 16 (n. 4); pp. 703-715. ISSN 1571-4128. https://doi.org/10.3233/ida-2012-0545.

Descripción

Título: A comparison of clustering quality indices using outliers and noise
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: Intelligent Data Analysis
Fecha: Julio 2012
ISSN: 1571-4128
Volumen: 16
Número: 4
Materias:
ODS:
Palabras Clave Informales: Clustering, Internal indices, Stopping rules
Escuela: Facultad de Informática (UPM) [antigua denominación]
Departamento: Inteligencia Artificial
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[thumbnail of LARRANAGA_2012_02_1.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (604kB)

Resumen

Quality indices in clustering are used not only to assess the quality of the partitions but also to determine the number of clusters in the final result. When these indices are evaluated in a case study, real data conditions or different clustering algorithms are seldom taken in to account. Here, some of the standard indices used in the literature are compared using more realistic databases that include outliers or noisy dimensions, which is more like a real problem-solving approach. Besides, three different clustering methods are used in an attempt to identify different behaviours. Also, the performance of the quality index-clustering algorithm tandem is compared to random grouping, with the aim of running an additional check. The indices are ranked, and index-based conclusions are drawn for all the scenarios.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Gobierno de España
TIN2010- 20900-C04-04
Sin especificar
Sin especificar
Sin especificar
Gobierno de España
2010-CSD2007-00018
Sin especificar
Sin especificar
Sin especificar

Más información

ID de Registro: 72861
Identificador DC: https://oa.upm.es/72861/
Identificador OAI: oai:oa.upm.es:72861
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/9170938
Identificador DOI: 10.3233/ida-2012-0545
URL Oficial: https://content.iospress.com/articles/intelligent-...
Depositado por: Biblioteca Facultad de Informatica
Depositado el: 21 Mar 2023 10:57
Ultima Modificación: 12 Nov 2025 00:00