Fuzzy Semantic Labeling of Semi-structured Numerical Datasets

Alobaid, Ahmad

and Corcho, Oscar

(2018). Fuzzy Semantic Labeling of Semi-structured Numerical Datasets. En: "21st International Conference on Knowledge Engineering and Knowledge Management", 12-16 Nov 2018, Nancy, France. ISBN 978-3-030-03667-6. pp. 19-33. https://doi.org/10.1007/978-3-030-03667-6_2.

Descripción

Título:	Fuzzy Semantic Labeling of Semi-structured Numerical Datasets
Autor/es:	Alobaid, Ahmad https://orcid.org/0000-0001-8637-6313 Corcho, Oscar https://orcid.org/0000-0002-9260-0753
Tipo de Documento:	Ponencia en Congreso o Jornada (Artículo)
Título del Evento:	21st International Conference on Knowledge Engineering and Knowledge Management
Fechas del Evento:	12-16 Nov 2018
Lugar del Evento:	Nancy, France
Título del Libro:	Knowledge Engineering and Knowledge Management
Fecha:	2018
ISBN:	978-3-030-03667-6
Volumen:	11313
Materias:	Informática
ODS:	09. Industria, innovación e infraestructura
Palabras Clave Informales:	Fuzzy clustering Semantic labeling Semantic web
Escuela:	E.T.S. de Ingenieros Informáticos (UPM)
Departamento:	Inteligencia Artificial
Grupo Investigación UPM:	Ontology Engineering Group – OEG
Licencias Creative Commons:	Ninguna

Versiones

Esta es la última versión de este eprint.

Fuzzy Semantic Labeling of Semi-structured Numerical Datasets. (deposited 02 Sep 2019 05:39)
- Fuzzy Semantic Labeling of Semi-structured Numerical Datasets. (deposited 05 Sep 2019 08:26) [Mostrada Ahora]

Texto completo

[thumbnail of fuzzy_semantic_labeling_self_archive.pdf]

Vista Previa

PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (286kB) | Vista Previa

Resumen

SPARQL endpoints provide access to rich sources of data (e.g. knowledge graphs), which can be used to classify other less structured datasets (e.g. CSV files or HTML tables on the Web). We propose an approach to suggest types for the numerical columns of a collection of input files available as CSVs. Our approach is based on the application of the fuzzy c-means clustering technique to numerical data in the input files, using existing SPARQL endpoints to generate training datasets. Our approach has three major advantages: it works directly with live knowledge graphs, it does not require knowledge-graph profiling beforehand, and it avoids tedious and costly manual training to match values with types. We evaluate our approach against manually annotated datasets. The results show that the proposed approach classifies most of the types correctly for our test sets.

Más información

ID de Registro:	56289
Identificador DC:	https://oa.upm.es/56289/
Identificador OAI:	oai:oa.upm.es:56289
Identificador DOI:	10.1007/978-3-030-03667-6_2
URL Oficial:	https://link.springer.com/chapter/10.1007/978-3-03...
Depositado por:	Ahmad Alobaid
Depositado el:	05 Sep 2019 08:26
Ultima Modificación:	28 Abr 2026 10:03

Estadísticas

Exportar cita

Editar (sólo personal del Archivo)

En esta página

Menú principal

Buscar

Fuzzy Semantic Labeling of Semi-structured Numerical Datasets

Cita

Descripción

Versiones

Texto completo

Resumen

Más información

Acciones

Metrics

Altmetrics probando

Dimensions

Documentos

El repositorio

Agrupados por ...

Datos Investigación

Financiadores

Especiales

En otros formatos

Redes sociales

Información adicional