Texto completo
Vista Previa |
PDF (Portable Document Format)
- Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (286kB) | Vista Previa |
ORCID: https://orcid.org/0000-0001-8637-6313 and Corcho, Oscar
ORCID: https://orcid.org/0000-0002-9260-0753
(2018).
Fuzzy Semantic Labeling of Semi-structured Numerical Datasets.
En: "21st International Conference on Knowledge Engineering and Knowledge Management", 12-16 Nov 2018, Nancy, France. ISBN 978-3-030-03667-6. pp. 19-33.
https://doi.org/10.1007/978-3-030-03667-6_2.
| Título: | Fuzzy Semantic Labeling of Semi-structured Numerical Datasets |
|---|---|
| Autor/es: |
|
| Tipo de Documento: | Ponencia en Congreso o Jornada (Artículo) |
| Título del Evento: | 21st International Conference on Knowledge Engineering and Knowledge Management |
| Fechas del Evento: | 12-16 Nov 2018 |
| Lugar del Evento: | Nancy, France |
| Título del Libro: | Knowledge Engineering and Knowledge Management |
| Fecha: | 2018 |
| ISBN: | 978-3-030-03667-6 |
| Volumen: | 11313 |
| Materias: | |
| ODS: | |
| Palabras Clave Informales: | Fuzzy clustering Semantic labeling Semantic web |
| Escuela: | E.T.S. de Ingenieros Informáticos (UPM) |
| Departamento: | Inteligencia Artificial |
| Grupo Investigación UPM: | Ontology Engineering Group – OEG |
| Licencias Creative Commons: | Ninguna |
Esta es la última versión de este eprint.
Vista Previa |
PDF (Portable Document Format)
- Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (286kB) | Vista Previa |
SPARQL endpoints provide access to rich sources of data (e.g. knowledge graphs), which can be used to classify other less structured datasets (e.g. CSV files or HTML tables on the Web). We propose an approach to suggest types for the numerical columns of a collection of input files available as CSVs. Our approach is based on the application of the fuzzy c-means clustering technique to numerical data in the input files, using existing SPARQL endpoints to generate training datasets. Our approach has three major advantages: it works directly with live knowledge graphs, it does not require knowledge-graph profiling beforehand, and it avoids tedious and costly manual training to match values with types. We evaluate our approach against manually annotated datasets. The results show that the proposed approach classifies most of the types correctly for our test sets.
| ID de Registro: | 56289 |
|---|---|
| Identificador DC: | https://oa.upm.es/56289/ |
| Identificador OAI: | oai:oa.upm.es:56289 |
| Identificador DOI: | 10.1007/978-3-030-03667-6_2 |
| URL Oficial: | https://link.springer.com/chapter/10.1007/978-3-03... |
| Depositado por: | Ahmad Alobaid |
| Depositado el: | 05 Sep 2019 08:26 |
| Ultima Modificación: | 28 Abr 2026 10:03 |
Publicar en el Archivo Digital desde el Portal Científico