SPARQL-based linking of knowledge graphs

Jiang, Wenqi (2022). SPARQL-based linking of knowledge graphs. Thesis (Master thesis), E.T.S. de Ingenieros Informáticos (UPM).

Description

Title: SPARQL-based linking of knowledge graphs
Author/s:
  • Jiang, Wenqi
Contributor/s:
  • García Castro, Raúl
  • Cimmino Arriaga, Andrea Jesús
Item Type: Thesis (Master thesis)
Master's title: Ciencia de Datos
Date: July 2022
Freetext Keywords: RDF, SPARQL, Linking algorithm, Apache Jena, OAEI
Faculty: E.T.S. de Ingenieros Informáticos (UPM)
Department: Inteligencia Artificial
Creative Commons Licenses: Attribution - No derivative works - Non-commercial

Abstract

Nowadays, both traditional and emerging enterprises rely heavily on Internet data, yet these data are disorganized and scattered across different data sources all over the world. Many of these sources use different data models even though their contents are similar. To use these data more efficiently, it is imperative to link the isolated sources and create a unified view. Link discovery has long been an active problem, and many companies have tried to solve it. However, many of their solutions are not based on a common standard and therefore have a steep learning curve. This thesis focuses on the specification and development of a link discovery proposal based on standard SPARQL, as well as the implementation of several string similarity metrics used for the linking. Specifically, it addresses a Resource Description Framework (RDF) linking task that produces connections between RDF resources from different data sources or from the same one, such as DBpedia, Wikidata and BNE datos. RDF linking is based on one or more link rules that specify the conditions under which two RDF resources can be considered the same, such as text similarity or geographical position. SPARQL Protocol and RDF Query Language (SPARQL) is a standard, and one of the most widely used, query languages and protocols for Linked Open Data on the web and for RDF triplestores. This master's thesis (TFM) therefore presents a SPARQL-based application that links knowledge graphs using Apache Jena, a free and open-source Java framework for building Semantic Web and Linked Data applications, as the SPARQL engine. The thesis applied multiple linking rules and evaluated them on several data sets from the 2021 Ontology Alignment Evaluation Initiative (OAEI) competition, achieving excellent results.
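
To make the idea of a link rule based on text similarity more concrete, the following is a minimal sketch in plain Python. It is not the thesis implementation, which expresses its rules in SPARQL and runs them on Apache Jena. The metric chosen here (normalized Levenshtein similarity), the 0.8 threshold, the example IRIs and labels, and the function names `levenshtein`, `similarity` and `link` are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple


def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insertions, deletions, substitutions)."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Normalized similarity in [0, 1]; 1.0 means identical after lowercasing."""
    a, b = a.strip().lower(), b.strip().lower()
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest


def link(source: Dict[str, str],
         target: Dict[str, str],
         threshold: float = 0.8) -> List[Tuple[str, str, str]]:
    """Hypothetical link rule: two resources are linked when their labels
    are at least `threshold` similar. Inputs map resource IRI -> label."""
    links = []
    for s_iri, s_label in source.items():
        for t_iri, t_label in target.items():
            if similarity(s_label, t_label) >= threshold:
                links.append((s_iri, "owl:sameAs", t_iri))
    return links


# Toy data (assumed, not taken from DBpedia, Wikidata or BNE datos).
source = {
    "http://example.org/a/1": "Miguel de Cervantes",
    "http://example.org/a/2": "Lope de Vega",
}
target = {
    "http://example.org/b/x": "Miguel de Cervantez",
    "http://example.org/b/y": "Francisco de Quevedo",
}

for triple in link(source, target):
    print(triple)
```

The nested loop compares every pair of resources, which is fine for a toy example but grows quadratically with the size of the data sets. In a SPARQL-based setting such as the one the thesis describes, a comparable condition would typically appear as a filter over candidate pairs returned by the query.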

More information

Item ID: 71452
DC Identifier: https://oa.upm.es/71452/
OAI Identifier: oai:oa.upm.es:71452
Deposited by: Biblioteca Facultad de Informatica
Deposited on: 29 Jul 2022 09:38
Last Modified: 29 Jul 2022 09:38