Boosting Knowledge Graph Generation from Tabular Data with RML Views

Arenas-Guerrero, Julián, Alobaid, Ahmad ORCID: https://orcid.org/0000-0001-8637-6313, Navas Loro, María ORCID: https://orcid.org/0000-0003-1011-5023, Pérez Hernández, María S. and Corcho, Oscar (2023). Boosting Knowledge Graph Generation from Tabular Data with RML Views. In: "Extended Semantic Web Conference", 2023, Hersonissos.

Description

Title: Boosting Knowledge Graph Generation from Tabular Data with RML Views
Author/s:
  • Arenas-Guerrero, Julián
  • Alobaid, Ahmad https://orcid.org/0000-0001-8637-6313
  • Navas Loro, María https://orcid.org/0000-0003-1011-5023
  • Pérez Hernández, María S.
  • Corcho, Oscar
Item Type: Presentation at Congress or Conference (Article)
Event Title: Extended Semantic Web Conference
Event Dates: 2023
Event Location: Hersonissos
Title of Book: Proceedings of the 20th Extended Semantic Web Conference
Date: 2023
Faculty: E.T.S. de Ingenieros Informáticos (UPM)
Department: Arquitectura y Tecnología de Sistemas Informáticos
UPM's Research Group: Ontology Engineering Group OEG
Creative Commons Licenses: Attribution

Abstract

A large amount of data is available in tabular form. RML is commonly used to declare how such data can be transformed into RDF. However, RML has limitations that, in many cases, make additional preprocessing with scripts necessary. Although some proposed extensions (e.g., FnO or RML fields) address some of these limitations, they are verbose, unfamiliar to most data engineers, and implemented in systems that do not scale when large volumes of data need to be processed. In this work, we extend RML views to tabular sources in order to address these limitations of the mapping language. In this way, transformation functions, complex joins, or mixed syntax can be defined directly in SQL queries. We present our extension of Morph-KGC to efficiently support RML views for tabular sources. We validate our implementation by adapting R2RML test cases with views and compare it against state-of-the-art RML+FnO systems, showing that our system is significantly more scalable. Moreover, we present specific examples of a real use case in the public procurement domain where basic RML mappings could not be used without additional preprocessing.
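
As a rough illustration of how a transformation function and a join can be defined directly in a SQL query over tabular sources, the sketch below loads two small CSV tables into an in-memory SQLite database and runs a view-like query over them. The table names, columns, data, and the example.org namespace are all hypothetical, and SQLite merely stands in for a SQL engine; this is not Morph-KGC's implementation or API, nor RML mapping syntax.

```python
import csv
import io
import sqlite3

# Hypothetical CSV sources, invented for illustration only.
TENDERS_CSV = "id,title,buyer_id\n1, road works ,b1\n2,school meals,b2\n"
BUYERS_CSV = "buyer_id,name\nb1,City Council\nb2,Regional Board\n"


def load_csv(conn, table, text):
    """Load CSV text into a new SQLite table whose columns are all TEXT."""
    rows = list(csv.reader(io.StringIO(text)))
    header, data = rows[0], rows[1:]
    cols = ", ".join(f'"{c}" TEXT' for c in header)
    conn.execute(f'CREATE TABLE "{table}" ({cols})')
    marks = ", ".join("?" for _ in header)
    conn.executemany(f'INSERT INTO "{table}" VALUES ({marks})', data)


def run_view():
    """Run a view-like query: clean a column and join the two tables."""
    conn = sqlite3.connect(":memory:")
    load_csv(conn, "tenders", TENDERS_CSV)
    load_csv(conn, "buyers", BUYERS_CSV)
    query = """
        SELECT t.id AS tender_id,
               UPPER(TRIM(t.title)) AS title,   -- transformation in SQL
               b.name AS buyer_name
        FROM tenders AS t
        JOIN buyers AS b ON t.buyer_id = b.buyer_id  -- join in SQL
        ORDER BY t.id
    """
    result = conn.execute(query).fetchall()
    conn.close()
    return result


def to_triples(rows):
    """Render each row as an N-Triples-style line (hypothetical namespace)."""
    base = "http://example.org/"
    return [
        f'<{base}tender/{tid}> <{base}title> "{title}" .'
        for tid, title, _buyer in rows
    ]


if __name__ == "__main__":
    for line in to_triples(run_view()):
        print(line)
```

The cleaning step and the join live in the query itself, so no separate preprocessing script is needed before the rows are turned into triples.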

More information

Item ID: 73463
DC Identifier: https://oa.upm.es/73463/
OAI Identifier: oai:oa.upm.es:73463
Deposited by: Julián Arenas-Guerrero
Deposited on: 21 Apr 2023 09:42
Last Modified: 21 Apr 2023 09:42