Creation and enrichment of a Terminological Knowledge Graph in the legal domain

Martín Chozas, Patricia (2019). Creation and enrichment of a Terminological Knowledge Graph in the legal domain. In: "Proceedings of the Seventh JURIX 2019 Doctoral Consortium", 11 Dic 2019, Madrid, España. pp. 1-11.

Description

Title: Creation and enrichment of a Terminological Knowledge Graph in the legal domain
Author/s:
  • Martín Chozas, Patricia
Item Type: Presentation at Congress or Conference (Article)
Event Title: Proceedings of the Seventh JURIX 2019 Doctoral Consortium
Event Dates: 11 Dic 2019
Event Location: Madrid, España
Title of Book: DC JURIX 2019: Seventh Doctoral Consortium of JURIX 2019
Date: 2019
Volume: 2598
Subjects:
Freetext Keywords: Terminology Management, Linguistic Linked Data, Knowledge Graphs, Semantic Web
Faculty: E.T.S. de Ingenieros Informáticos (UPM)
Department: Lingüistica Aplicada a la Ciencia y a la Tecnología
Creative Commons Licenses: Recognition - No derivative works - Non commercial

Full text

[img]
Preview
PDF - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (689kB) | Preview

Abstract

Domain-specific terminologies are of great use in a number of contexts, such as information retrieval from text documents or supporting humans in translation tasks. However, automated terminology extraction tools usually render plain lists with no additional information (hierarchical relations, definitions or examples of use, amongst others). The output of these tools is very often offered in non-open formats, hampering their reuse and interoperability. Moreover, terminology management tools demand a lot of manual work to curate and enrich the resources and they do not support the representation of terminological relations beyond broader/narrower. The contributions of this Thesis mitigate these problems by automating the creation of rich terminologies from plain text documents, by establishing links to external resources, and by adopting the W3C standards for the Semantic Web. The proposed method comprises six tasks: refinement, disambiguation, enrichment, relation validation, relation extraction and RDF conversion. We have applied this methodology to two different legal corpora, i.e., contracts and collective agreements. The result of this methodology will be a Terminological Knowledge Graph that can be exploited by different Natural Language Processing applications.

Funding Projects

TypeCodeAcronymLeaderTitle
Horizon 2020825182Pret-a-LLODNational University of Ireland GalwayReady-to-use multilingual linked language data for knowledge services across sectors
Horizon 2020780602LynxUniversidad Politécnica de MadridBuilding the legal knowledge graph for smart compliance services in multilingual Europe

More information

Item ID: 67256
DC Identifier: http://oa.upm.es/67256/
OAI Identifier: oai:oa.upm.es:67256
Official URL: http://ceur-ws.org/Vol-2598/paper-07.pdf
Deposited by: Memoria Investigacion
Deposited on: 27 May 2021 06:53
Last Modified: 27 May 2021 06:53
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM