Integrating WordNet and Wiktionary with lemon

McCrae, J. and Montiel-Ponsoda, Elena and Cimiano, Philipp (2012). Integrating WordNet and Wiktionary with lemon. In: "Linked Data in Linguistics". Springer Berlin Heidelberg, pp. 25-34. ISBN 978-3-642-28248-5. https://doi.org/10.1007/978-3-642-28249-2_3.

Description

Title: Integrating WordNet and Wiktionary with lemon
Author/s:
  • McCrae, J.
  • Montiel-Ponsoda, Elena
  • Cimiano, Philipp
Editor/s:
  • Chiarcos, Christian
  • Nordhoff, Sebastian
  • Hellmann, Sebastian
Item Type: Book Section
Title of Book: Linked Data in Linguistics
Date: 2012
ISBN: 978-3-642-28248-5
Faculty: Facultad de Informática (UPM)
Department: Inteligencia Artificial
UPM's Research Group: oeg
Creative Commons Licenses: None

Abstract

A significant quantity of linguistic data is now available on the Web. However, linguistic resources are often published in proprietary formats, which makes it difficult for them to interface with one another, so they end up confined in “data silos”. The creation of web standards for publishing data on the Web, together with projects to create Linked Data, has led to interest in creating resources that can be published using Web principles. One of the most important aspects of “Lexical Linked Data” is the sharing of lexica and machine-readable dictionaries. It is for this reason that the lemon format has been proposed, which we briefly describe. We then consider two resources that seem ideal candidates for the Linked Data cloud, namely WordNet 3.0 and Wiktionary, a large document-based dictionary. We discuss the challenges of converting both resources to lemon and, in particular for Wiktionary, the challenges of processing the mark-up and handling inconsistencies and underspecification in the source material. Finally, we turn to the task of creating links between the two resources and present a novel algorithm for linking lexica as lexical Linked Data.

More information

Item ID: 21448
DC Identifier: https://oa.upm.es/21448/
OAI Identifier: oai:oa.upm.es:21448
DOI: 10.1007/978-3-642-28249-2_3
Deposited by: Dr Oscar Corcho
Deposited on: 28 Oct 2013 11:38
Last Modified: 21 Apr 2016 12:03