Integrating WordNet and Wiktionary with lemon

McCrae, John P. ORCID: https://orcid.org/0000-0002-7227-1331, Montiel Ponsoda, Elena ORCID: https://orcid.org/0000-0003-3263-3403 and Cimiano, Philipp (2012). Integrating WordNet and Wiktionary with lemon. In: "Linked Data in Linguistics". Springer Berlin Heidelberg, pp. 25-34. ISBN 978-3-642-28248-5. https://doi.org/10.1007/978-3-642-28249-2_3.

Description

Title: Integrating WordNet and Wiktionary with lemon
Author(s):
  • McCrae, John P.
  • Montiel Ponsoda, Elena
  • Cimiano, Philipp
Editor(s):
  • Chiarcos, Christian
  • Nordhoff, Sebastian
  • Hellmann, Sebastian
Document Type: Book Section
Book Title: Linked Data in Linguistics
Date: 2012
ISBN: 978-3-642-28248-5
Subjects:
SDGs:
School: Facultad de Informática (UPM) [former name]
Department: Artificial Intelligence
UPM Research Group: Ontology Engineering Group – OEG
Creative Commons Licenses: None

Full text

PDF (Portable Document Format), 106kB

Abstract

Nowadays, there is a significant quantity of linguistic data available on the Web. However, linguistic resources are often published using proprietary formats and, as such, it can be difficult to make them interface with one another, and they end up confined in “data silos”. The creation of web standards for the publishing of data on the Web and projects to create Linked Data have led to interest in the creation of resources that can be published using Web principles. One of the most important aspects of “Lexical Linked Data” is the sharing of lexica and machine-readable dictionaries. It is for this reason that the lemon format has been proposed, which we briefly describe. We then consider two resources that seem ideal candidates for the Linked Data cloud, namely WordNet 3.0 and Wiktionary, a large document-based dictionary. We discuss the challenges of converting both resources to lemon, and in particular for Wiktionary, the challenge of processing the mark-up and handling inconsistencies and underspecification in the source material. Finally, we turn to the task of creating links between the two resources and present a novel algorithm for linking lexica as lexical Linked Data.

More information

Record ID: 21448
DC Identifier: https://oa.upm.es/21448/
OAI Identifier: oai:oa.upm.es:21448
DOI: 10.1007/978-3-642-28249-2_3
Deposited by: Dr Oscar Corcho
Deposited on: 28 Oct 2013 11:38
Last Modified: 05 May 2026 14:18