Combining statistical and semantic approaches to the translation of ontologies and taxonomies

McCrae, J. and Montiel-Ponsoda, Elena and Aguado de Cea, G. and Espinoza Mejía, Mauricio and Cimiano, Philipp (2011). Combining statistical and semantic approaches to the translation of ontologies and taxonomies. In: "SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation", 23/06/2011 - 23/06/2011, Portland, Oregon, USA. ISBN 978-1-932432-99-2. pp. 116-125.

Description

Title: Combining statistical and semantic approaches to the translation of ontologies and taxonomies
Author/s:
  • McCrae, J.
  • Montiel-Ponsoda, Elena
  • Aguado de Cea, G.
  • Espinoza Mejía, Mauricio
  • Cimiano, Philipp
Item Type: Presentation at Congress or Conference (Article)
Event Title: SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation
Event Dates: 23/06/2011 - 23/06/2011
Event Location: Portland, Oregon, USA
Title of Book: Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation
Date: 2011
ISBN: 978-1-932432-99-2
Subjects:
Faculty: Facultad de Informática (UPM)
Department: Lingüistica Aplicada a la Ciencia y a la Tecnología
Creative Commons Licenses: Recognition - No derivative works - Non commercial

Full text

[img]
Preview
PDF - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (186kB) | Preview

Abstract

Ontologies and taxonomies are widely used to organize concepts providing the basis for activities such as indexing, and as background knowledge for NLP tasks. As such, translation of these resources would prove useful to adapt these systems to new languages. However, we show that the nature of these resources is significantly different from the "free-text" paradigm used to train most statistical machine translation systems. In particular, we see significant differences in the linguistic nature of these resources and such resources have rich additional semantics. We demonstrate that as a result of these linguistic differences, standard SMT methods, in particular evaluation metrics, can produce poor performance. We then look to the task of leveraging these semantics for translation, which we approach in three ways: by adapting the translation system to the domain of the resource; by examining if semantics can help to predict the syntactic structure used in translation; and by evaluating if we can use existing translated taxonomies to disambiguate translations. We present some early results from these experiments, which shed light on the degree of success we may have with each approach

More information

Item ID: 13368
DC Identifier: http://oa.upm.es/13368/
OAI Identifier: oai:oa.upm.es:13368
Official URL: http://dl.acm.org/citation.cfm?id=2024274
Deposited by: Memoria Investigacion
Deposited on: 24 Oct 2012 10:15
Last Modified: 21 Apr 2016 12:41
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM