Annotador: a temporal tagger for Spanish

Navas Loro, María ORCID: https://orcid.org/0000-0003-1011-5023 and Rodríguez Doncel, Víctor ORCID: https://orcid.org/0000-0003-1076-2511 (2020). Annotador: a temporal tagger for Spanish. "Journal of Intelligent & Fuzzy Systems", v. 39 (n. 2); pp. 1979-1991. ISSN 1875-8967. https://doi.org/10.3233/JIFS-179865.

Descripción

Título: Annotador: a temporal tagger for Spanish
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: Journal of Intelligent & Fuzzy Systems
Fecha: 29 Junio 2020
ISSN: 1875-8967
Volumen: 39
Número: 2
Materias:
ODS:
Palabras Clave Informales: Time expression, Temporal tagger, Spanish language, NLP
Escuela: E.T.S. de Ingenieros Informáticos (UPM)
Departamento: Inteligencia Artificial
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[thumbnail of 913-1.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (508kB)

Resumen

Temporal information is crucial in knowledge extraction. Being able to locate events in a timeline is necessary to understand the narrative behind every text. To this aim, several temporal taggers have been proposed in literature –nevertheless, not all languages received the same attention. Most taggers work only for English texts, and not many have been developed for other languages. Also the scarcity of annotated corpora in other languages notably hinders the task. In this paper we present a new rule-based tagger called Annotador (Añotador in Spanish) able to process texts both in Spanish and English. Furthermore, a new corpus with more than 300 short texts containing common temporal expressions, called the HourGlass corpus, has been built in order to test it and to facilitate the development of new resources and tools. Professionals from different domains intervened in the gathering of the text, making it heterogeneous and easy to use thanks to the tags added to each entry. Finally, we analyzed main challenges in the time expression extraction task.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Horizonte 2020
780602
LYNX
Sin especificar
Legal Knowledge Graph for Multilingual Compliance Services

Más información

ID de Registro: 93339
Identificador DC: https://oa.upm.es/93339/
Identificador OAI: oai:oa.upm.es:93339
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/8679931
Identificador DOI: 10.3233/JIFS-179865
URL Oficial: https://journals.sagepub.com/doi/abs/10.3233/JIFS-...
Depositado por: María Navas Loro
Depositado el: 23 Ene 2026 14:38
Ultima Modificación: 23 Ene 2026 14:38