Report of MIRACLE team for the Ad-Hoc track in CLEF 2006

Goñi Menoyo, José Miguel; González Cristóbal, José Carlos and Villena Román, Julio (2006). Report of MIRACLE team for the Ad-Hoc track in CLEF 2006. In: "7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006", 20/09/2006-22/09/2006, Alicante, Spain. ISBN 2-912335-23-x.

Description

Title: Report of MIRACLE team for the Ad-Hoc track in CLEF 2006
Author(s):
  • Goñi Menoyo, José Miguel
  • González Cristóbal, José Carlos
  • Villena Román, Julio
Document Type: Presentation at Congress or Conference (Article)
Event Title: 7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006
Event Dates: 20/09/2006-22/09/2006
Event Location: Alicante, Spain
Book Title: Working Notes for the CLEF 2006 Workshop
Date: 2006
ISBN: 2-912335-23-x
Subjects:
Freetext Keywords: Linguistic Engineering, Information Retrieval, Trie Indexing
School: E.T.S.I. Telecomunicación (UPM)
Department: Matemática Aplicada a las Tecnologías de la Información [until 2014]
UPM Research Group: Grupo de Sistemas Inteligentes
Creative Commons Licenses: Attribution - No Derivative Works - Non-Commercial

Abstract

This paper presents the MIRACLE team's 2006 approach to the Ad-Hoc Information Retrieval track. The experiments for this campaign continue to test our IR approach. First, a baseline set of runs is obtained using standard components: stemming, transformation, filtering, entity detection and extraction, and others. Then, an extended set of runs is obtained using several types of combinations of these baseline runs. Only a few improvements were introduced for this campaign: we integrated a prototype entity recognition and indexing tool into our tokenizing scheme, and we ran more combination experiments for the robust multilingual case than in previous campaigns. However, no significant improvements were achieved. For this campaign, runs were submitted for the following languages and tracks:
  • Monolingual: Bulgarian, French, Hungarian, and Portuguese.
  • Bilingual: English to Bulgarian, French, Hungarian, and Portuguese; Spanish to French and Portuguese; and French to Portuguese.
  • Robust monolingual: German, English, Spanish, French, Italian, and Dutch.
  • Robust bilingual: English to German, Italian to Spanish, and French to Dutch.
  • Robust multilingual: English to robust monolingual languages.
We still need to work harder to improve some aspects of our processing scheme, the most important of which, to our knowledge, is entity recognition and normalization.
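
The abstract does not say which combination methods were applied to the baseline runs. As a purely illustrative sketch, one common way to combine runs is to min-max normalize each run's scores and average them per document. The function names, run names, and scores below are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def min_max_normalize(run):
    """Rescale one run's scores to [0, 1]; a run with constant scores maps to 1.0."""
    if not run:
        return {}
    lo, hi = min(run.values()), max(run.values())
    if hi == lo:
        return {doc: 1.0 for doc in run}
    return {doc: (score - lo) / (hi - lo) for doc, score in run.items()}


def combine_runs(runs):
    """Average normalized scores across runs.

    A document missing from a run contributes 0 for that run.
    This is an assumed fusion scheme, not the one used by the authors.
    """
    totals = defaultdict(float)
    for run in runs:
        for doc, score in min_max_normalize(run).items():
            totals[doc] += score
    n = len(runs)
    fused = {doc: total / n for doc, total in totals.items()}
    # Sort by descending fused score, breaking ties by document id.
    return sorted(fused.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical baseline runs for a single query (document id -> score).
stemmed_run = {"d1": 12.0, "d2": 8.0, "d3": 4.0}
entity_run = {"d2": 0.9, "d3": 0.5, "d4": 0.1}

ranking = combine_runs([stemmed_run, entity_run])
for doc, score in ranking:
    print(doc, round(score, 3))
```

In this toy case, a document ranked well by both runs rises above one that appears near the top of only a single run, which is the usual motivation for combining runs.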

More information

Record ID: 4688
DC Identifier: http://oa.upm.es/4688/
OAI Identifier: oai:oa.upm.es:4688
Official URL: http://ims-sites.dei.unipd.it/documents/71612/86365/CLEF2006wn-adhoc-GoniMenoyoEt2006.pdf
Deposited by: Memoria Investigacion
Deposited on: 22 Oct 2010 10:59
Last Modified: 20 Apr 2016 13:48