Texto completo
Vista Previa |
PDF (Portable Document Format)
- Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (239kB) | Vista Previa |
ORCID: https://orcid.org/0000-0001-8922-5529, González Cristóbal, José Carlos
ORCID: https://orcid.org/0000-0002-1461-2695 and Villena Román, Julio
(2006).
Report of MIRACLE team for the Ad-Hoc track in CLEF 2006.
En: "7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006", 20/09/2006-22/09/2006, Alicante, España. ISBN 2-912335-23-x.
| Título: | Report of MIRACLE team for the Ad-Hoc track in CLEF 2006 |
|---|---|
| Autor/es: |
|
| Tipo de Documento: | Ponencia en Congreso o Jornada (Artículo) |
| Título del Evento: | 7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006 |
| Fechas del Evento: | 20/09/2006-22/09/2006 |
| Lugar del Evento: | Alicante, España |
| Título del Libro: | Working Notes for the CLEF 2006 Workshop |
| Fecha: | 2006 |
| ISBN: | 2-912335-23-x |
| Materias: | |
| ODS: | |
| Palabras Clave Informales: | Linguistic Engineering, Information Retrieval, Trie Indexing |
| Escuela: | E.T.S.I. Telecomunicación (UPM) |
| Departamento: | Matemática Aplicada a las Tecnologías de la Información [hasta 2014] |
| Grupo Investigación UPM: | Grupo de Sistemas Inteligentes |
| Licencias Creative Commons: | Reconocimiento - Sin obra derivada - No comercial |
Vista Previa |
PDF (Portable Document Format)
- Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (239kB) | Vista Previa |
This paper presents the 2006 MIRACLE’s team approach to the AdHoc Information Retrieval track. The experiments for this campaign keep on testing our IR approach. First, a baseline set of runs is obtained, including standard components: stemming, transforming, filtering, entities detection and extracting, and others. Then, a extended set of runs is obtained using several types of combinations of these baseline runs. The improvements introduced for this campaign have been a few ones: we have used an entity recognition and indexing prototype tool into our tokenizing scheme, and we have run more combining experiments for the robust multilingual case than in previous campaigns. However, no significative improvements have been achieved. For the this campaign, runs were submitted for the following languages and tracks: - Monolingual: Bulgarian, French, Hungarian, and Portuguese. - Bilingual: English to Bulgarian, French, Hungarian, and Portuguese; Spanish to French and Portuguese; and French to Portuguese. - Robust monolingual: German, English, Spanish, French, Italian, and Dutch. - Robust bilingual: English to German, Italian to Spanish, and French to Dutch. - Robust multilingual: English to robust monolingual languages. We still need to work harder to improve some aspects of our processing scheme, being the most important, to our knowledge, the entities recognition and normalization.
| ID de Registro: | 4688 |
|---|---|
| Identificador DC: | https://oa.upm.es/4688/ |
| Identificador OAI: | oai:oa.upm.es:4688 |
| URL Oficial: | http://ims-sites.dei.unipd.it/documents/71612/8636... |
| Depositado por: | Memoria Investigacion |
| Depositado el: | 22 Oct 2010 10:59 |
| Ultima Modificación: | 20 Abr 2016 13:48 |
Publicar en el Archivo Digital desde el Portal Científico