HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish

Fernández Martínez, Fernando; Lucas Cuesta, Juan Manuel; Barra Chicote, Roberto; Ferreiros López, Javier y Macías Guarasa, Javier (2010). HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish. En: "LREC 2010, 7th International Conference on Language Resources and Evaluation", 17/05/2010 - 23/05/2010, Republica de Malta. ISBN 2-9517408-6-7.

Descripción

Título: HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish
Autor/es:
  • Fernández Martínez, Fernando
  • Lucas Cuesta, Juan Manuel
  • Barra Chicote, Roberto
  • Ferreiros López, Javier
  • Macías Guarasa, Javier
Tipo de Documento: Ponencia en Congreso o Jornada (Artículo)
Título del Evento: LREC 2010, 7th International Conference on Language Resources and Evaluation
Fechas del Evento: 17/05/2010 - 23/05/2010
Lugar del Evento: Republica de Malta
Título del Libro: Proceedings of LREC 2010, 7th International Conference on Language Resources and Evaluation
Fecha: 2010
ISBN: 2-9517408-6-7
Materias:
Escuela: E.T.S.I. Telecomunicación (UPM)
Departamento: Ingeniería Electrónica
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[img]
Vista Previa
PDF (Document Portable Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (511kB) | Vista Previa

Resumen

In this paper, we describe a new multi-purpose audio-visual database on the context of speech interfaces for controlling household electronic devices. The database comprises speech and video recordings of 19 speakers interacting with a HIFI audio box by means of a spoken dialogue system. Dialogue management is based on Bayesian Networks and the system is provided with contextual information handling strategies. Each speaker was requested to fulfil different sets of specific goals following predefined scenarios, according to both different complexity levels and degrees of freedom or initiative allowed to the user. Due to a careful design and its size, the recorded database allows comprehensive studies on speech recognition, speech understanding, dialogue modeling and management, microphone array based speech processing, and both speech and video-based acoustic source localisation. The database has been labelled for quality and efficiency studies on dialogue performance. The whole database has been validated through both objective and subjective tests.

Más información

ID de Registro: 7434
Identificador DC: http://oa.upm.es/7434/
Identificador OAI: oai:oa.upm.es:7434
URL Oficial: http://www.lrec-conf.org/
Depositado por: Memoria Investigacion
Depositado el: 10 Jun 2011 10:06
Ultima Modificación: 20 Abr 2016 16:33
  • Open Access
  • Open Access
  • Sherpa-Romeo
    Compruebe si la revista anglosajona en la que ha publicado un artículo permite también su publicación en abierto.
  • Dulcinea
    Compruebe si la revista española en la que ha publicado un artículo permite también su publicación en abierto.
  • Recolecta
  • e-ciencia
  • Observatorio I+D+i UPM
  • OpenCourseWare UPM