HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish

Fernández Martínez, Fernando and Lucas Cuesta, Juan Manuel and Barra Chicote, Roberto and Ferreiros López, Javier and Macías Guarasa, Javier (2010). HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish. In: "LREC 2010, 7th International Conference on Language Resources and Evaluation", 17/05/2010 - 23/05/2010, Republica de Malta. ISBN 2-9517408-6-7.

Description

Title: HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish
Author/s:
  • Fernández Martínez, Fernando
  • Lucas Cuesta, Juan Manuel
  • Barra Chicote, Roberto
  • Ferreiros López, Javier
  • Macías Guarasa, Javier
Item Type: Presentation at Congress or Conference (Article)
Event Title: LREC 2010, 7th International Conference on Language Resources and Evaluation
Event Dates: 17/05/2010 - 23/05/2010
Event Location: Republica de Malta
Title of Book: Proceedings of LREC 2010, 7th International Conference on Language Resources and Evaluation
Date: 2010
ISBN: 2-9517408-6-7
Subjects:
Faculty: E.T.S.I. Telecomunicación (UPM)
Department: Ingeniería Electrónica
Creative Commons Licenses: Recognition - No derivative works - Non commercial

Full text

[img]
Preview
PDF - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (511kB) | Preview

Abstract

In this paper, we describe a new multi-purpose audio-visual database on the context of speech interfaces for controlling household electronic devices. The database comprises speech and video recordings of 19 speakers interacting with a HIFI audio box by means of a spoken dialogue system. Dialogue management is based on Bayesian Networks and the system is provided with contextual information handling strategies. Each speaker was requested to fulfil different sets of specific goals following predefined scenarios, according to both different complexity levels and degrees of freedom or initiative allowed to the user. Due to a careful design and its size, the recorded database allows comprehensive studies on speech recognition, speech understanding, dialogue modeling and management, microphone array based speech processing, and both speech and video-based acoustic source localisation. The database has been labelled for quality and efficiency studies on dialogue performance. The whole database has been validated through both objective and subjective tests.

More information

Item ID: 7434
DC Identifier: http://oa.upm.es/7434/
OAI Identifier: oai:oa.upm.es:7434
Official URL: http://www.lrec-conf.org/
Deposited by: Memoria Investigacion
Deposited on: 10 Jun 2011 10:06
Last Modified: 20 Apr 2016 16:33
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM