Low-resource language recognition using a fusion of phoneme posteriorgram counts, acoustic and glottal-based i-vectors

D'haro Enríquez, Luis Fernando; Córdoba Herralde, Ricardo de; Caraballo Morcillo, Miguel Ángel y Pardo Muñoz, José Manuel (2013). Low-resource language recognition using a fusion of phoneme posteriorgram counts, acoustic and glottal-based i-vectors. En: "IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)", 26/05/2013 - 31/05/2013, Vancouver, Canada. pp. 6852-6856.

Descripción

Título: Low-resource language recognition using a fusion of phoneme posteriorgram counts, acoustic and glottal-based i-vectors
Autor/es:
  • D'haro Enríquez, Luis Fernando
  • Córdoba Herralde, Ricardo de
  • Caraballo Morcillo, Miguel Ángel
  • Pardo Muñoz, José Manuel
Tipo de Documento: Ponencia en Congreso o Jornada (Artículo)
Título del Evento: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Fechas del Evento: 26/05/2013 - 31/05/2013
Lugar del Evento: Vancouver, Canada
Título del Libro: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Título de Revista/Publicación: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Fecha: 2013
Materias:
Escuela: E.T.S.I. Telecomunicación (UPM)
Departamento: Ingeniería Electrónica
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[img]
Vista Previa
PDF (Document Portable Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (858kB)

Resumen

This paper presents a description of our system for the Albayzin 2012 LRE competition. One of the main characteristics of this evaluation was the reduced number of available files for training the system, especially for the empty condition where no training data set was provided but only a development set. In addition, the whole database was created from online videos and around one third of the training data was labeled as noisy files. Our primary system was the fusion of three different i-vector based systems: one acoustic system based on MFCCs, a phonotactic system using trigrams of phone-posteriorgram counts, and another acoustic system based on RPLPs that improved robustness against noise. A contrastive system that included new features based on the glottal source was also presented. Official and postevaluation results for all the conditions using the proposed metrics for the evaluation and the Cavg metric are presented in the paper.

Más información

ID de Registro: 26034
Identificador DC: http://oa.upm.es/26034/
Identificador OAI: oai:oa.upm.es:26034
URL Oficial: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6638989
Depositado por: Memoria Investigacion
Depositado el: 17 May 2014 09:27
Ultima Modificación: 22 Sep 2014 11:39
  • Open Access
  • Open Access
  • Sherpa-Romeo
    Compruebe si la revista anglosajona en la que ha publicado un artículo permite también su publicación en abierto.
  • Dulcinea
    Compruebe si la revista española en la que ha publicado un artículo permite también su publicación en abierto.
  • Recolecta
  • e-ciencia
  • Observatorio I+D+i UPM
  • OpenCourseWare UPM