Speaker Diarization Based on Intensity Channel Contribution

Barra Chicote, Roberto; Pardo Muñoz, José Manuel; Ferreiros López, Javier y Montero Martínez, Juan Manuel (2011). Speaker Diarization Based on Intensity Channel Contribution. "IEEE Transactions on Audio, Speech and Language Processing", v. 19 (n. 4); pp. 754-761. ISSN 1558-7916. https://doi.org/10.1109/TASL.2010.2062507.

Descripción

Título: Speaker Diarization Based on Intensity Channel Contribution
Autor/es:
  • Barra Chicote, Roberto
  • Pardo Muñoz, José Manuel
  • Ferreiros López, Javier
  • Montero Martínez, Juan Manuel
Tipo de Documento: Artículo
Título de Revista/Publicación: IEEE Transactions on Audio, Speech and Language Processing
Fecha: 2011
Volumen: 19
Materias:
Palabras Clave Informales: Intensity channel contribution (ICC) , speaker diarization , speaker segmentation , speech processing in meetings
Escuela: E.T.S.I. Telecomunicación (UPM)
Departamento: Ingeniería Electrónica
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[img]
Vista Previa
PDF (Document Portable Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (1MB)

Resumen

The time delay of arrival (TDOA) between multiple microphones has been used since 2006 as a source of information (localization) to complement the spectral features for speaker diarization. In this paper, we propose a new localization feature, the intensity channel contribution (ICC) based on the relative energy of the signal arriving at each channel compared to the sum of the energy of all the channels. We have demonstrated that by joining the ICC features and the TDOA features, the robustness of the localization features is improved and that the diarization error rate (DER) of the complete system (using localization and spectral features) has been reduced. By using this new localization feature, we have been able to achieve a 5.2% DER relative improvement in our development data, a 3.6% DER relative improvement in the RT07 evaluation data and a 7.9% DER relative improvement in the last year's RT09 evaluation data.

Más información

ID de Registro: 11778
Identificador DC: http://oa.upm.es/11778/
Identificador OAI: oai:oa.upm.es:11778
Identificador DOI: 10.1109/TASL.2010.2062507
URL Oficial: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5551177
Depositado por: Memoria Investigacion
Depositado el: 08 Nov 2012 11:39
Ultima Modificación: 22 Sep 2014 10:51
  • Open Access
  • Open Access
  • Sherpa-Romeo
    Compruebe si la revista anglosajona en la que ha publicado un artículo permite también su publicación en abierto.
  • Dulcinea
    Compruebe si la revista española en la que ha publicado un artículo permite también su publicación en abierto.
  • Recolecta
  • e-ciencia
  • Observatorio I+D+i UPM
  • OpenCourseWare UPM