Selection of TDOA Parameters for MDM Speaker Diarization

Martínez González, Beatriz and Pardo Muñoz, José Manuel and Echeverry Correa, Julian David and Vallejo Pinto, José Ángel and Barra Chicote, Roberto (2012). Selection of TDOA Parameters for MDM Speaker Diarization. In: "InterSpeech 2012, 13th Annual Conference of the International Speech Communication Association", 09/09/2012 - 13/09/2012, Portland, Oregon. pp. 1-4.

Description

Title: Selection of TDOA Parameters for MDM Speaker Diarization
Author/s:
  • Martínez González, Beatriz
  • Pardo Muñoz, José Manuel
  • Echeverry Correa, Julian David
  • Vallejo Pinto, José Ángel
  • Barra Chicote, Roberto
Item Type: Presentation at Congress or Conference (Article)
Event Title: InterSpeech 2012, 13th Annual Conference of the International Speech Communication Association
Event Dates: 09/09/2012 - 13/09/2012
Event Location: Portland, Oregon
Title of Book: InterSpeech 2012, 13th Annual Conference of the International Speech Communication Association
Date: September 2012
Subjects:
Freetext Keywords: Speaker diarization, speaker localization, speaker identification, speaker segmentation
Faculty: E.T.S.I. Telecomunicación (UPM)
Department: Ingeniería Electrónica
Creative Commons Licenses: Recognition - No derivative works - Non commercial

Full text

[img]
Preview
PDF - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (477kB) | Preview

Abstract

Several methods to improve multiple distant microphone (MDM) speaker diarization based on Time Delay of Arrival (TDOA) features are evaluated in this paper. All of them avoid the use of a single reference channel to calculate the TDOA values and, based on different criteria, select among all possible pairs of microphones a set of pairs that will be used to estimate the TDOA's. The evaluated methods have been named the "Dynamic Margin" (DM), the "Extreme Regions" (ER), the "Most Common" (MC), the "Cross Correlation" (XCorr) and the "Principle Component Analysis" (PCA). It is shown that all methods improve the baseline results for the development set and four of them improve also the results for the evaluation set. Improvements of 3.49% and 10.77% DER relative are obtained for DM and ER respectively for the test set. The XCorr and PCA methods achieve an improvement of 36.72% and 30.82% DER relative for the test set. Moreover, the computational cost for the XCorr method is 20% less than the baseline.

More information

Item ID: 20413
DC Identifier: http://oa.upm.es/20413/
OAI Identifier: oai:oa.upm.es:20413
Deposited by: Memoria Investigacion
Deposited on: 05 Oct 2013 10:18
Last Modified: 21 Apr 2016 23:12
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM