UPM-UC3M system for music and speech segmentation

Gallardo Antolín, Ascensión and San Segundo Hernández, Rubén (2010). UPM-UC3M system for music and speech segmentation. In: "VI Jornadas en Tecnología del Habla and II Iberian SLTech Workshop", 10/11/2010 - 12/11/2010, Vigo, España.

Description

Title: UPM-UC3M system for music and speech segmentation
Author/s:
  • Gallardo Antolín, Ascensión
  • San Segundo Hernández, Rubén
Item Type: Presentation at Congress or Conference (Article)
Event Title: VI Jornadas en Tecnología del Habla and II Iberian SLTech Workshop
Event Dates: 10/11/2010 - 12/11/2010
Event Location: Vigo, España
Title of Book: Proceedings of the VI Jornadas en Tecnología del Habla and II Iberian SLTech Workshop
Date: 2010
Subjects:
Faculty: E.T.S.I. Telecomunicación (UPM)
Department: Ingeniería Electrónica
Creative Commons Licenses: Recognition - No derivative works - Non commercial

Full text

[img]
Preview
PDF - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (347kB) | Preview

Abstract

This paper describes the UPM-UC3M system for the Albayzín evaluation 2010 on Audio Segmentation. This evaluation task consists of segmenting a broadcast news audio document into clean speech, music, speech with noise in background and speech with music in background. The UPM-UC3M system is based on Hidden Markov Models (HMMs), including a 3-state HMM for every acoustic class. The number of states and the number of Gaussian per state have been tuned for this evaluation. The main analysis during system development has been focused on feature selection. Also, two different architectures have been tested: the first one corresponds to an one-step system whereas the second one is a hierarchical system in which different features have been used for segmenting the different audio classes. For both systems, we have considered long term statistics of MFCC (Mel Frequency Ceptral Coefficients), spectral entropy and CHROMA coefficients. For the best configuration of the one-step system, we have obtained a 25.3% average error rate and 18.7% diarization error (using the NIST tool) and a 23.9% average error rate and 17.9% diarization error for the hierarchical one.

More information

Item ID: 6947
DC Identifier: http://oa.upm.es/6947/
OAI Identifier: oai:oa.upm.es:6947
Official URL: http://fala2010.uvigo.es/
Deposited by: Memoria Investigacion
Deposited on: 10 May 2011 09:11
Last Modified: 20 Apr 2016 16:03
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM