Full text
PDF (Portable Document Format) - a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader is required
Download (288 kB) | Preview
ORCID: https://orcid.org/0000-0003-3877-0089, Lebai Lutfi, Syaheerah Binti, Lucas Cuesta, Juan Manuel, Macías Guarasa, Javier, Montero Martínez, Juan Manuel ORCID: https://orcid.org/0000-0002-7908-5400, San Segundo Hernández, Rubén ORCID: https://orcid.org/0000-0001-9659-5464 and Pardo Muñoz, José Manuel ORCID: https://orcid.org/0000-0002-1009-590X (2009). Acoustic Emotion Recognition using Dynamic Bayesian Networks and Multi-Space Distributions. In: "10th Annual Conference of the International Speech Communication Association, Interspeech 2009", 06/09/2009 - 10/09/2009, Brighton, UK.
| Title: | Acoustic Emotion Recognition using Dynamic Bayesian Networks and Multi-Space Distributions |
|---|---|
| Author(s): | Lebai Lutfi, Syaheerah Binti; Lucas Cuesta, Juan Manuel; Macías Guarasa, Javier; Montero Martínez, Juan Manuel; San Segundo Hernández, Rubén; Pardo Muñoz, José Manuel |
| Document type: | Conference or workshop contribution (Article) |
| Event title: | 10th Annual Conference of the International Speech Communication Association, Interspeech 2009 |
| Event dates: | 06/09/2009 - 10/09/2009 |
| Event location: | Brighton, UK |
| Book title: | Proceedings of 10th Annual Conference of the International Speech Communication Association, Interspeech 2009 |
| Date: | 2009 |
| Subjects: | |
| SDG: | |
| Informal keywords: | automatic emotion recognition, multi-space probability distribution, dynamic Bayesian networks, emotion challenge |
| School: | E.T.S.I. Telecomunicación (UPM) |
| Department: | Ingeniería Electrónica |
| Creative Commons license: | Attribution - NoDerivatives - NonCommercial |
In this paper we describe the acoustic emotion recognition system built at the Speech Technology Group of the Universidad Politecnica de Madrid (Spain) to participate in the INTERSPEECH 2009 Emotion Challenge. Our proposal is based on a Dynamic Bayesian Network (DBN) for the temporal modelling of the emotional speech information. The selected features (MFCC, F0, energy and their variants) are modelled as different streams, and the F0-related ones are integrated under a Multi-Space Distribution (MSD) framework to properly model F0's dual nature (voiced/unvoiced). Experimental evaluation on the challenge test set shows an unweighted recall of 67.06% and 38.24% for the 2-class and 5-class tasks, respectively. In the 2-class case, we achieve results similar to the baseline with considerably fewer features. In the 5-class case, we achieve a statistically significant 6.5% relative improvement.
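The MSD framework mentioned in the abstract handles observations whose dimensionality changes from frame to frame: F0 is undefined on unvoiced frames and a continuous value on voiced ones. A minimal sketch of a two-space MSD observation likelihood, not the authors' code and with purely illustrative weights and Gaussian parameters:

```python
import math

def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian."""
    return math.exp(-((x - mean) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)

def msd_likelihood(f0, w_unvoiced=0.4, w_voiced=0.6, mean=5.0, var=0.09):
    """Frame likelihood under a two-space MSD for F0.

    Space 0 is the zero-dimensional "unvoiced" space; space 1 is a
    1-D Gaussian over log-F0 for voiced frames. `f0` is None for an
    unvoiced frame, otherwise the frame's log-F0 value. All numeric
    parameters here are hypothetical defaults for illustration.
    """
    if f0 is None:
        # Unvoiced frame: only the weight of the discrete space contributes.
        return w_unvoiced
    # Voiced frame: weighted density of the continuous space.
    return w_voiced * gaussian_pdf(f0, mean, var)
```

In an MSD-based model these per-frame likelihoods would be combined with those of the other feature streams (MFCC, energy) inside the DBN's observation model, so the voiced/unvoiced distinction is scored probabilistically rather than by imputing dummy F0 values on unvoiced frames.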
| Record ID: | 5575 |
|---|---|
| DC identifier: | https://oa.upm.es/5575/ |
| OAI identifier: | oai:oa.upm.es:5575 |
| Official URL: | http://www.isca-speech.org/archive/interspeech_200... |
| Deposited by: | Memoria Investigacion |
| Deposited on: | 23 Dec 2010 08:44 |
| Last modified: | 20 Apr 2016 14:21 |