Analysis of complexity and modulation spectra parameterizations to characterize voice roughness

Moro Velázquez, Laureano and Gómez García, Jorge Andrés and Godino Llorente, Juan Ignacio (2014). Analysis of complexity and modulation spectra parameterizations to characterize voice roughness. In: "IberSpeech 2014", 19/11/2014 - 21/11/2014, Las Palmas de Gran Canaria, Spain. pp. 1-10.

Description

Title: Analysis of complexity and modulation spectra parameterizations to characterize voice roughness
Author/s:
  • Moro Velázquez, Laureano
  • Gómez García, Jorge Andrés
  • Godino Llorente, Juan Ignacio
Item Type: Presentation at Congress or Conference (Article)
Event Title: IberSpeech 2014
Event Dates: 19/11/2014 - 21/11/2014
Event Location: Las Palmas de Gran Canaria, Spain
Title of Book: IberSPEECH 2014
Date: 2014
Subjects:
Freetext Keywords: GRBAS, Complexity, Modulation Spectra, Kappa Index, GMM, voice pathology, Roughness
Faculty: E.T.S.I. Telecomunicación (UPM)
Department: Other
Creative Commons Licenses: Attribution - No Derivative Works - Non-Commercial

Abstract

Disordered voices are frequently assessed by speech pathologists using acoustic perceptual evaluations. This can be problematic because the process is subjective and because external factors can compromise the quality of the assessment. To increase the reliability of these evaluations, new indicator parameters obtained from voice signal processing are desirable. With that in mind, this paper presents an automatic evaluation system that emulates perceptual assessments of the roughness level in human voice. Two parameterization methods are used: complexity, which has been used successfully in previous works, and modulation spectra. For the latter, a new group of parameters is proposed: Low Modulation Ratio (LMR), Contrast (MSW) and Homogeneity (MSH). The tested methodology also employs PCA and LDA to reduce the dimensionality of the feature space, and GMM classifiers to evaluate how well the proposed features distinguish the different roughness levels. An efficiency of 82% and a Cohen's Kappa Index of 0.73 are obtained using the modulation spectra parameters, while the complexity parameters reach 73% and 0.58, respectively. The results indicate the usefulness of the proposed modulation spectra features for the automatic evaluation of voice roughness, which may lead to new parameters useful to clinicians.
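
The two figures of merit quoted in the abstract, efficiency and Cohen's Kappa Index, can both be derived from a confusion matrix of perceptual labels against automatic labels. The following is a minimal sketch, assuming that "efficiency" means overall classification accuracy and that the kappa is the unweighted variant; the abstract does not state either point. The function name `accuracy_and_kappa` and the counts in `toy_confusion` are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence, Tuple


def accuracy_and_kappa(confusion: Sequence[Sequence[int]]) -> Tuple[float, float]:
    """Return (accuracy, unweighted Cohen's kappa) for a square confusion matrix.

    Rows are assumed to hold the reference (perceptual) labels and
    columns the labels assigned by the automatic system.
    """
    n_classes = len(confusion)
    if any(len(row) != n_classes for row in confusion):
        raise ValueError("confusion matrix must be square")

    total = sum(sum(row) for row in confusion)
    if total == 0:
        raise ValueError("confusion matrix has no samples")

    # Observed agreement: fraction of samples on the main diagonal.
    observed = sum(confusion[i][i] for i in range(n_classes)) / total

    # Chance agreement: product of the row and column marginals per class.
    row_totals: List[int] = [sum(row) for row in confusion]
    col_totals: List[int] = [
        sum(confusion[i][j] for i in range(n_classes)) for j in range(n_classes)
    ]
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / (total * total)

    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")

    kappa = (observed - expected) / (1.0 - expected)
    return observed, kappa


# Hypothetical counts, invented for illustration only (not results from the paper).
toy_confusion = [
    [20, 5],
    [10, 15],
]
acc, kappa = accuracy_and_kappa(toy_confusion)
print(f"accuracy = {acc:.2f}, kappa = {kappa:.2f}")
```

Kappa discounts the agreement expected by chance from the label marginals, which is why it is reported alongside the raw efficiency: a classifier can score a high accuracy on unbalanced roughness levels while its kappa stays low.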

Funding Projects

Type: Government of Spain
Code: TEC2012-38630-C04-01
Acronym: Unspecified
Leader: Universidad Politécnica de Madrid
Title: Evaluación multimodal de trastornos neurológicos mediante la caracterización de la voz, dinámica de los pliegues vocales y secuencias sacadicas

Type: Madrid Regional Government
Code: Unspecified
Acronym: Unspecified
Leader: Unspecified
Title: Unspecified

More information

Item ID: 44611
DC Identifier: http://oa.upm.es/44611/
OAI Identifier: oai:oa.upm.es:44611
Deposited by: Memoria Investigacion
Deposited on: 07 Mar 2017 16:09
Last Modified: 07 Mar 2017 16:09