Effects of audio compression in automatic detection of voice pathologies

Sáenz Lechón, Nicolas and Osma Ruiz, Víctor and Godino Llorente, Juan Ignacio and Blanco Velasco, Manuel and Cruz Roldán, Fernando and Arias Londoño, Julian (2008). Effects of audio compression in automatic detection of voice pathologies. "IEEE Transactions on Biomedical Engineering", v. 55 (n. 12); pp. 2831-2835. ISSN 0018-9294. https://doi.org/10.1109/TBME.2008.923769.

Description

Title: Effects of audio compression in automatic detection of voice pathologies
Author/s:
  • Sáenz Lechón, Nicolas
  • Osma Ruiz, Víctor
  • Godino Llorente, Juan Ignacio
  • Blanco Velasco, Manuel
  • Cruz Roldán, Fernando
  • Arias Londoño, Julian
Item Type: Article
Título de Revista/Publicación: IEEE Transactions on Biomedical Engineering
Date: December 2008
ISSN: 0018-9294
Volume: 55
Subjects:
Faculty: E.U.I.T. Telecomunicación (UPM)
Department: Ingeniería de Circuitos y Sistemas [hasta 2014]
UPM's Research Group: DET curve Gaussian mixtures models MP3 format ROC curve audio compression effects automatic system performance automatic voice pathology detection system cepstral measurements detector error tradeoff curve noise measurements receiver operating characteristic curve support vector machines telemedicine voice signal characterisation
Creative Commons Licenses: Recognition - Non commercial - Share

Full text

[img]
Preview
PDF - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (353kB) | Preview

Abstract

This paper investigates the performance of an automatic system for voice pathology detection when the voice samples have been compressed in MP3 format and different binary rates (160, 96, 64, 48, 24, and 8 kb/s). The detectors employ cepstral and noise measurements, along with their derivatives, to characterize the voice signals. The classification is performed using Gaussian mixtures models and support vector machines. The results between the different proposed detectors are compared by means of detector error tradeoff (DET) and receiver operating characteristic (ROC) curves, concluding that there are no significant differences in the performance of the detector when the binary rates of the compressed data are above 64 kb/s. This has useful applications in telemedicine, reducing the storage space of voice recordings or transmitting them over narrow-band communications channels.

More information

Item ID: 2019
DC Identifier: http://oa.upm.es/2019/
OAI Identifier: oai:oa.upm.es:2019
DOI: 10.1109/TBME.2008.923769
Official URL: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4494380
Deposited by: Memoria Investigacion
Deposited on: 21 Dec 2009 08:35
Last Modified: 20 Apr 2016 11:49
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM