Effects of audio compression in automatic detection of voice pathologies

Sáenz Lechón, Nicolas; Osma Ruiz, Víctor; Godino Llorente, Juan Ignacio; Blanco Velasco, Manuel; Cruz Roldán, Fernando and Arias Londoño, Julian (2008). Effects of audio compression in automatic detection of voice pathologies. "IEEE Transactions on Biomedical Engineering", v. 55 (n. 12); pp. 2831-2835. ISSN 0018-9294. https://doi.org/10.1109/TBME.2008.923769.

Description

Title: Effects of audio compression in automatic detection of voice pathologies
Author(s):
  • Sáenz Lechón, Nicolas
  • Osma Ruiz, Víctor
  • Godino Llorente, Juan Ignacio
  • Blanco Velasco, Manuel
  • Cruz Roldán, Fernando
  • Arias Londoño, Julian
Document Type: Article
Journal/Publication Title: IEEE Transactions on Biomedical Engineering
Date: December 2008
Volume: 55
Subjects:
School: E.U.I.T. Telecomunicación (UPM) [former name]
Department: Ingeniería de Circuitos y Sistemas [until 2014]
UPM Research Group:
Keywords: DET curve; Gaussian mixtures models; MP3 format; ROC curve; audio compression effects; automatic system performance; automatic voice pathology detection system; cepstral measurements; detector error tradeoff curve; noise measurements; receiver operating characteristic curve; support vector machines; telemedicine; voice signal characterisation
Creative Commons Licenses: Attribution - NonCommercial - ShareAlike

Abstract

This paper investigates the performance of an automatic system for voice pathology detection when the voice samples have been compressed in MP3 format at different bit rates (160, 96, 64, 48, 24, and 8 kb/s). The detectors employ cepstral and noise measurements, along with their derivatives, to characterize the voice signals. Classification is performed using Gaussian mixture models and support vector machines. The results of the different proposed detectors are compared by means of detector error tradeoff (DET) and receiver operating characteristic (ROC) curves, concluding that there are no significant differences in the performance of the detector when the bit rates of the compressed data are above 64 kb/s. This has useful applications in telemedicine, such as reducing the storage space of voice recordings or transmitting them over narrow-band communication channels.
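
The abstract compares detectors through DET and ROC curves. As a rough illustration of how the points on such curves can be derived from detector scores, the following is a minimal sketch and not the authors' implementation. The function names `det_points` and `equal_error_rate` are hypothetical, the scores are made up for illustration and are not data from the paper, and it is an assumption here that a higher score means "more likely pathological".

```python
def det_points(normal_scores, pathological_scores):
    """Return (threshold, false_alarm_rate, miss_rate) per candidate threshold.

    Assumption: a sample is labelled pathological when score >= threshold.
    """
    thresholds = sorted(set(normal_scores) | set(pathological_scores))
    points = []
    for t in thresholds:
        # Normal voices wrongly flagged as pathological.
        false_alarms = sum(s >= t for s in normal_scores)
        # Pathological voices that the detector lets through.
        misses = sum(s < t for s in pathological_scores)
        points.append((t,
                       false_alarms / len(normal_scores),
                       misses / len(pathological_scores)))
    return points


def equal_error_rate(points):
    """Approximate the EER at the threshold where both error rates are closest."""
    _, fa, miss = min(points, key=lambda p: abs(p[1] - p[2]))
    return (fa + miss) / 2


# Made-up scores for illustration only; not results from the paper.
normal = [0.1, 0.2, 0.3, 0.6]
pathological = [0.4, 0.7, 0.8, 0.9]

points = det_points(normal, pathological)
eer = equal_error_rate(points)
print(f"approximate EER: {eer:.2f}")
```

A DET curve plots the miss rate against the false alarm rate, usually on normal-deviate axes, while a ROC curve plots the detection rate (1 - miss rate) against the false alarm rate. Repeating this computation on scores obtained at each bit rate would give one curve per compression setting to compare.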

More information

Record ID: 2019
DC Identifier: http://oa.upm.es/2019/
OAI Identifier: oai:oa.upm.es:2019
DOI: 10.1109/TBME.2008.923769
Official URL: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4494380
Deposited by: Memoria Investigacion
Deposited on: 21 Dec 2009 08:35
Last Modified: 20 Apr 2016 11:49