Emulating the perceptual capabilities of a human evaluator to map the GRB scale for the assessment of voice disorders

Gómez García, Jorge Andrés and Moro Velázquez, Laureano and Mendes Laureano, Janaína and Castellanos Domínguez, Germán and Godino Llorente, Juan Ignacio (2019). Emulating the perceptual capabilities of a human evaluator to map the GRB scale for the assessment of voice disorders. "Engineering Applications of Artificial Intelligence", v. 82 ; pp. 236-251. ISSN 0952-1976. https://doi.org/10.1016/j.engappai.2019.03.027.

Description

Title: Emulating the perceptual capabilities of a human evaluator to map the GRB scale for the assessment of voice disorders
Author/s:
  • Gómez García, Jorge Andrés
  • Moro Velázquez, Laureano
  • Mendes Laureano, Janaína
  • Castellanos Domínguez, Germán
  • Godino Llorente, Juan Ignacio
Item Type: Article
Título de Revista/Publicación: Engineering Applications of Artificial Intelligence
Date: June 2019
ISSN: 0952-1976
Volume: 82
Subjects:
Freetext Keywords: Automatic voice quality analysis; GRBAS scale; Voice assessment; Breathiness; Roughness; Hoarseness
Faculty: E.T.S.I. Telecomunicación (UPM)
Department: Otro
Creative Commons Licenses: Recognition - No derivative works - Non commercial

Full text

[img] PDF - Users in campus UPM only until 1 July 2021 - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (9MB)

Abstract

This paper presents the design of an automatic voice quality analysis system for the assessment of voice pathologies, which emulates the perceptual capabilities of a human evaluator according the GRB scale. For this purpose, a novel methodology based on multiple sets of characteristics, ordinal classification and Gaussian regression is proposed. In particular, a reduced subset of characteristics is identified, and the regressor is used to convert the discrete perceptual scale to a continuum, more in agreement to the nature of the problem under study. The robustness of the system is evaluated in several cross-dataset experiments. Similarly, a clinical evaluation of the predictions provided by the system is carried out. Results indicate that the proposed methodology is proficient in modelling the perceptual capabilities of the human evaluator. They also show that it is possible to extend the GRB scale to a continuum through regression techniques while maintaining the consistency of the results. On average, the deviation between the labels assessed by the expert and the ones provided by the system is of about 0.5 units (in a scale from 0 to 3) for G and B, and of 0.7 units for R. Similarly, the deviation of the labels predicted by the system in the clinical assessment trials is about 0.3 units for G, 0.4 units for B, and 0.5 units for R

Funding Projects

TypeCodeAcronymLeaderTitle
Government of SpainDPI2017-83405-R1UnspecifiedUnspecifiedUnspecified

More information

Item ID: 64436
DC Identifier: http://oa.upm.es/64436/
OAI Identifier: oai:oa.upm.es:64436
DOI: 10.1016/j.engappai.2019.03.027
Official URL: https://www.sciencedirect.com/science/article/pii/S0952197619300776
Deposited by: Memoria Investigacion
Deposited on: 20 Dec 2020 09:09
Last Modified: 20 Dec 2020 09:09
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM