Citation
Arias-Londoño, Julián David and Gómez García, Jorge Andrés and Moro Velázquez, Laureano and Godino Llorente, Juan Ignacio
(2018).
ByoVoz Automatic Voice Condition Analysis System for the 2018 FEMH Challenge.
In: "Proceedings IEEE International Conference on Big Data 2018", 10/12/2018 -13/12/2018, Seatle, WA, USA. ISBN 978-1-5386-5035-6. pp. 5228-5232.
https://doi.org/10.1109/BigData.2018.8622498.
Abstract
This paper presents the methods and results used by the ByoVoz team for the design of an automatic voice condition analysis system, which was submitted to the 2018 Far East Memorial Hospital voice data challenge. The proposed methodology is based on a cascading scheme that firstly discriminates between pathological and normophonic voices, and then identifies the type of disorder. By using diverse feature selection techniques, a subset of complexity, spectral/cepstral and perturbation characteristics were identified for the proposed tasks. Then, several generative classification methodologies based on Gaussian Mixture Models and Gradient Boosting were employed to provide decisions about the input voices in the binary classification, and using onevs-one classification systems based on Random Forests for the categorization according to the type of disorder. By using a 4-folds cross-validation approach on the training partition a sensitivity=0.93 and specificity=0.74 were obtained. Similarly, an unweighted average recall of 0.63 and an accuracy of 66% was obtained for the identification task. Using the scoring metric proposed in the challenge the final resulting score considering both detection and identification is of 0.77.