Evaluation of a transplantation algorithm for expressive speech synthesis

Lorenzo Trueba, Jaime and Barra Chicote, Roberto and Yamagishi, J. and Watts, Oliver and Montero Martínez, Juan Manuel (2013). Evaluation of a transplantation algorithm for expressive speech synthesis. In: "IV Congreso Español de Informática (CEDI 2013). Workshop en Tecnologías Accesibles", 17/09/2013 - 20/09/2013, Madrid, Spain. pp. 1-10.

Description

Title: Evaluation of a transplantation algorithm for expressive speech synthesis
Author/s:
  • Lorenzo Trueba, Jaime
  • Barra Chicote, Roberto
  • Yamagishi, J.
  • Watts, Oliver
  • Montero Martínez, Juan Manuel
Item Type: Presentation at Congress or Conference (Article)
Event Title: IV Congreso Español de Informática (CEDI 2013). Workshop en Tecnologías Accesibles
Event Dates: 17/09/2013 - 20/09/2013
Event Location: Madrid, Spain
Title of Book: IV Congreso Español de Informática (CEDI 2013). Workshop en Tecnologías Accesibles
Date: 2013
Freetext Keywords: Expressive speech synthesis, emotions, adaptation, expressiveness transplantation
Faculty: E.T.S.I. Telecomunicación (UPM)
Department: Ingeniería Electrónica
Creative Commons Licenses: Attribution - No derivative works - Non commercial

Abstract

When designing human-machine interfaces, it is important to consider not only the bare-bones functionality but also the ease of use and accessibility they provide. For voice-based interfaces, it has been proven that imbuing synthetic voices with expressiveness significantly increases their perceived naturalness, which in turn is very helpful when building user-friendly interfaces. This paper proposes an adaptation-based expressiveness transplantation system capable of copying the emotions of a source speaker into any desired target speaker with just a few minutes of read speech and without requiring the recording of additional expressive data. The system was evaluated through a perceptual test for 3 speakers, showing average emotion recognition rates of up to 52% relative to the natural voice recognition rates, while maintaining good scores in similarity and naturalness.

More information

Item ID: 26490
DC Identifier: http://oa.upm.es/26490/
OAI Identifier: oai:oa.upm.es:26490
Deposited by: Memoria Investigacion
Deposited on: 14 Jun 2014 12:30
Last Modified: 22 Sep 2014 11:41