Scaling and universality in the human voice

Luque Serrano, Jordi and Luque Serrano, Bartolome and Lacasa Saiz de Arce, Lucas (2015). Scaling and universality in the human voice. "Journal of the Royal Society Interface", v. 12 (n. 105); pp. 1-6. ISSN 1742-5689. https://doi.org/10.1098/rsif.2014.1344.

Description

Title: Scaling and universality in the human voice
Author/s:
  • Luque Serrano, Jordi
  • Luque Serrano, Bartolome
  • Lacasa Saiz de Arce, Lucas
Item Type: Article
Título de Revista/Publicación: Journal of the Royal Society Interface
Date: 18 February 2015
ISSN: 1742-5689
Volume: 12
Subjects:
Freetext Keywords: voice, human, universality
Faculty: E.T.S. de Ingeniería Aeronáutica y del Espacio (UPM)
Department: Matemática Aplicada a la Ingeniería Aeroespacial
Creative Commons Licenses: Recognition - No derivative works - Non commercial

Full text

[img]
Preview
PDF - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (1MB) | Preview

Abstract

Speech is a distinctive complex feature of human capabilities. In order to understand the physics underlying speech production, in this work, we empirically analyse the statistics of large human speech datasets ranging several languages. We first show that during speech, the energy is unevenly released and powerlaw distributed, reporting a universal robust Gutenberg–Richter-like law in speech. We further show that such ‘earthquakes in speech’ show temporal correlations, as the interevent statistics are again power-law distributed. As this feature takes place in the intraphoneme range, we conjecture that the process responsible for this complex phenomenon is not cognitive, but it resides in the physiological (mechanical) mechanisms of speech production. Moreover, we show that these waiting time distributions are scale invariant under a renormalization group transformation, suggesting that the process of speech generation is indeed operating close to a critical point. These results are put in contrast with current paradigms in speech processing, which point towards low dimensional deterministic chaos as the origin of nonlinear traits in speech fluctuations. As these latter fluctuations are indeed the aspects that humanize synthetic speech, these findings may have an impact in future speech synthesis technologies. Results are robust and independent of the communication language or the number of speakers, pointing towards a universal pattern and yet another hint of complexity in human speech.

Funding Projects

TypeCodeAcronymLeaderTitle
Government of SpainFIS2013-41057-PUnspecifiedUnspecifiedANALISIS MULTI-ESCALA DE REDES COMPLEJAS: TEORIA, EXPERIMENTOS Y APLICACIONES

More information

Item ID: 44558
DC Identifier: http://oa.upm.es/44558/
OAI Identifier: oai:oa.upm.es:44558
DOI: 10.1098/rsif.2014.1344
Official URL: http://rsif.royalsocietypublishing.org/content/12/105/20141344
Deposited by: Memoria Investigacion
Deposited on: 28 Apr 2017 07:26
Last Modified: 17 Jun 2019 07:09
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM