Scaling and universality in the human voice

Luque Serrano, Jordi; Luque Serrano, Bartolome y Lacasa Saiz de Arce, Lucas (2015). Scaling and universality in the human voice. "Journal of the Royal Society Interface", v. 12 (n. 105); pp. 1-6. ISSN 1742-5689. https://doi.org/10.1098/rsif.2014.1344.

Descripción

Título: Scaling and universality in the human voice
Autor/es:
  • Luque Serrano, Jordi
  • Luque Serrano, Bartolome
  • Lacasa Saiz de Arce, Lucas
Tipo de Documento: Artículo
Título de Revista/Publicación: Journal of the Royal Society Interface
Fecha: 18 Febrero 2015
Volumen: 12
Materias:
Palabras Clave Informales: voice, human, universality
Escuela: E.T.S. de Ingeniería Aeronáutica y del Espacio (UPM)
Departamento: Matemática Aplicada a la Ingeniería Aeroespacial
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[img]
Vista Previa
PDF (Document Portable Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (1MB) | Vista Previa

Resumen

Speech is a distinctive complex feature of human capabilities. In order to understand the physics underlying speech production, in this work, we empirically analyse the statistics of large human speech datasets ranging several languages. We first show that during speech, the energy is unevenly released and powerlaw distributed, reporting a universal robust Gutenberg–Richter-like law in speech. We further show that such ‘earthquakes in speech’ show temporal correlations, as the interevent statistics are again power-law distributed. As this feature takes place in the intraphoneme range, we conjecture that the process responsible for this complex phenomenon is not cognitive, but it resides in the physiological (mechanical) mechanisms of speech production. Moreover, we show that these waiting time distributions are scale invariant under a renormalization group transformation, suggesting that the process of speech generation is indeed operating close to a critical point. These results are put in contrast with current paradigms in speech processing, which point towards low dimensional deterministic chaos as the origin of nonlinear traits in speech fluctuations. As these latter fluctuations are indeed the aspects that humanize synthetic speech, these findings may have an impact in future speech synthesis technologies. Results are robust and independent of the communication language or the number of speakers, pointing towards a universal pattern and yet another hint of complexity in human speech.

Proyectos asociados

TipoCódigoAcrónimoResponsableTítulo
Gobierno de EspañaFIS2013-41057-PSin especificarMinisterio de economía y competitividadANALISIS MULTI-ESCALA DE REDES COMPLEJAS: TEORIA, EXPERIMENTOS Y APLICACIONES

Más información

ID de Registro: 44558
Identificador DC: http://oa.upm.es/44558/
Identificador OAI: oai:oa.upm.es:44558
Identificador DOI: 10.1098/rsif.2014.1344
URL Oficial: http://rsif.royalsocietypublishing.org/content/12/105/20141344
Depositado por: Memoria Investigacion
Depositado el: 28 Abr 2017 07:26
Ultima Modificación: 28 Abr 2017 07:26
  • Open Access
  • Open Access
  • Sherpa-Romeo
    Compruebe si la revista anglosajona en la que ha publicado un artículo permite también su publicación en abierto.
  • Dulcinea
    Compruebe si la revista española en la que ha publicado un artículo permite también su publicación en abierto.
  • Recolecta
  • e-ciencia
  • Observatorio I+D+i UPM
  • OpenCourseWare UPM