Development of a speech enhancement system using deep neural networks

Montoro Rodríguez, Daniel (2020). Development of a speech enhancement system using deep neural networks. Thesis (Master thesis), E.T.S.I. Telecomunicación (UPM).

Description

Title: Development of a speech enhancement system using deep neural networks
Author/s:
  • Montoro Rodríguez, Daniel
Item Type: Thesis (Master thesis)
Masters title: Teoría de la Señal y Comunicaciones
Date: 2020
Freetext Keywords: machine learning, deep learning, speech processing, digital audio processing, digital signal processing, dsp, noise reduction, speech enhancement
Faculty: E.T.S.I. Telecomunicación (UPM)
Department: Señales, Sistemas y Radiocomunicaciones
Creative Commons Licenses: Attribution - No derivative works - Non commercial

Abstract

In recent years, deep neural networks have become an important tool in speech technologies,
yielding notable advances in speaker recognition, speech recognition, and speech synthesis. This
project proposes a deep neural network design for speech enhancement that is capable of reducing
the noise level in speech recordings made in real-world scenarios such as public transportation
or a cafeteria. The proposed design is intended to reduce both the high computing power
requirements and the architectural complexity of state-of-the-art models for audio processing
with deep neural networks. In addition, a loss function is introduced that is based on a measure
highly correlated with the perceived quality of speech, and the effect of using it during
training is analyzed. The performance of the model is evaluated using objective measures of
speech quality.

More information

Item ID: 63224
DC Identifier: https://oa.upm.es/63224/
OAI Identifier: oai:oa.upm.es:63224
Deposited by: Biblioteca ETSI Telecomunicación
Deposited on: 24 Jul 2020 10:56
Last Modified: 16 Dec 2022 18:28