Glottal Source Cepstrum Coefficients Applied to NIST SRE 2010

Mazaira Fernández, Luis Miguel, Álvarez Marquina, Agustín

, Gómez Vilda, Pedro

, Martínez Olalla, Rafael

and Muñoz, Cristina (2010). Glottal Source Cepstrum Coefficients Applied to NIST SRE 2010. En: "V Jornadas de Reconocimiento Biométrico de Personas JRBP10", 02/09/2010 - 03/09/2010, Huesca, España.

Descripción

Título:	Glottal Source Cepstrum Coefficients Applied to NIST SRE 2010
Autor/es:	Mazaira Fernández, Luis Miguel Álvarez Marquina, Agustín https://orcid.org/0000-0002-3387-6709 Gómez Vilda, Pedro https://orcid.org/0000-0003-3283-378X Martínez Olalla, Rafael https://orcid.org/0000-0003-2336-9145 Muñoz, Cristina
Tipo de Documento:	Ponencia en Congreso o Jornada (Artículo)
Título del Evento:	V Jornadas de Reconocimiento Biométrico de Personas JRBP10
Fechas del Evento:	02/09/2010 - 03/09/2010
Lugar del Evento:	Huesca, España
Título del Libro:	Actas de las V Jornadas de Reconocimiento Biométrico de Personas JRBP10
Fecha:	2010
Materias:	Telecomunicaciones
ODS:	09. Industria, innovación e infraestructura
Palabras Clave Informales:	Glotal Source, Speaker Characterization, Speaker Recognition, GMM, Speech production, NIST SRE 2010
Escuela:	Facultad de Informática (UPM) [antigua denominación]
Departamento:	Arquitectura y Tecnología de Sistemas Informáticos
Licencias Creative Commons:	Reconocimiento - Sin obra derivada - No comercial

Texto completo

Vista Previa

PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (273kB) | Vista Previa

Resumen

Through the present paper, a novel feature set for speaker recognition based on glottal estimate information is presented. An iterative algorithm is used to derive the vocal tract and glottal source estimations from speech signal. In order to test the importance of glottal source information in speaker characterization, the novel feature set has been tested in the 2010 NIST Speaker Recognition Evaluation (NIST SRE10). The proposed system uses glottal estimate parameter templates and classical cepstral information to build a model for each speaker involved in the recognition process. ALIZE [1] open-source software has been used to create the GMM models for both background and target speakers. Compared to using mel-frequency cepstrum coefficients (MFCC), the misclassification rate for the NIST SRE 2010 reduced from 29.43% to 27.15% when glottal source features are used

Más información

ID de Registro:	7905
Identificador DC:	https://oa.upm.es/7905/
Identificador OAI:	oai:oa.upm.es:7905
URL Oficial:	http://jrbp10.unizar.es/
Depositado por:	Memoria Investigacion
Depositado el:	27 Jul 2011 11:35
Ultima Modificación:	19 Feb 2025 12:30

Estadísticas

Exportar cita

Editar (sólo personal del Archivo)

En esta página

Menú principal

Buscar

Glottal Source Cepstrum Coefficients Applied to NIST SRE 2010

Cita

Descripción

Texto completo

Resumen

Más información

Acciones

Documentos

El repositorio

Agrupados por ...

Datos Investigación

Financiadores

Especiales

En otros formatos

Redes sociales

Información adicional