Dynamic topic-based adaptation of language models: a comparison between different approaches

Echeverry Correa, Julian David, Martínez González, Beatriz, San Segundo Hernández, Rubén ORCID: https://orcid.org/0000-0001-9659-5464, Córdoba Herralde, Ricardo de ORCID: https://orcid.org/0000-0002-7136-9636 and Ferreiros López, Javier ORCID: https://orcid.org/0000-0001-8834-3080 (2014). Dynamic topic-based adaptation of language models: a comparison between different approaches. En: "VIII Jornadas en Tecnologías del Habla and IV Iberian SLTech Workshop (IberSPEECH 2014)", 19/10/2014 - 21/10/2014, Las Palmas de Gran Canaria, Spain. pp. 139-148.

Descripción

Título: Dynamic topic-based adaptation of language models: a comparison between different approaches
Autor/es:
Tipo de Documento: Ponencia en Congreso o Jornada (Artículo)
Título del Evento: VIII Jornadas en Tecnologías del Habla and IV Iberian SLTech Workshop (IberSPEECH 2014)
Fechas del Evento: 19/10/2014 - 21/10/2014
Lugar del Evento: Las Palmas de Gran Canaria, Spain
Título del Libro: VIII Jornadas en Tecnologías del Habla and IV Iberian SLTech Workshop (IberSPEECH 2014)
Fecha: 2014
Materias:
ODS:
Palabras Clave Informales: Language model adaptation, topic identification, automatic speech recognition, information retrieval
Escuela: E.T.S.I. Telecomunicación (UPM)
Departamento: Ingeniería Electrónica
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[thumbnail of INVE_MEM_2014_193789.pdf]
Vista Previa
PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (2MB) | Vista Previa

Resumen

This paper presents a dynamic LM adaptation based on the topic that has been identified on a speech segment. We use LSA and the given topic labels in the training dataset to obtain and use the topic models. We propose a dynamic language model adaptation to improve the recognition performance in "a two stages" AST system. The final stage makes use of the topic identification with two variants: the first on uses just the most probable topic and the other one depends on the relative distances of the topics that have been identified. We perform the adaptation of the LM as a linear interpolation between a background model and topic-based LM. The interpolation weight id dynamically adapted according to different parameters. The proposed method is evaluated on the Spanish partition of the EPPS speech database. We achieved a relative reduction in WER of 11.13% over the baseline system which uses a single blackground LM.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Gobierno de España
TIN2011-28169-C05-03
Sin especificar
Sin especificar
Sin especificar
Comunidad de Madrid
S2009/TIC-1542
Sin especificar
Sin especificar
Sin especificar

Más información

ID de Registro: 37537
Identificador DC: https://oa.upm.es/37537/
Identificador OAI: oai:oa.upm.es:37537
Depositado por: Memoria Investigacion
Depositado el: 14 Oct 2015 17:36
Ultima Modificación: 25 Mar 2023 10:29