THAURUS: An Innovative Multimodal Chatbot Based on the Next Generation of Conversational AI

Estecha Garitagoitia, Marcos Santiago ORCID: https://orcid.org/0000-0001-8153-0182, Rodríguez Cantelar, Mario ORCID: https://orcid.org/0000-0001-9703-4458, Garrachón Ruiz, Alfredo, Fernández García, Claudia Garoé, Esteban Romero, Sergio ORCID: https://orcid.org/0009-0008-6336-7877, Conforto López, Cristina, Saiz Fernández, Alberto, Fernández Salvador, Luis Fernando and D'Haro Enríquez, Luis Fernando ORCID: https://orcid.org/0000-0002-3411-7384 (2023). THAURUS: An Innovative Multimodal Chatbot Based on the Next Generation of Conversational AI. In: "Alexa Socialbot Grand Challenge 5", 2023.

Description

Title: THAURUS: An Innovative Multimodal Chatbot Based on the Next Generation of Conversational AI
Author(s):
Document Type: Conference or Workshop Presentation (Paper)
Event Title: Alexa Socialbot Grand Challenge 5
Event Dates: 2023
Book Title: Alexa Prize SocialBot Grand Challenge 5 Proceedings
Date: 2023
Subjects:
SDGs:
School: E.T.S.I. Telecomunicación (UPM)
Department: Electronic Engineering
UPM Research Group: Speech Technology and Machine Learning (THAU)
Creative Commons License: Attribution - NoDerivatives - NonCommercial

Full text

thaurus-an-innovative-multimodal-chatbot-based-on-the-next-generation-of-conversational-ai.pdf: PDF (Portable Document Format), 4MB

Abstract

The next generation of conversational AI has brought remarkable capabilities such as high contextuality, naturalness, multimodality, and extended knowledge. It has also brought important challenges, such as high user expectations, high latencies, and large computational requirements, as well as more subtle problems: mismatches in existing datasets used for fine-tuning, the difficulty pre-trained LLMs have in handling dialogue interactions, and the integration of multimodal capabilities.

This paper describes the architecture, methodology, and results of our THAURUS chatbot, developed for the Alexa Prize Socialbot Grand Challenge (SGC5). Our proposal relies on several innovative ideas that take advantage of existing LLMs to create engaging user experiences, handling real users in a scalable way without compromising the competition rules. Different SotA dialogue generators were fine-tuned and incorporated to provide variability and to handle a wide range of conversation topics. We also developed mechanisms to control the quality of the responses (e.g., detecting and handling toxic interactions, keeping topic coherence, and increasing engagement by providing up-to-date information in a conversational style).

In addition, our system extends the capabilities of the Cobot architecture by incorporating modules that automatically generate images, clone the voices of fictional characters, serve contextual sounds for entities detected in the dialogue, improve capitalization and punctuation, and provide natural expressions of interest.

Finally, we also included a trained generative selector and a reference-free model for automatic evaluation of turns that could reduce latencies and complement the ranker’s capabilities to select the best generative answer.

Associated projects

Type: Horizon Europe
Code: 101071191
Acronym: ASTOUND
Principal Investigator: Luis Fernando D'Haro
Title: Improving social competences of virtual agents through artificial consciousness based on the Attention Schema Theory.

Type: Government of Spain
Code: PID2021-126061OB-C43
Acronym: BEWORD
Principal Investigator: Ricardo de Córdoba
Title: Discovering meaning and intent beyond the spoken word: towards an intelligent environment for handling multimedia documents.

Type: Government of Spain
Code: PID2020-113096RB-I00
Acronym: ACOGES
Principal Investigator: Fernando Matía Espada
Title: Cognitive Personal Assistance for Social Environments.

More information

Record ID: 81719
DC Identifier: https://oa.upm.es/81719/
OAI Identifier: oai:oa.upm.es:81719
Official URL: https://www.amazon.science/alexa-prize/proceedings...
Deposited by: Mario Rodríguez Cantelar
Deposited on: 10 May 2024 16:49
Last Modified: 10 May 2024 16:49