Context or Retrieval? Evaluating RAG Methods for Art and Museum QA System

Ramos Varela, Samuel, Bellver Soler, Jaime ORCID: https://orcid.org/0009-0006-7973-4913, Estecha Garitagoitia, Marcos Santiago ORCID: https://orcid.org/0000-0001-8153-0182 and D'Haro Enríquez, Luis Fernando ORCID: https://orcid.org/0000-0002-3411-7384 (2025). Context or Retrieval? Evaluating RAG Methods for Art and Museum QA System. En: "15th International Workshop on Spoken Dialogue Systems Technology (IWSDS25)", 27-30 May 2025, Bilbao, Spain.

Descripción

Título: Context or Retrieval? Evaluating RAG Methods for Art and Museum QA System
Autor/es:
Tipo de Documento: Ponencia en Congreso o Jornada (Artículo)
Título del Evento: 15th International Workshop on Spoken Dialogue Systems Technology (IWSDS25)
Fechas del Evento: 27-30 May 2025
Lugar del Evento: Bilbao, Spain
Título del Libro: Proceedings of the 15th International Workshop on Spoken Dialogue Systems Technology
Fecha: Mayo 2025
Materias:
ODS:
Escuela: E.T.S.I. Telecomunicación (UPM)
Departamento: Ingeniería Electrónica
Licencias Creative Commons: Reconocimiento - No comercial - Compartir igual

Texto completo

[thumbnail of 2025iwsds-1-10-2.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (182kB)

Resumen

Recent studies suggest that increasing the context window of language models could outperform retrieval-augmented generation (RAG) methods in certain tasks. However, in domains such as art and museums, where information is inherently multimodal, combining images and detailed textual descriptions, this assumption needs closer examination. To explore this, we compare RAG techniques with direct large-context input approaches for answering questions about artworks. Using a dataset of painting images paired with textual information, we develop a synthetic database of question-answer (QA) pairs for evaluating these methods. The focus is on assessing the efficiency and accuracy of RAG in retrieving and using relevant information compared to passing the entire textual context to a language model. Additionally, we experiment with various strategies for segmenting and retrieving text to optimise the RAG pipeline. The results aim to clarify the trade-offs between these approaches and provide valuable insights for interactive systems designed for art and museum contexts.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Horizonte Europa
101071191
ASTOUND
Luis Fernando D'Haro
Sin especificar
Gobierno de España
PID2021-126061OB-C43
Sin especificar
Sin especificar
Sin especificar
Comunidad de Madrid
PHS-2024/PH-HUM-52
Sin especificar
Sin especificar
Sin especificar

Más información

ID de Registro: 90882
Identificador DC: https://oa.upm.es/90882/
Identificador OAI: oai:oa.upm.es:90882
Depositado por: Jaime Bellver Soler
Depositado el: 17 Sep 2025 06:55
Ultima Modificación: 17 Sep 2025 08:07