Texto completo
|
PDF (Portable Document Format)
- Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (182kB) |
ORCID: https://orcid.org/0009-0006-7973-4913, Estecha Garitagoitia, Marcos Santiago
ORCID: https://orcid.org/0000-0001-8153-0182 and D'Haro Enríquez, Luis Fernando
ORCID: https://orcid.org/0000-0002-3411-7384
(2025).
Context or Retrieval? Evaluating RAG Methods for Art and Museum QA System.
En: "15th International Workshop on Spoken Dialogue Systems Technology (IWSDS25)", 27-30 May 2025, Bilbao, Spain.
| Título: | Context or Retrieval? Evaluating RAG Methods for Art and Museum QA System |
|---|---|
| Autor/es: |
|
| Tipo de Documento: | Ponencia en Congreso o Jornada (Artículo) |
| Título del Evento: | 15th International Workshop on Spoken Dialogue Systems Technology (IWSDS25) |
| Fechas del Evento: | 27-30 May 2025 |
| Lugar del Evento: | Bilbao, Spain |
| Título del Libro: | Proceedings of the 15th International Workshop on Spoken Dialogue Systems Technology |
| Fecha: | Mayo 2025 |
| Materias: | |
| ODS: | |
| Escuela: | E.T.S.I. Telecomunicación (UPM) |
| Departamento: | Ingeniería Electrónica |
| Licencias Creative Commons: | Reconocimiento - No comercial - Compartir igual |
|
PDF (Portable Document Format)
- Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (182kB) |
Recent studies suggest that increasing the context window of language models could outperform retrieval-augmented generation (RAG) methods in certain tasks. However, in domains such as art and museums, where information is inherently multimodal, combining images and detailed textual descriptions, this assumption needs closer examination. To explore this, we compare RAG techniques with direct large-context input approaches for answering questions about artworks. Using a dataset of painting images paired with textual information, we develop a synthetic database of question-answer (QA) pairs for evaluating these methods. The focus is on assessing the efficiency and accuracy of RAG in retrieving and using relevant information compared to passing the entire textual context to a language model. Additionally, we experiment with various strategies for segmenting and retrieving text to optimise the RAG pipeline. The results aim to clarify the trade-offs between these approaches and provide valuable insights for interactive systems designed for art and museum contexts.
| ID de Registro: | 90882 |
|---|---|
| Identificador DC: | https://oa.upm.es/90882/ |
| Identificador OAI: | oai:oa.upm.es:90882 |
| Depositado por: | Jaime Bellver Soler |
| Depositado el: | 17 Sep 2025 06:55 |
| Ultima Modificación: | 17 Sep 2025 08:07 |
Publicar en el Archivo Digital desde el Portal Científico