Using large language models to estimate features of multi-word expressions: Concreteness, valence, arousal

Martínez Ruiz, Gonzalo ORCID: https://orcid.org/0000-0002-9125-6225, Molero García-Morato, Juan, González Chamoso, Sandra, Conde Díaz, Javier ORCID: https://orcid.org/0000-0002-5304-0626, Brysbaert, Marc ORCID: https://orcid.org/0000-0002-3645-3189 and Reviriego Vasallo, Pedro ORCID: https://orcid.org/0000-0003-2540-5234 (2024). Using large language models to estimate features of multi-word expressions: Concreteness, valence, arousal. "Behavior Research Methods", v. 57 (n. 5); pp. 2-11. https://doi.org/10.3758/s13428-024-02515-z.

Descripción

Título: Using large language models to estimate features of multi-word expressions: Concreteness, valence, arousal
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: Behavior Research Methods
Fecha: Diciembre 2024
Volumen: 57
Número: 5
Materias:
Palabras Clave Informales: word norms, concreteness, valence, arousal, multi-word expressions, large language model
Escuela: E.T.S.I. Telecomunicación (UPM)
Departamento: Ingeniería de Sistemas Telemáticos
Grupo Investigación UPM: Internet de Nueva Generación
Licencias Creative Commons: Ninguna

Texto completo

[thumbnail of accpeted_version_with_data.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (1MB)

Resumen

This study investigates the potential of large language models (LLMs) to provide accurate estimates of concreteness, valence and arousal for multi-word expressions. Unlike previous artificial intelligence (AI) methods, LLMs can capture the nuanced meanings of multi-word expressions. We systematically evaluated GPT-4o's ability to predict concreteness, valence and arousal. In Study 1, GPT-4o showed strong correlations with human concreteness ratings (r = .8) for multi-word expressions. In Study 2, these findings were repeated for valence and arousal ratings of individual words, matching or outperforming previous AI models. Studies 3-5 extended the valence and arousal analysis to multi-word expressions and showed good validity of the LLM-generated estimates for these stimuli as well. To help researchers with stimulus selection, we provide datasets with LLM-generated norms of concreteness, valence and arousal for 126,397 English single words and 63,680 multi-word expressions.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Gobierno de España
PID2022-136684OB-C21/C22
FUN4DATE
Sin especificar
FUN4DATE

Más información

ID de Registro: 85232
Identificador DC: https://oa.upm.es/85232/
Identificador OAI: oai:oa.upm.es:85232
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/10316892
Identificador DOI: 10.3758/s13428-024-02515-z
URL Oficial: https://link.springer.com/article/10.3758/s13428-0...
Depositado por: Javier Conde Díaz
Depositado el: 09 Dic 2024 13:47
Ultima Modificación: 05 Dic 2025 01:45