Using large language models to estimate features of multi-word expressions: Concreteness, valence, arousal

Martínez Ruiz, Gonzalo

, Molero García-Morato, Juan, González Chamoso, Sandra, Conde Díaz, Javier

, Brysbaert, Marc

and Reviriego Vasallo, Pedro

(2024). Using large language models to estimate features of multi-word expressions: Concreteness, valence, arousal. "Behavior Research Methods", v. 57 (n. 5); pp. 2-11. https://doi.org/10.3758/s13428-024-02515-z.

Descripción

Título:	Using large language models to estimate features of multi-word expressions: Concreteness, valence, arousal
Autor/es:	Martínez Ruiz, Gonzalo https://orcid.org/0000-0002-9125-6225 Molero García-Morato, Juan González Chamoso, Sandra Conde Díaz, Javier https://orcid.org/0000-0002-5304-0626 Brysbaert, Marc https://orcid.org/0000-0002-3645-3189 Reviriego Vasallo, Pedro https://orcid.org/0000-0003-2540-5234
Tipo de Documento:	Artículo
Título de Revista/Publicación:	Behavior Research Methods
Fecha:	Diciembre 2024
Volumen:	57
Número:	5
Materias:	Informática Matemáticas Psicología
Palabras Clave Informales:	word norms, concreteness, valence, arousal, multi-word expressions, large language model
Escuela:	E.T.S.I. Telecomunicación (UPM)
Departamento:	Ingeniería de Sistemas Telemáticos
Grupo Investigación UPM:	Internet de Nueva Generación
Licencias Creative Commons:	Ninguna

Texto completo

[thumbnail of accpeted_version_with_data.pdf]

PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (1MB)

Resumen

This study investigates the potential of large language models (LLMs) to provide accurate estimates of concreteness, valence and arousal for multi-word expressions. Unlike previous artificial intelligence (AI) methods, LLMs can capture the nuanced meanings of multi-word expressions. We systematically evaluated GPT-4o's ability to predict concreteness, valence and arousal. In Study 1, GPT-4o showed strong correlations with human concreteness ratings (r = .8) for multi-word expressions. In Study 2, these findings were repeated for valence and arousal ratings of individual words, matching or outperforming previous AI models. Studies 3-5 extended the valence and arousal analysis to multi-word expressions and showed good validity of the LLM-generated estimates for these stimuli as well. To help researchers with stimulus selection, we provide datasets with LLM-generated norms of concreteness, valence and arousal for 126,397 English single words and 63,680 multi-word expressions.

Proyectos asociados

Tipo

Código

Acrónimo

Responsable

Título

Gobierno de España

PID2022-136684OB-C21/C22

FUN4DATE

Sin especificar

FUN4DATE

Más información

ID de Registro:	85232
Identificador DC:	https://oa.upm.es/85232/
Identificador OAI:	oai:oa.upm.es:85232
URL Portal Científico:	https://portalcientifico.upm.es/es/ipublic/item/10316892
Identificador DOI:	10.3758/s13428-024-02515-z
URL Oficial:	https://link.springer.com/article/10.3758/s13428-0...
Depositado por:	Javier Conde Díaz
Depositado el:	09 Dic 2024 13:47
Ultima Modificación:	05 Dic 2025 01:45

Estadísticas

Exportar cita

Editar (sólo personal del Archivo)

En esta página

Menú principal

Buscar

Using large language models to estimate features of multi-word expressions: Concreteness, valence, arousal

Cita

Descripción

Texto completo

Resumen

Proyectos asociados

Más información

Acciones

Metrics

Altmetrics probando

Dimensions

Documentos

El repositorio

Agrupados por ...

Datos Investigación

Financiadores

Especiales

En otros formatos

Redes sociales

Información adicional