Towards a unified sentiment lexicon based on graphics processing units

Barbosa Santillán, Liliana Ibeth y Alvarez de Mon Rego, Inmaculada (2014). Towards a unified sentiment lexicon based on graphics processing units. "Mathematical Problems in Engineering", v. 2014 (n. null); pp. 1-19. ISSN 1024-123X. https://doi.org/10.1155/2014/429629.

Descripción

Título: Towards a unified sentiment lexicon based on graphics processing units
Autor/es:
  • Barbosa Santillán, Liliana Ibeth
  • Alvarez de Mon Rego, Inmaculada
Tipo de Documento: Artículo
Título de Revista/Publicación: Mathematical Problems in Engineering
Fecha: Abril 2014
Volumen: 2014
Materias:
Escuela: E.T.S.I. y Sistemas de Telecomunicación (UPM)
Departamento: Lingüistica Aplicada a la Ciencia y a la Tecnología
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[img]
Vista Previa
PDF (Document Portable Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (6MB) | Vista Previa

Resumen

This paper presents an approach to create what we have called a Unified Sentiment Lexicon (USL). This approach aims at aligning, unifying, and expanding the set of sentiment lexicons which are available on the web in order to increase their robustness of coverage. One problem related to the task of the automatic unification of different scores of sentiment lexicons is that there are multiple lexical entries for which the classification of positive, negative, or neutral {P, Z, N} depends on the unit of measurement used in the annotation methodology of the source sentiment lexicon. Our USL approach computes the unified strength of polarity of each lexical entry based on the Pearson correlation coefficient which measures how correlated lexical entries are with a value between 1 and -1, where 1 indicates that the lexical entries are perfectly correlated, 0 indicates no correlation, and -1 means they are perfectly inversely correlated and so is the UnifiedMetrics procedure for CPU and GPU, respectively. Another problem is the high processing time required for computing all the lexical entries in the unification task. Thus, the USL approach computes a subset of lexical entries in each of the 1344 GPU cores and uses parallel processing in order to unify 155802 lexical entries. The results of the analysis conducted using the USL approach show that the USL has 95.430 lexical entries, out of which there are 35.201 considered to be positive, 22.029 negative, and 38.200 neutral. Finally, the runtime was 10 minutes for 95.430 lexical entries; this allows a reduction of the time computing for the UnifiedMetrics by 3 times.

Más información

ID de Registro: 35949
Identificador DC: http://oa.upm.es/35949/
Identificador OAI: oai:oa.upm.es:35949
Identificador DOI: 10.1155/2014/429629
URL Oficial: http://www.hindawi.com/journals/mpe/2014/429629/
Depositado por: Memoria Investigacion
Depositado el: 09 Feb 2016 19:43
Ultima Modificación: 09 Feb 2016 19:50
  • Open Access
  • Open Access
  • Sherpa-Romeo
    Compruebe si la revista anglosajona en la que ha publicado un artículo permite también su publicación en abierto.
  • Dulcinea
    Compruebe si la revista española en la que ha publicado un artículo permite también su publicación en abierto.
  • Recolecta
  • e-ciencia
  • Observatorio I+D+i UPM
  • OpenCourseWare UPM