FacTeR-Check: Semi-automated fact-checking through semantic similarity and natural language inference

Martín García, Alejandro ORCID: https://orcid.org/0000-0002-0800-7632, Huertas Tato, Javier ORCID: https://orcid.org/0000-0003-4127-5505, Huertas García, Álvaro ORCID: https://orcid.org/0000-0003-2165-0144, Villar Rodriguez, Guillermo ORCID: https://orcid.org/0000-0001-7942-2879 and Camacho Fernández, David ORCID: https://orcid.org/0000-0002-5051-3475 (2022). FacTeR-Check: Semi-automated fact-checking through semantic similarity and natural language inference. "Knowledge-Based Systems", v. 251 ; ISSN 0950-7051. https://doi.org/10.1016/j.knosys.2022.109265.

Descripción

Título: FacTeR-Check: Semi-automated fact-checking through semantic similarity and natural language inference
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: Knowledge-Based Systems
Fecha: 5 Septiembre 2022
ISSN: 0950-7051
Volumen: 251
Materias:
ODS:
Palabras Clave Informales: Misinformation, Transformers, COVID-19, Hoax, Natural language inference, Semantic similarity
Escuela: E.T.S.I. de Sistemas Informáticos (UPM)
Departamento: Sistemas Informáticos
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[thumbnail of 1-s2.0-S0950705122006323-main.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (2MB)

Resumen

Our society produces and shares overwhelming amounts of information through Online Social Net-works (OSNs). Within this environment, misinformation and disinformation have proliferated, becom-ing a public safety concern in most countries. Allowing the public and professionals to efficiently find reliable evidence about the factual veracity of a claim is a crucial step to mitigate this harmful spread. To this end, we propose FacTeR-Check, a multilingual architecture for semi-automated fact-checking and hoaxes propagation analysis that can be used to implement applications designed for both the general public and for fact-checking organisations. FacTeR-Check implements three different modules relying on the XLM-RoBERTa Transformer architecture to evaluate semantic similarity, to calculate natural language inference and to build search queries through automatic keywords extraction and Named-Entity Recognition. The three modules have been validated using state-of-the-art benchmark datasets, exhibiting good performance in all of them. Besides, FacTeR-Check is employed to collect and label a dataset, called NLI19-SP, composed of more than 40,000 tweets supporting or denying 60 hoaxes related to COVID-19, released publicly. Finally, an analysis of the data collected in this dataset is provided, which allows to obtain a deep insight of how disinformation operated during the COVID-19 pandemic in Spanish-speaking countries.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Gobierno de España
PID2020-117263GB-100
FightDIS
Sin especificar
Sin especificar
Gobierno de España
PLEC2021-007681
XAI-Disinfodemics
Sin especificar
Sin especificar
Comunidad de Madrid
S2018/TCS-4566
Sin especificar
Sin especificar
Sin especificar
Sin especificar
2020-EU-IA-0252
IBERIFIER
Sin especificar
Iberian Digital Media Research and Fact-Checking Hub

Más información

ID de Registro: 88876
Identificador DC: https://oa.upm.es/88876/
Identificador OAI: oai:oa.upm.es:88876
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/9950821
Identificador DOI: 10.1016/j.knosys.2022.109265
URL Oficial: https://www.sciencedirect.com/science/article/pii/...
Depositado por: iMarina Portal Científico
Depositado el: 05 May 2025 15:48
Ultima Modificación: 05 May 2025 15:48