Use of conditional generative adversarial networks to create demographic collaborative filtering datasets

Bobadilla Sancho, Jesús ORCID: https://orcid.org/0000-0003-0619-1322 and Gutiérrez Rodríguez, Abraham ORCID: https://orcid.org/0000-0001-6974-7514 (2025). Use of conditional generative adversarial networks to create demographic collaborative filtering datasets. "Applied Soft Computing", v. 169 (n. 112608); ISSN 15684946. https://doi.org/10.1016/j.asoc.2024.112608.

Descripción

Título: Use of conditional generative adversarial networks to create demographic collaborative filtering datasets
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: Applied Soft Computing
Fecha: Enero 2025
ISSN: 15684946
Volumen: 169
Número: 112608
Materias:
ODS:
Palabras Clave Informales: CGANRS, Conditional Generative Adversarial Networks, Fairness, Collaborative Filtering, Recommender Systems,Synthetic Datasets
Escuela: E.T.S.I. de Sistemas Informáticos (UPM)
Departamento: Sistemas Informáticos
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[thumbnail of 10310783.pdf] PDF (Portable Document Format) - Acceso permitido solamente a usuarios en el campus de la UPM hasta 31 Enero 2027 - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (18MB)

Resumen

This paper proposes a method to create synthetic collaborative filtering datasets that can be used to test both current and new fair recommender systems models. The proposed "Conditional Generative Adversarial Network for Recommender Systems (CGANRS)" method generalizes the existing generative adversarial network for recommender systems one, and it makes use of a conditional generative adversarial network to artificially generate synthetic profiles from a source dataset such as MovieLens. The created datasets can be parameterized to have different sizes and to include different number of users and items. Additionally, the provided parameters include the proportion of multi-categorical demographic information such as the number of male vs. female users, or the proportions of very young, young, adult, and senior users. To test the proposed method, three sets of synthetic databases have been created, containing different a) numbers of users, b) numbers of items, and c) proportions of male users versus female users. Results show an adequate behavior of the generated datasets, testing their a) profiles separability, b) main statistical distributions, and c) recommendation accuracies. Synthetic data sets created using the proposed conditional generative adversarial network for recommender systems method are particularly useful to improve research in the fairness field of the recommender systems area. To extend its use and to facilitate reproducibility, the source code is provided to generate as many demographic datasets as desired, as well as the artificially generated datasets in this research. Some promising future works are proposed, including a) the variation of the stochastic Gaussian distribution used to create the random noise vectors that feed the adversarial network generator model, and b) testing the fairness of the most relevant collaborative filtering models on different synthetic scenarios.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Gobierno de España
PID2019–106493RB-I00
Sin especificar
Sin especificar
Sin especificar

Más información

ID de Registro: 91904
Identificador DC: https://oa.upm.es/91904/
Identificador OAI: oai:oa.upm.es:91904
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/10310783
Identificador DOI: 10.1016/j.asoc.2024.112608
URL Oficial: https://www.sciencedirect.com/science/article/pii/...
Depositado por: iMarina Portal Científico
Depositado el: 24 Nov 2025 17:01
Ultima Modificación: 24 Nov 2025 17:14