Advances in AI-Generated Images and Videos

Bougueffa, Hessen ORCID: https://orcid.org/0009-0009-0556-9996, Keita, Mamadou ORCID: https://orcid.org/0009-0009-7618-9253, Hamidouche, Wassim ORCID: https://orcid.org/0000-0002-0143-1756, Taleb-Ahmed, Abdelmalik ORCID: https://orcid.org/0000-0001-7218-3799, Liz López, Helena ORCID: https://orcid.org/0000-0003-4962-6314, Martín García, Alejandro ORCID: https://orcid.org/0000-0002-0800-7632, Camacho Fernández, David ORCID: https://orcid.org/0000-0002-5051-3475 and Hadid, Abdenour ORCID: https://orcid.org/0000-0001-9092-735X (2024). Advances in AI-Generated Images and Videos. "International journal of interactive multimedia and artificial intelligence", v. 9 (n. 1); pp. 173-208. ISSN 1989-1660. https://doi.org/10.9781/ijimai.2024.11.003.

Descripción

Título: Advances in AI-Generated Images and Videos
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: International journal of interactive multimedia and artificial intelligence
Fecha: 2024
ISSN: 1989-1660
Volumen: 9
Número: 1
Materias:
ODS:
Palabras Clave Informales: AI-Generated Content, Image generation, Multimodal, Video Generation
Escuela: E.T.S.I. de Sistemas Informáticos (UPM)
Departamento: Sistemas Informáticos
Licencias Creative Commons: Reconocimiento

Texto completo

[thumbnail of ijimai_9_1_16.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (840kB)

Resumen

In recent years generative AI models and tools have experienced a significant increase, especially techniques to generate synthetic multimedia content, such as images or videos. These methodologies present a wide range of possibilities; however, they can also present several risks that should be taken into account. In this survey we describe in detail different techniques for generating synthetic multimedia content, and we also analyse the most recent techniques for their detection. In order to achieve these objectives, a key aspect is the availability of datasets, so we have also described the main datasets available in the state of the art. Finally, from our analysis we have extracted the main trends for the future, such as transparency and interpretability, the generation of multimodal multimedia content, the robustness of models and the increased use of diffusion models. We find a roadmap of deep challenges, including temporal consistency, computation requirements, generalizability, ethical aspects, and constant adaptation.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Gobierno de España
PCI2022-134990-2
MARTINI
Sin especificar
Malicious actors profiling and detection in Online Social Networks through Artificial Intelligence
Gobierno de España
PLEC 2021-007681
XAI-Disinfodemics
Sin especificar
eXplainable AI for disinformation and conspiracy detection during infodemics
Horizonte Europa
101158511
IBERIFIER Plus
Sin especificar
Iberian Digital Media Observatory
Horizonte Europa
101168344
TUAI
Sin especificar
Towards an Understanding of Artificial Intelligence via a transparent, open and explainable perspective
Comunidad de Madrid
TEC-2024/COM-404
CIRMA-CAM
Sin especificar
Ciberseguridad e Inteligencia Artificial Resiliente para la Comunidad de Madrid

Más información

ID de Registro: 88855
Identificador DC: https://oa.upm.es/88855/
Identificador OAI: oai:oa.upm.es:88855
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/10307383
Identificador DOI: 10.9781/ijimai.2024.11.003
URL Oficial: https://www.ijimai.org/journal/bibcite/reference/3...
Depositado por: iMarina Portal Científico
Depositado el: 30 Abr 2025 16:52
Ultima Modificación: 30 Abr 2025 16:55