Texto completo
|
PDF (Portable Document Format)
- Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (840kB) |
ORCID: https://orcid.org/0009-0009-0556-9996, Keita, Mamadou
ORCID: https://orcid.org/0009-0009-7618-9253, Hamidouche, Wassim
ORCID: https://orcid.org/0000-0002-0143-1756, Taleb-Ahmed, Abdelmalik
ORCID: https://orcid.org/0000-0001-7218-3799, Liz López, Helena
ORCID: https://orcid.org/0000-0003-4962-6314, Martín García, Alejandro
ORCID: https://orcid.org/0000-0002-0800-7632, Camacho Fernández, David
ORCID: https://orcid.org/0000-0002-5051-3475 and Hadid, Abdenour
ORCID: https://orcid.org/0000-0001-9092-735X
(2024).
Advances in AI-Generated Images and Videos.
"International journal of interactive multimedia and artificial intelligence", v. 9
(n. 1);
pp. 173-208.
ISSN 1989-1660.
https://doi.org/10.9781/ijimai.2024.11.003.
| Título: | Advances in AI-Generated Images and Videos |
|---|---|
| Autor/es: |
|
| Tipo de Documento: | Artículo |
| Título de Revista/Publicación: | International journal of interactive multimedia and artificial intelligence |
| Fecha: | 2024 |
| ISSN: | 1989-1660 |
| Volumen: | 9 |
| Número: | 1 |
| Materias: | |
| ODS: | |
| Palabras Clave Informales: | AI-Generated Content, Image generation, Multimodal, Video Generation |
| Escuela: | E.T.S.I. de Sistemas Informáticos (UPM) |
| Departamento: | Sistemas Informáticos |
| Licencias Creative Commons: | Reconocimiento |
|
PDF (Portable Document Format)
- Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (840kB) |
In recent years generative AI models and tools have experienced a significant increase, especially techniques to generate synthetic multimedia content, such as images or videos. These methodologies present a wide range of possibilities; however, they can also present several risks that should be taken into account. In this survey we describe in detail different techniques for generating synthetic multimedia content, and we also analyse the most recent techniques for their detection. In order to achieve these objectives, a key aspect is the availability of datasets, so we have also described the main datasets available in the state of the art. Finally, from our analysis we have extracted the main trends for the future, such as transparency and interpretability, the generation of multimodal multimedia content, the robustness of models and the increased use of diffusion models. We find a roadmap of deep challenges, including temporal consistency, computation requirements, generalizability, ethical aspects, and constant adaptation.
| ID de Registro: | 88855 |
|---|---|
| Identificador DC: | https://oa.upm.es/88855/ |
| Identificador OAI: | oai:oa.upm.es:88855 |
| URL Portal Científico: | https://portalcientifico.upm.es/es/ipublic/item/10307383 |
| Identificador DOI: | 10.9781/ijimai.2024.11.003 |
| URL Oficial: | https://www.ijimai.org/journal/bibcite/reference/3... |
| Depositado por: | iMarina Portal Científico |
| Depositado el: | 30 Abr 2025 16:52 |
| Ultima Modificación: | 30 Abr 2025 16:55 |
Publicar en el Archivo Digital desde el Portal Científico