Edge AI in practice: a survey and deployment framework for neural networks on embedded systems

Córdova Cárdenas, Ruth ORCID: https://orcid.org/0009-0003-2025-5676, Amor Martín, Daniel and Gutiérrez Martín, Álvaro ORCID: https://orcid.org/0000-0001-8926-5328 (2025). Edge AI in practice: a survey and deployment framework for neural networks on embedded systems. "Electronics", v. 14 (n. 24); p. 4877. ISSN 0883-4989. https://doi.org/10.3390/electronics14244877.

Descripción

Título: Edge AI in practice: a survey and deployment framework for neural networks on embedded systems
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: Electronics
Fecha: 11 Diciembre 2025
ISSN: 0883-4989
Volumen: 14
Número: 24
Materias:
ODS:
Palabras Clave Informales: Neural networks; embedded systems; edge AI; model compression; inference optimization; TinyML
Escuela: E.T.S.I. Telecomunicación (UPM)
Departamento: Tecnología Fotónica y Bioingeniería
Licencias Creative Commons: Reconocimiento

Texto completo

[thumbnail of 10427773.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (1MB)

Resumen

The growing demand for intelligent and autonomous devices has accelerated the integration of neural networks into embedded systems, a paradigm known as Edge AI. While this approach enables real-time, low-latency processing with improved privacy, it remains constrained by strict limitations in memory, computation and energy. This paper presents a systematic review, aligned with PRISMA principles, that examines the current landscape of deep learning deployment on embedded hardware. The review analyzes key optimization techniques-including pruning, quantization and inference-level improvements-together with lightweight architectures such as CNNs, RNNs and compact networks, as well as a diverse ecosystem of hardware platforms and software frameworks. From the recurring patterns identified in the literature, we derive a practical five-stage methodology that guides developers through requirement definition, model selection, optimization, hardware alignment and deployment. Unlike existing surveys that mainly provide descriptive taxonomies, this methodology offers a structured and reproducible workflow explicitly designed to support multi-objective trade-offs in resource-constrained environments. The review also identifies emerging trends such as TinyML and hybrid architectures and highlights persistent gaps, including limited support for ultra-low-precision inference, variability in hardware toolchains and the absence of standardized holistic benchmarking. By synthesizing these insights into a coherent framework, this work aims to facilitate more efficient, robust and scalable Edge AI implementations.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Comunidad de Madrid
IND2023/TIC-27863
Sin especificar
Sin especificar
Sin especificar

Más información

ID de Registro: 92947
Identificador DC: https://oa.upm.es/92947/
Identificador OAI: oai:oa.upm.es:92947
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/10427773
Identificador DOI: 10.3390/electronics14244877
URL Oficial: https://www.mdpi.com/2079-9292/14/24/4877
Depositado por: iMarina Portal Científico
Depositado el: 16 Ene 2026 11:32
Ultima Modificación: 16 Ene 2026 11:33