Feasibility of Deep Reinforcement Learning for the real-time attitude control of a satellite system

Pérez Muñoz, Ángel Grover ORCID: https://orcid.org/0009-0007-0760-6071, López García, Guillermo ORCID: https://orcid.org/0009-0004-3627-0413, García Villoria, Irene ORCID: https://orcid.org/0009-0008-5664-4186, Alonso Muñoz, Alejandro Antonio ORCID: https://orcid.org/0000-0002-1622-8996, Porras Hermoso, Ángel Luis ORCID: https://orcid.org/0000-0002-3576-0603 and Pérez Hernández, María de los Santos ORCID: https://orcid.org/0000-0003-2949-3307 (2025). Feasibility of Deep Reinforcement Learning for the real-time attitude control of a satellite system. "Journal of Systems Architecture", v. 167 (n. 103513); https://doi.org/10.1016/j.sysarc.2025.103513.

Descripción

Título: Feasibility of Deep Reinforcement Learning for the real-time attitude control of a satellite system
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: Journal of Systems Architecture
Fecha: 17 Julio 2025
Volumen: 167
Número: 103513
Materias:
ODS:
Palabras Clave Informales: Artificial intelligence, Neural network controllers, Safety-critical systems, Embedded systems
Escuela: E.T.S. de Ingenieros Informáticos (UPM)
Departamento: Arquitectura y Tecnología de Sistemas Informáticos
Grupo Investigación UPM: Sistemas de tiempo real y arquitectura de servicios telemáticos STRAST
Licencias Creative Commons: Reconocimiento

Texto completo

[thumbnail of paper-published.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (3MB)

Resumen

Although Machine Learning (ML) is widely used in a variety of interdisciplinary applications, its implementation in safety-critical systems, such as the Attitude Control System (ACS) of a satellite, poses numerous challenges. While previous studies have shown promising results, there is a lack of information on the design and development process for the application of ML in real-time control systems. This paper presents the implementation of a Deep Reinforcement Learning (DRL) model for a magnetic-based ACS of the UPMSat-2 satellite. The primary objective is not only to design, implement, and validate an RL agent, but also to provide some insights and criteria of the decision-making process to achieve an adequate model. The system was trained and validated on a simulation model with positive results. To further validate non-functional requirements, the resulting trained agent was tested on a real-time embedded system according to safety standards. The obtained quantitative metrics and performance results show the ability of the agent to maintain the satellite's attitude across various operational phases, leveraging its adaptability to dynamic conditions.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Comunidad de Madrid
Y2020/NMT-6427
OAPES
Ángel Sanz Andrés
Operación Avanzada de Pequeños Satélites (OAPES)
Gobierno de España
PID2021-124502OB-C43
PRESECREL
Alejandro Antonio Alonso Muñoz
Modelos y plataformas para sistemas informáticos industriales predecibles, seguros y confiables

Más información

ID de Registro: 90311
Identificador DC: https://oa.upm.es/90311/
Identificador OAI: oai:oa.upm.es:90311
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/10382872
Identificador DOI: 10.1016/j.sysarc.2025.103513
URL Oficial: https://www.sciencedirect.com/science/article/pii/...
Depositado por: Ángel Grover Pérez Muñoz
Depositado el: 31 Jul 2025 19:45
Ultima Modificación: 15 Oct 2025 01:01