Enhancing Event-Based Vision for Logistics: Data Processing, Prediction Models Analisys and 3D Dataset Creation

Sánchez Villaluenga, Tómas ORCID: https://orcid.org/0009-0004-0287-6572 (2025). Enhancing Event-Based Vision for Logistics: Data Processing, Prediction Models Analisys and 3D Dataset Creation. Tesis (Master), E.T.S.I. Industriales (UPM).

Descripción

Título: Enhancing Event-Based Vision for Logistics: Data Processing, Prediction Models Analisys and 3D Dataset Creation
Autor/es:
Director/es:
Tipo de Documento: Tesis (Master)
Título del máster: Automática y Robótica
Fecha: 12 Junio 2025
Materias:
ODS:
Palabras Clave Informales: Event-based vision, Semi-supervised learning, Self-supervised learning, Object detection, Smart logistics, Pseudolabels, 3D reconstruction
Escuela: E.T.S.I. Industriales (UPM)
Departamento: Automática, Ingeniería Eléctrica y Electrónica e Informática Industrial
Grupo Investigación UPM: Visión por Computador y Robótica Aérea (CVAR)
Licencias Creative Commons: Ninguna

Texto completo

[thumbnail of TFM_TOMAS_SANCHEZ.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (2MB)

Resumen

Event-based vision offers advantages in dynamic environments due to its asynchronous data acquisition and low-latency processing. This thesis presents an in-depth analysis of the Label-Efficient Object Detection (LEOD) algorithm, evaluating its performance on a custom logistics dataset. To ensure compatibility, LEOD has been modified, addressing inherited inefficiencies and optimizing its handling of event-based data in logistics applications.

A critical part of this work involves the automation of data preprocessing and annotation, streamlining dataset creation, label assignment, and event representation. These enhancements reduce manual intervention and improve reproducibility, while optimizations in data storage cut annotation space requirements by 90%, making large-scale dataset handling more efficient.

Experimental results confirm that LEOD meets expectations in semi-supervised learning and outperforms previous benchmarks in Weakly-Supervised Object Detection (WSOD), achieving higher detection precision than the Gen1 and 1Mpx datasets across all training sizes.

Additionally, this research develops a 3D dataset using Neural Radiance Fields (NeRF) technology, reconstructing environments from RGB data to extend event-based perception into three dimensions. Future research will focus on integrating LEOD with 3D event-based perception and improving segmentation techniques for enhanced detection precision.

Más información

ID de Registro: 89413
Identificador DC: https://oa.upm.es/89413/
Identificador OAI: oai:oa.upm.es:89413
Depositado por: Tomas Sanchez Villaluenga
Depositado el: 24 Jul 2025 06:25
Ultima Modificación: 08 Sep 2025 10:09