Analysing Foreground Segmentation in Deep Learning Based Depth Estimation on Free-Viewpoint Video Systems

Usón Peirón, Javier ORCID: https://orcid.org/0009-0003-8110-1443, Cabrera Quesada, Julian ORCID: https://orcid.org/0000-0002-7154-2451, Corregidor Luna, Daniel ORCID: https://orcid.org/0000-0003-0994-377X and García Santos, Narciso ORCID: https://orcid.org/0000-0002-0397-894X (2022). Analysing Foreground Segmentation in Deep Learning Based Depth Estimation on Free-Viewpoint Video Systems. En: "2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)", 02-06 September 2022. pp. 1-5. https://doi.org/10.1109/ICCE-Berlin56473.2022.9937087.

Descripción

Título: Analysing Foreground Segmentation in Deep Learning Based Depth Estimation on Free-Viewpoint Video Systems
Autor/es:
Tipo de Documento: Ponencia en Congreso o Jornada (Artículo)
Título del Evento: 2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)
Fechas del Evento: 02-06 September 2022
Título del Libro: 2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)
Fecha: 11 Noviembre 2022
Materias:
ODS:
Palabras Clave Informales: Deep learning Training Image segmentation Estimation Color Media Consumer electronics Foreground segmentation deep learning stereo dataset depth multiview video free viewpoint video
Escuela: E.T.S.I. Telecomunicación (UPM)
Departamento: Señales, Sistemas y Radiocomunicaciones
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[thumbnail of ICCE_Berlin_2022.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (16MB)

Resumen

Volumetric video acquisition systems enable realistic virtual experiences such as Free-Viewpoint Video (FVV). Stereo matching is a well known way of obtaining this volumetric information as depth images, calculating the disparity between two stereo color images. On these applications, the background of the scene captured is static and does not change, so foreground information is much more valuable. We propose adding foreground segmentation to help learning based algorithms, such as deep learning models, improve results previously obtained. We utilized the framework Detectron2 to model foreground segmentation by detecting people. Additionally, we built a large stereo dataset focused on FVV systems. Finally, we modified a successful deep learning model from the state-of-the-art, CREStereo, to add foreground segmentation and performed supervised training on it to estimate disparity, obtaining promising results.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Horizonte 2020
957102
5G-RECORDS
Sin especificar
5G key technology enableRs for Emerg- ing media COntent pRoDuction Services
Comunidad de Madrid
PID2020-115132RB
SARAOS
Sin especificar
ShAred Reality for Advanced sOcial communicationS

Más información

ID de Registro: 90525
Identificador DC: https://oa.upm.es/90525/
Identificador OAI: oai:oa.upm.es:90525
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/9994394
Identificador DOI: 10.1109/ICCE-Berlin56473.2022.9937087
URL Oficial: https://ieeexplore.ieee.org/document/9937087
Depositado por: Javier Usón Peirón
Depositado el: 04 Sep 2025 05:55
Ultima Modificación: 04 Sep 2025 05:55