Texto completo
|
PDF (Portable Document Format)
- Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (16MB) |
ORCID: https://orcid.org/0009-0003-8110-1443, Cabrera Quesada, Julian
ORCID: https://orcid.org/0000-0002-7154-2451, Corregidor Luna, Daniel
ORCID: https://orcid.org/0000-0003-0994-377X and García Santos, Narciso
ORCID: https://orcid.org/0000-0002-0397-894X
(2022).
Analysing Foreground Segmentation in Deep Learning Based Depth Estimation on Free-Viewpoint Video Systems.
En: "2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)", 02-06 September 2022. pp. 1-5.
https://doi.org/10.1109/ICCE-Berlin56473.2022.9937087.
| Título: | Analysing Foreground Segmentation in Deep Learning Based Depth Estimation on Free-Viewpoint Video Systems |
|---|---|
| Autor/es: |
|
| Tipo de Documento: | Ponencia en Congreso o Jornada (Artículo) |
| Título del Evento: | 2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin) |
| Fechas del Evento: | 02-06 September 2022 |
| Título del Libro: | 2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin) |
| Fecha: | 11 Noviembre 2022 |
| Materias: | |
| ODS: | |
| Palabras Clave Informales: | Deep learning Training Image segmentation Estimation Color Media Consumer electronics Foreground segmentation deep learning stereo dataset depth multiview video free viewpoint video |
| Escuela: | E.T.S.I. Telecomunicación (UPM) |
| Departamento: | Señales, Sistemas y Radiocomunicaciones |
| Licencias Creative Commons: | Reconocimiento - Sin obra derivada - No comercial |
|
PDF (Portable Document Format)
- Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (16MB) |
Volumetric video acquisition systems enable realistic virtual experiences such as Free-Viewpoint Video (FVV). Stereo matching is a well known way of obtaining this volumetric information as depth images, calculating the disparity between two stereo color images. On these applications, the background of the scene captured is static and does not change, so foreground information is much more valuable. We propose adding foreground segmentation to help learning based algorithms, such as deep learning models, improve results previously obtained. We utilized the framework Detectron2 to model foreground segmentation by detecting people. Additionally, we built a large stereo dataset focused on FVV systems. Finally, we modified a successful deep learning model from the state-of-the-art, CREStereo, to add foreground segmentation and performed supervised training on it to estimate disparity, obtaining promising results.
| ID de Registro: | 90525 |
|---|---|
| Identificador DC: | https://oa.upm.es/90525/ |
| Identificador OAI: | oai:oa.upm.es:90525 |
| URL Portal Científico: | https://portalcientifico.upm.es/es/ipublic/item/9994394 |
| Identificador DOI: | 10.1109/ICCE-Berlin56473.2022.9937087 |
| URL Oficial: | https://ieeexplore.ieee.org/document/9937087 |
| Depositado por: | Javier Usón Peirón |
| Depositado el: | 04 Sep 2025 05:55 |
| Ultima Modificación: | 04 Sep 2025 05:55 |
Publicar en el Archivo Digital desde el Portal Científico