Isolated German sign language recognition for classifying polar answers using landmarks and lightweight transformers

Luna Jiménez, Cristina ORCID: https://orcid.org/0000-0001-5369-856X, Eing, Lenart ORCID: https://orcid.org/0009-0005-8855-7659, Esteban Romero, Sergio ORCID: https://orcid.org/0009-0008-6336-7877, Gil Martin, Manuel ORCID: https://orcid.org/0000-0002-4285-6224 and Andre, Elizabeth ORCID: https://orcid.org/0000-0002-2367-162X (2025). Isolated German sign language recognition for classifying polar answers using landmarks and lightweight transformers. "Applied Sciences", v. 15 (n. 21); p. 11571. ISSN 1454-5101. https://doi.org/10.3390/app152111571.

Descripción

Título: Isolated German sign language recognition for classifying polar answers using landmarks and lightweight transformers
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: Applied Sciences
Fecha: 29 Octubre 2025
ISSN: 1454-5101
Volumen: 15
Número: 21
Materias:
ODS:
Palabras Clave Informales: Isolated sign language recognition; transformers; machine learning; human-computer interaction; accessibility
Escuela: E.T.S.I. Telecomunicación (UPM)
Departamento: Ingeniería Electrónica
Licencias Creative Commons: Reconocimiento

Texto completo

[thumbnail of 10400388.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (4MB)

Resumen

Sign Languages are the primary communication modality of deaf communities, yet building effective Isolated Sign Language Recognition (ISLR) systems remains difficult under data limitations. In this work, we curated a sub-dataset from the DGS-Korpus focused on recognizing affirmations and negations (polar answers) in German Sign Language (DGS). We designed lightweight transformer models using landmark-based inputs and evaluated them on two tasks: the binary classification of affirmations versus negations (binary semantic recognition) and the multi-class recognition of sign variations expressing positive or negative replies (multi-class gloss recognition). The main contribution of the article, hence, relies on the exploration of models for performing polar answer recognition in DGS and the exploration of differences between performing multi-class or binary class classification. Our best binary model achieved an accuracy of 97.71% using only hand landmarks without Positional Encoding, highlighting the potential of lightweight landmark-based transformers for efficient ISLR in constrained domains.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Horizonte Europa
101071191
ASTOUND
Sin especificar
Improving social competences of virtual agents through artificial consciousness based on the Attention Schema Theory
Gobierno de España
PID2021-126061OB-C43
BeWord
Sin especificar
Descubriendo el significado y la intención más allá de la palabra hablada: hacia un entorno inteligente para abordar los documentos multimedia
Gobierno de España
PID2020-118112RB-C22
GOMINOLA
Sin especificar
Agentes conversacionales sensibles a usuario, adaptativos y socio-afectivos basados en microservicios

Más información

ID de Registro: 91700
Identificador DC: https://oa.upm.es/91700/
Identificador OAI: oai:oa.upm.es:91700
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/10400388
Identificador DOI: 10.3390/app152111571
URL Oficial: https://www.mdpi.com/2076-3417/15/21/11571
Depositado por: iMarina Portal Científico
Depositado el: 31 Oct 2025 12:35
Ultima Modificación: 21 Ene 2026 13:02