Skip to main navigation Skip to search Skip to main content

LSC-54: A Landmark-Based Dataset for Colombian Sign Language

  • Juan Esteban Mora Zarate (masterstudent)
  • , Claudia Lorena Garzon Castro (Correspondent Author)
  • , Jorge Alberto Castellanos Rivillas (Third Author)
  • Universidad de la Sabana

Research output: Contribution to journalArticlepeer-review

Abstract

According to the World Health Organization, hearing disabilities are increasing globally, highlighting the need to support the deaf community. In countries such as Colombia, access to sign language education remains limited. Recent advances in Computer Vision have proven effective in promoting sign language learning through real-time interaction. In response to this need, a time-series dataset was developed to represent signs from Colombian Sign Language (CSL), including 11 colors, 33 courtesy phrases, and the numbers 1–10. This dataset served as input for the development of a technological tool aimed at supporting the teaching of CSL vocabulary. It includes both static and dynamic signs, represented as 3D coordinate data of the face, torso and hands. The dataset was built using recordings from 22 participants, each performing sign between three to five times. To increase data diversity and improve model performance, the dataset was augmented by a factor of 45. The data acquisition process involved: A user interface for recording sign language videos; post-processing (extracting coordinates via MediaPipe, and imputing missing hand data), and storing processed data in JSON files.
Original languageSpanish (Colombia)
Pages (from-to)1-18
Number of pages18
JournalData in Brief
Volume63
Issue number112145
DOIs
StatePublished - 8 Oct 2025

Strategic Focuses

  • Sociedad Digital y Competitividad​ (SocietalIA)

Article Classification

  • Full research article

Indexación Internacional (Artículo)

  • ISI Y SCOPUS

Scopus-Q Quartil

  • Q2

ISI- Q Quartil

  • Q3

Categoría Publindex

  • B

Cite this