LSA-T: The First Continuous Argentinian Sign Language Dataset for Sign Language Translation

LSA-T: The First Continuous Argentinian Sign Language Dataset for Sign Language Translation


Conference paper
Computer VisionDatasetDeep LearningLSA-TSign Language Translation
Authors: Pedro Dal Bianco, Gastón Ríos, Franco Ronchetti, Facundo Quiroga, Oscar Stanchi, Waldo Hasperué, Alejandro Rosete

Abstract

Sign language translation (SLT) is an active field of study that encompasses human-computer interaction, computer vision, natural language processing and machine learning. Progress on this field could lead to higher levels of integration of deaf people. This paper presents, to the best of our knowledge, the first continuous Argentinian Sign Language (LSA) dataset. It contains 14,880 sentence level videos of LSA extracted from the CN Sordos YouTube channel with labels and keypoints annotations for each signer. We also present a method for inferring the active signer, a detailed analysis of the characteristics of the dataset, a visualization tool to explore the dataset and a neural SLT model to serve as baseline for future experiments.

Citation

@inproceedings{dal2022lsa,
    title={Lsa-t: The first continuous argentinian sign language dataset for sign language translation},
    author={Dal Bianco, Pedro and R{\'\i}os, Gast{\'o}n and Ronchetti, Franco and Quiroga, Facundo and Stanchi, Oscar and Hasperu{\'e}, Waldo and Rosete, Alejandro},
    booktitle={Ibero-American Conference on Artificial Intelligence},
    pages={293--304},
    year={2022},
    organization={Springer}
}
© 2024 Oscar Stanchi