Enhancing ASL Recognition with GCNs and Successive Residual Connections
Abstract
This study presents a novel approach for enhancing American Sign Language (ASL) recognition using Graph Convolutional Networks (GCNs) integrated with successive residual connections. The method leverages the MediaPipe framework to extract key landmarks from each hand gesture, which are then used to construct graph representations. A robust preprocessing pipeline, including translational and scale normalization techniques, ensures consistency across the dataset. The constructed graphs are fed into a GCN-based neural architecture with residual connections to improve network stability. The architecture achieves state-of-the-art results, demonstrating superior generalization capabilities with a validation accuracy of 99.14%.
- Publication:
-
arXiv e-prints
- Pub Date:
- August 2024
- DOI:
- arXiv:
- arXiv:2408.09567
- Bibcode:
- 2024arXiv240809567S
- Keywords:
-
- Computer Science - Computer Vision and Pattern Recognition
- E-Print:
- To be submitted in G2-SP CV 2024. Contains 7 pages, 5 figures