What question did this study set out to answer?

To develop a real-time American Sign Language recognition system that enhances communication for deaf individuals.

February 14, 2026Open Access

Intelligent Sign Language Interpretation System Using Multi- Modal Deep Learning Architectures

Key Points

To develop a real-time American Sign Language recognition system that enhances communication for deaf individuals.
Employed a webcam for real-time ASL recognition.
Utilized an ensemble of deep learning models including CNN, GNN, and Vision Transformer.
Trained on a dataset of approximately 87,000 labeled images of ASL gestures.
Achieved recognition accuracy over 95%.
Demonstrated an average inference time of 85 milliseconds per gesture.
Outperformed existing ASL recognition methods.

Abstract

This project presents a real-time American Sign Language (ASL) recognition system using a standard webcam. Communication between deaf or hard-of-hearing individuals and the hearing community is often limited by the high cost and limited availability of professional interpreters. To address this, the proposed system employs an ensemble deep-learning approach that combines a Convolutional Neural Network (CNN) for hand shape recognition, a Graph Neural Network (GNN) to capture finger and joint relationships, and a Vision Transformer to focus on key visual regions while minimizing background noise. By fusing these complementary models, the system achieves enhanced recognition accuracy. The framework was trained and evaluated on a dataset of approximately 87,000 labeled images covering the complete ASL alphabet along with additional gestures such as space and delete. Experimental results demonstrate an accuracy exceeding 95%, outperforming existing methods. The system supports real-time interaction with an average inference time of about 85 milliseconds per gesture. It is deployed through a browser-based interface and requires no specialized hardware beyond a standard webcam. This solution provides an accessible, low-cost alternative to traditional interpretation services and promotes inclusive communication across educational, healthcare, and public environments.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Samruddhi Vijay Wakalkar

Sanskruti Vijay Wakalkar

Siddhi Nanasaheb Hon

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Intelligent Sign Language Interpretation System Using Multi- Modal Deep Learning Architectures

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study