Speech is the most common way people talk to each other, but some people have trouble saying or hearing. In this study, a deep learning-based model is proposed that can figure out what words a person is trying to say from the way they move their hands. Deep learning models, like LSTM and GRU (feedback-based learning models), are used to figure out what signs are in Indian sign language (ISL) film clips that are not connected to each other. This study shows how machine learning methods can be used for real-time motion recognition in a wide range of human-computer interfaces. Experiments showed that the system could recognise hand postures with 99.4% accuracy and active gestures with an average accuracy of 93.72%. For datasets with easy backgrounds, the accuracy is almost 99%, for datasets with complicated backgrounds it is 92%, and for the video dataset it is 84%.
Champaneria et al. (Thu,) studied this question.