Key points are not available for this paper at this time.
This article is devoted to the creation of a speech recognition system in the Uzbek language based on the analysis of existing approaches to speech recognition and the selection of the preferred approach, which provides information on the End2End approach and its effective use. In addition, several manifestations of transformer neural networks have been analyzed. Based on the results of the analysis, high quality and efficiency were used in speech recognition in Uzbek. The results obtained showed that transformer-based models were superior to other widely used models in terms of training time and accuracy.
Mamatov et al. (Wed,) studied this question.