What type of study is this?

This is a Quantitative Study study.

September 16, 2025

Methods and Algorithms for Dependency Parse of Texts of the National Uzbekistan Corps

Key Points

The biaffine graph-based parser achieved an F1 score of 91.8%, outperforming existing methods.
A dataset of 20,000 sentences was developed, leading to a 3% increase in detecting parallel semantic roles.
Comparative analysis included transition-based, graph-based, and transformer-based parser models.
Improvements in compound sentence accuracy by 27% highlight the need for modern neural parsing techniques.

Abstract

This article investigates methods for automatic syntactic analysis of texts in the Uzbek national corpus based on dependency parsing. First, the agglutinative properties of the Uzbek language and the influence of free word order on dependency trees are considered theoretically. Then, various parsers are compared - from traditional transition-based models to graph-based and transformer architectures. For experiments, a tagged dataset of 20, 000 sentences was created from the Universal Dependencies (UDUzbek) tag system and the updated national corpus. The biaffine graph-based parser based on the UzbBERT and XLM-R models showed a result of F1=91. 8%, outperforming existing approaches (F1≈88%) and the UDPipe base model (F1≈85%). In particular, the developed model reduced errors in compound sentences by 27%. The evaluation set included more than 5000 sentences, and the average reduction in dependency length was 0. 42 tokens, which led to a 3% F1 increase in the detection of parallel semantic roles. These results confirm the effectiveness of the transition to modern neural parsers that are adapted to the morphological complexity and free syntax of the Uzbek language.

Bookmark

Methods and Algorithms for Dependency Parse of Texts of the National Uzbekistan Corps

Key Points

Abstract

Cite This Study

Also Consider

Also Consider