What question did this study set out to answer?

The aim is to enhance Korean part-of-speech tagging through a novel transformer-based model integrating word embeddings.

April 17, 2026Open Access

Word Embeddings Network and Transformer Based Part of Speech Tagging for Korean

PYPong-Gol YouKorean Academy of Science and Technology CSChun-Sik SoKorean Academy of Science and Technology SCSong-Min ChoeKorean Academy of Science and Technology

Key Points

The aim is to enhance Korean part-of-speech tagging through a novel transformer-based model integrating word embeddings.
Developed a transformer-based POS tagging model.
Designed a sequence of pairs of morphemes and their POS tags for output.
Trained a convolution and highway network for character-level word representations.
Integrated word representation with transformer encoder output for better semantic information.
Achieved a POS tagging accuracy of 96.1% on Korean text.
Improved performance by 1.4% compared to models without word representation.
Outperformed all compared models, including the BiLSTM+CRF model.

Abstract

Korean Part-of-Speech (POS) tagging is different from and more difficult than other languages such as English, Russian and Chinese due to raising issues of Korean word segmentation and analysis of sound-changed morphemes. In this paper we propose a transformer-based Korean POS tagging model, which combines the output vector of a encoder of the transformer with a representational vector of the input word obtained from character-level word embeddings network unlike existing deep learning-based POS tagging models based on BiLSTM. First, in order to perform segmentation of words and changed sound analysis at once, we have designed a model to make a new output sequence of the POS tagging model as a sequence of pairs of strings of morphemes and its POS tags. Second, in order to obtain character-level word representations, word embedding network employing convolution network and highway network are trained. Finally, to make more efficient use of the semantic information of the input word in generating of sequences of POS tagging, we combined the word representation vector obtained from the word-embedding generation network with the output of a encoder of the transformer. According to the experimental results, the proposed model achieves 1.4% performance improvement over the model without incorporating the word representation vector obtained from the word embeddings network, and as a result, the POS tagging accuracy is 96.1%, which is superior to all other compared models including the BiLSTM+CRF model.

Ask AI

Helpful

Bookmark

View Full Paper