The Santali trilingual linguistic dataset is a resource for language research.
Its structure is designed to support linguistic studies and applications.
The data were collected systematically to cover diverse language examples and annotations.
The dataset can be reused in linguistic and educational projects.
Abstract
This project contains the data paper describing the structure, collection, and reuse potential of a Santali trilingual linguistic dataset developed by Language Resource Hub.