March 3, 2026Open Access

Santali Trilingual Annotated Corpus: Data Paper

Key Points

The Santali trilingual linguistic dataset provides extensive resources for language research.
It includes a rich structure aimed at facilitating linguistic studies and applications.
Data collection involved systematic approaches to gather diverse language examples and annotations.
This dataset highlights the potential for reuse across various linguistic and educational projects.

Abstract

This project contains the data paper describing the structure, collection, and reuse potential of a Santali trilingual linguistic dataset developed by Language Resource Hub.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Samar Michael Soren (Thu,) studied this question.

synapsesocial.com/papers/69a75cd5c6e9836116a2604a https://doi.org/https://doi.org/10.17605/osf.io/7c5kf

Bookmark

View Full Paper