This study aims to examine the linguistic characteristics of Old Acehnese manuscripts through a corpus-based discourse approach and to propose a digital repository model for manuscript preservation. Acehnese manuscripts constitute a significant intellectual heritage reflecting linguistic, cultural, and religious traditions; however, systematic corpus-based linguistic analyses remain limited. This research employed a qualitative descriptive design integrating corpus linguistics and discourse analysis. The data were derived from selected manuscripts written in Jawi script, including religious, historical, and advisory texts. Data collection involved documentation, transcription, normalization, and corpus compilation. The analysis focused on lexical frequency, collocation patterns, stylistic features, and cohesive devices. The findings indicate that the manuscripts are predominantly characterized by religious lexical items, frequent Arabic loanwords, formulaic expressions, and repetitive rhetorical structures that reinforce moral and didactic messages. The discourse is largely persuasive, instructional, and narrative, highlighting the pedagogical role of manuscripts in traditional Acehnese society. In addition, this study proposes a digital repository framework integrating linguistic annotation, structured metadata, and searchable corpus features to enhance preservation and accessibility. This study contributes to manuscript studies by offering a systematic corpus-based linguistic description and a sustainable model for digital preservation.
Maulana et al. (Thu,) studied this question.