What question did this study set out to answer?

This study aims to develop a web-based tool for authorship analysis that balances accessibility and methodological rigor.

June 12, 2026Open Access

Bridging Accessibility and Rigour: A Web App for Authorship Analysis Incorporating KWIC Exploration and Shared N-Gram Profiling

Key Points

This study aims to develop a web-based tool for authorship analysis that balances accessibility and methodological rigor.
Designed a web app integrating KWIC exploration and n-gram analysis for forensic linguistics.
Enabled automated identification of shared bigrams and trigrams in textual datasets.
Addressed challenges in forensic semiotics through data-driven analysis and clear documentation.
The tool effectively visualizes key textual patterns, supporting both confirmatory and exploratory linguistic analyses.
Demonstrated significant improvements in accessibility for non-technical users in forensic settings.
Provided clear documentation for courtroom admissibility, enhancing the evidentiary validity of analyses.

Abstract

Abstract The increasing volume of digital linguistic evidence and the expansion of virtual communicative spaces demand forensic linguistic tools that are both methodologically rigorous and accessible to a wider range of users, including legal practitioners. This paper introduces a bespoke, web-based authorship analysis tool designed to identify and visualize key textual patterns across questioned and known datasets. Operating through an intuitive graphical user interface, the tool offers an accessible alternative to command-line methods, expanding the reach of computational forensic linguistics to non-technical users. The tool integrates two core functionalities: (1) automated identification and comparison of shared n-grams, particularly bigrams and trigrams, within and between datasets, and (2) Key Word in Context (KWIC) exploration to support the interpretation of overlapping or distinctive lexical patterns. By combining these capabilities, the system enables both confirmatory and exploratory analysis of linguistic evidence, supporting investigations where authorship is contested or where stylistic consistency must be assessed. It has applications across the three domains of forensic linguistics: language as evidence, interaction in legal settings, and language of the law. In addition to its methodological contributions, the paper addresses key challenges in forensic semiotics: subjectivity, reproducibility, and transparency. It shows how computational approaches can mitigate the “subjectivity effect” by offering replicable, data-driven analyses of linguistic behaviour. Courtroom admissibility is also supported by producing clear, step-by-step analytical records and exportable logs that document each analytic operation. Our contribution aligns with the aims of computational forensic linguistics to enhance the interpretive rigour and evidentiary validity of linguistic analyses.

Bookmark

View Full Paper

Bookmark

View Full Paper

Bridging Accessibility and Rigour: A Web App for Authorship Analysis Incorporating KWIC Exploration and Shared N-Gram Profiling

Key Points

Abstract

Cite This Study