What question did this study set out to answer?

This research aims to explore ways to reduce hallucinations in small language models deployed on edge devices, particularly in tax law environments.

May 25, 2026Open Access

Reducing Hallucinations in Edge-Deployed Small Language Models through Epistemic Scaffolding and INT4 Quantization in Tax-Law Environments

Key Points

This research aims to explore ways to reduce hallucinations in small language models deployed on edge devices, particularly in tax law environments.
Utilized epistemic scaffolding to organize legal knowledge into a structured ontology.
Fine-tuned the Qwen2.5-1.5B model via LoRA and implemented INT4 quantization for efficient inference.
Investigated the impact of structural pruning on model performance.
INT4 quantization improved groundedness by 38.8% compared to FP16 quantization.
Structural pruning led to significant performance degradation, indicating its negative impact on model reliability.
Epistemic knowledge organization was identified as crucial for enhancing trustworthiness in AI applications related to legal domains.

Abstract

EN This research essay investigates the reduction of hallucinations in compact Small Language Models (SLMs) deployed entirely on edge devices in tax-law environments. It introduces Epistemic Scaffolding, an ontology-guided knowledge architecture organizing legal knowledge across three layers: formal ontology, operational heuristics, and user-centered phenomenology. The Qwen2.5-1.5B model was fine-tuned via LoRA and quantized to INT4 using MLC-LLM pipelines, enabling browser-based inference via WebGPU without cloud data transmission. A key finding was that INT4 quantization acted as an implicit semantic regularization mechanism, improving groundedness (+38.8%) over FP16, while structural pruning caused catastrophic degradation. The work suggests that epistemological knowledge organization during training may be as consequential as parameter scale for trustworthy edge AI in legally sensitive domains.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Barros et al. (Sat,) studied this question.

synapsesocial.com/papers/6a13e8680e02ee3982d3336e https://doi.org/https://doi.org/10.5281/zenodo.20356916

Bookmark

View Full Paper