What type of study is this?

This is a Quantitative Study study.

October 18, 2025Open Access

Enhancing text-to-SQL capabilities of small language models via schema context enrichment and self-correction

Key Points

An improved pipeline raises execution accuracy to 86.2%, exceeding previous performance benchmarks.
Phase 1 involves schema enrichment using Louvain community detection, enhancing model capabilities significantly.
Phase 2 implements self-correction through an iterative feedback loop, refining generated SQL queries effectively.
The approach highlights practical applications of resource-efficient open-source small language models in real-world SQL tasks.

Abstract

Translating natural language into SQL is essential for intuitive database access, yet open-source small language models (SLMs) still lag behind larger systems when faced with complex schemas and tight context windows. This paper introduces a two-phase workflow designed to enhance the Text-to-SQL capabilities of SLMs. Phase 1 (offline) transforms the database schema into a graph, partitions it with Louvain community detection, and enriches each component in a cluster with metadata, relationships, and sample rows. Phase 2 (at runtime) selects the relevant tables, generates SQL queries, and iteratively refines the SQL through an execution-driven feedback loop until the query executes successfully. Evaluated on the Spider test set, our pipeline raises Qwen-2.5-Coder-14B to 86.2% Execution Accuracy (EX), surpassing its zero-shot baseline and outperforming all contemporary SLM + ICL approaches and narrowing the gap to GPT-4-based systems all while running on consumer-grade hardware. Ablation studies confirm that both schema enrichment and self-correction contribute significantly to the improvement. The study concludes that this workflow provides a practical methodology for deploying resource-efficient open-source SLMs in Text-to-SQL applications, effectively mitigating common challenges. An open-source implementation is released to support further research.

Read Full Paperexternally

AIに質問

Bookmark

View Full Paper

Cite This Study

Kiet et al. (Thu,) studied this question.

synapsesocial.com/papers/68f3eb011cfc5ad53f290961 https://doi.org/https://doi.org/10.22144/ctujoisd.2025.058

AIに質問

Bookmark

View Full Paper