Abstract Natural language text-to-SQL generation (Text2SQL) aims to translate natural language questions into executable SQL queries. Although the emergence of large language models (LLMs) has led to significant advancements in this field, their performance degrades sharply with question complexity increases. A key limitation of current LLM-based methods lies in their uniform generation strategies, which fail to adapt dynamically to varying question complexity. To address this issue, we propose TriSQL, a novel three-stage framework designed to analyze question complexity and generate accurate and executable SQL. First, a Question-Guided Schema Selector is conceived to get the most relevant schema to the question using cross attention. Second, a Structure-Aware SQL Generator takes both the question and the selected schema as input, employing hierarchical decoding to generate a syntactically valid initial SQL. Finally, a Complexity-Aware SQL Refiner is designed with LLM to dynamically adjust strategies corresponding to the complexity of question and initial SQL, ensuring that the final generated SQL is both accurate and executable. Experimental results on the Spider benchmark and its variants show that TriSQL achieves state-of-the-art execution accuracy, surpasses existing LLM-based methods, and provides both high efficiency and strong robustness.
Building similarity graph...
Analyzing shared references across papers
Loading...
Xiaodong Su
Yang Gu
Peng Wang
Scientific Reports
Shanghai Electric (China)
China Electric Equipment Group (China)
Building similarity graph...
Analyzing shared references across papers
Loading...
Su et al. (Mon,) studied this question.
www.synapsesocial.com/papers/698c1d1d267fb587c655faa1 — DOI: https://doi.org/10.1038/s41598-026-39128-9