Key points are not available for this paper at this time.
Despite the success of large language models (LLMs) in Text-to-SQL tasks, open-source LLMs encounter challenges in contextual understanding and response coherence. To tackle these issues, we present, a systematic methodology tailored for Text-to-SQL with open-source LLMs. Our contributions include a comprehensive evaluation of open-source LLMs in Text-to-SQL tasks, the strategy for effective question representation, and novel strategies for supervised fine-tuning. We explore the benefits of Chain-of-Thought in step-by-step inference and propose the method for enhanced few-shot learning. Additionally, we introduce token-efficient techniques, such as Variable-length Open DB Schema, Target Column Truncation, and Example Column Truncation, addressing challenges in large-scale databases. Our findings emphasize the need for further investigation into the impact of supervised fine-tuning on contextual learning capabilities. Remarkably, our method significantly improved Llama2-7B from 2. 54\% to 41. 04\% and Code Llama-7B from 14. 54\% to 48. 24\% on the BIRD-Dev dataset. Notably, the performance of Code Llama-7B surpassed GPT-4 (46. 35\%) on the BIRD-Dev dataset.
Building similarity graph...
Analyzing shared references across papers
Loading...
Chen et al. (Sat,) studied this question.
www.synapsesocial.com/papers/68e6b925b6db643587639f64 — DOI: https://doi.org/10.48550/arxiv.2405.06674
Xiaojun Chen
Tianle Wang
Tianhao Qiu
Building similarity graph...
Analyzing shared references across papers
Loading...
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: