UBC Undergraduate Research

Descript-SQL : Enriched Database Descriptions in Text-to-SQL Stenback, Soren

Abstract

Text-to-SQL research has developed over several years to make use of Large Language Models (LLMs). One avenue of research is to create models that prompt an existing LLM such as ChatGPT and provide specific context that will assist the LLM in generating correct SQL queries based on a provided database schema and a text question. However, there are certain functions or techniques that do not assist the LLM and reduce the accuracy of the model. Researchers must determine which techniques are beneficial and which are detrimental to improving an LLM for text-to-SQL. The focus of this thesis is to assess an existing model called E-SQL and attempt to apply improvements through generative database descriptions to increase the accuracy of the model. Additional discoveries through testing are presented.

Item Citations and Data

Rights

Attribution-NonCommercial-NoDerivatives 4.0 International