Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning

Gu, Zihui; Fan, Ju; Tang, Nan; Cao, Lei; Jia, Bowen; Madden, Sam; Du, Xiaoyong

Published in

Proceedings of the ACM on Management of Data, 2(1), p. 1-28, 2023

DOI: 10.1145/3589292

Tools

Export citation

Search in Google Scholar

Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning

Journal article published in 2023 by Zihui Gu

, Ju Fan

, Nan Tang

, Lei Cao

, Bowen Jia

, Sam Madden

, Xiaoyong Du

This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

A common problem with adopting Text-to-SQL translation in database systems is poor generalization. Specifically, when there is limited training data on new datasets, existing few-shot Text-to-SQL techniques, even with carefully designed textual prompts on pre-trained language models (PLMs), tend to be ineffective. In this paper, we present a divide-and-conquer framework to better support few-shot Text-to-SQL translation, which divides Text-to-SQL translation into two stages (or sub-tasks), such that each sub-task is simpler to be tackled. The first stage, called the structure stage, steers a PLM to generate an SQL structure (including SQL commands such as SELECT, FROM, WHERE and SQL operators such as <", ?>") with placeholders for missing identifiers. The second stage, called the content stage, guides a PLM to populate the placeholders in the generated SQL structure with concrete values (including SQL identifies such as table names, column names, and constant values). We propose a hybrid prompt strategy that combines learnable vectors and fixed vectors (i.e., word embeddings of textual prompts), such that the hybrid prompt can learn contextual information to better guide PLMs for prediction in both stages. In addition, we design keyword constrained decoding to ensure the validity of generated SQL structures, and structure guided decoding to guarantee the model to fill correct content. Extensive experiments, by comparing with ten state-of-the-art Text-to-SQL solutions at the time of writing, show that SC-Prompt significantly outperforms them in the few-shot scenario. In particular, on the widely-adopted Spider dataset, given less than 500 labeled training examples (5% of the official training set), SC-Prompt outperforms the previous SOTA methods by around 5% on accuracy.

Published in

Links

Tools

Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning

Abstract