Natural language when passing in prompt getting converted to SQL

Srishti_Nagu · April 28, 2023, 1:25pm

I am converting natural language to SQL for my database using text to SQL chain. Further I am also finding out a summary of the data (retrieved from the first chain) using fact extraction chain. I have a few issues in the output summary:

The summary doesn’t come out to be decent for all queries.
For example, data- [{fruits: apples, sale:200}, {fruits:mango,sale:350}, {fruits:banana, sale:150}]. Summary- Apples have the highest sales followed by mangoes and then banana- which is incorrect.
My prompt consists of input data and query (in natural language). While passing the prompt for getting summary- the query in natural language is getting converted into SQL and then passed into prompt- which is puzzling me!
For example-

Prompt template-
template=“”"
Your task is to write a summary based on the
information provided in the data delimited by triple backticks following the
steps below-

Analyse the input data.
Extract key facts out of the input data.
Do not add names and figures that are not present in the data.
Do not write numbers in scientific notation or exponents or any other special symbols.
Use at most 25 words.
Do not add any prefix to the output. For example- do not write the output as Summary: or Answer:

Data: {text_input}
“”")

Please suggest edits to the prompt to improve factual accuracy and how to eliminate the second problem. Thank you!

DEEPANSHU_MEHTA · August 16, 2023, 5:32pm

Hey Srishti,

I see that you are trying to design a prompt for the text to SQL generation task. Have you also considered fine tuning the model. For e.g. you can use dataset provided in this paper to fine tune the model - https://arxiv.org/pdf/2305.03111.pdf

GitHub link for the same - https://bird-bench.github.io/

Topic		Replies	Views
Natural language to Sql conversion AI Discussions	0	56	August 24, 2023
Text2sql model for medical sql generation AI Discussions data-centric , project	0	292	March 22, 2024
Question about Ungraded Lab - Improving SQL Generation Agentic AI week-module-2 , ai-discussions , dl-ai-learning-platform	2	22	April 11, 2026
GenAI conversational application with multi tool interaction AI Discussions ai-discussions , langchain , llama-index	3	143	February 19, 2024
Creating domain specific LLM for creating a virtual data scientist(takes inputs in natural language as a query, uses data which is structured and gives insights as answers) Generative AI with Large Language Models week-module-1	3	500	July 21, 2023

Natural language when passing in prompt getting converted to SQL

Related topics