TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Text-To-SQL	BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation)	MAC-SQL + GPT-4	Execution Accuracy % (Test)	59.59	# 5
Text-To-SQL	BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation)	MAC-SQL + GPT-4	Execution Accuracy % (Dev)	57.56	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mac-sql-multi-agent-collaboration-for-text-to/text-to-sql-on-bird-big-bench-for-large-scale)](https://paperswithcode.com/sota/text-to-sql-on-bird-big-bench-for-large-scale?p=mac-sql-multi-agent-collaboration-for-text-to)`

MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL

18 Dec 2023 · Bing Wang, Changyu Ren, Jian Yang, Xinnian Liang, Jiaqi Bai, Linzheng Chai, Zhao Yan, Qian-Wen Zhang, Di Yin, Xing Sun, Zhoujun Li ·

Recent LLM-based Text-to-SQL methods usually suffer from significant performance degradation on ``huge" databases and complex user questions that require multi-step reasoning. Moreover, most existing methods neglect the crucial significance of LLMs utilizing external tools and model collaboration. To address these challenges, we introduce MAC-SQL, a novel LLM-based multi-agent collaborative framework. Our framework comprises a core decomposer agent for Text-to-SQL generation with few-shot chain-of-thought reasoning, accompanied by two auxiliary agents that utilize external tools or models to acquire smaller sub-databases and refine erroneous SQL queries. The decomposer agent collaborates with auxiliary agents, which are activated as needed and can be expanded to accommodate new features or tools for effective Text-to-SQL parsing. In our framework, We initially leverage GPT-4 as the strong backbone LLM for all agent tasks to determine the upper bound of our framework. We then fine-tune an open-sourced instruction-followed model, SQL-Llama, by leveraging Code Llama 7B, to accomplish all tasks as GPT-4 does. Experiments show that SQL-Llama achieves a comparable execution accuracy of 43.94, compared to the baseline accuracy of 46.35 for vanilla GPT-4. At the time of writing, MAC-SQL+GPT-4 achieves an execution accuracy of 59.59 when evaluated on the BIRD benchmark, establishing a new state-of-the-art (SOTA) on its holdout test set (https://github.com/wbbeyourself/MAC-SQL).

PDF Abstract

Code

Add Remove Mark official

wbbeyourself/mac-sql official

Tasks

Add Remove

SQL Parsing

Text-To-SQL

Datasets

BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation)

Results from the Paper

Add Remove

Ranked #5 on Text-To-SQL on BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Text-To-SQL	BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation)	MAC-SQL + GPT-4	Execution Accuracy % (Test)	59.59	# 5	Compare
Text-To-SQL		MAC-SQL + GPT-4	Execution Accuracy % (Dev)	57.56	# 4	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • GPT-4 • Label Smoothing • Layer Normalization • Linear Layer • LLaMA • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer

Edit Social Preview

MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove