I trained a local 7B model to be better than GPT-4 at writing SQL
Fine-tuned Mistral 7B on 100k SQL query pairs with DPO. It now beats GPT-4 on the Spider benchmark by 12%. Runs on a single RTX 4090. Model weights and training code included.
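
For readers unfamiliar with DPO (Direct Preference Optimization), here is a rough sketch of the per-pair loss it minimizes. This is not the post's training code: the function name `dpo_loss`, the argument names, and `beta=0.1` are all illustrative assumptions, and the inputs are assumed to be summed token log-probabilities of a preferred ("chosen") and a dispreferred ("rejected") SQL query under the model being trained and under a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for a single preference pair (hypothetical helper).

    beta=0.1 is an assumed value, not one taken from the post.
    """
    # How much more (or less) likely each answer is under the policy
    # than under the frozen reference model, in log space.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp

    # The loss is -log(sigmoid(x)) with x as below.
    x = beta * (chosen_margin - rejected_margin)

    # Numerically stable form of log(1 + exp(-x)).
    if x >= 0:
        return math.log1p(math.exp(-x))
    return -x + math.log1p(math.exp(x))


# Policy identical to the reference: loss equals log(2).
baseline = dpo_loss(-5.0, -7.0, -5.0, -7.0)

# Policy has shifted probability toward the chosen query: loss drops.
improved = dpo_loss(-4.0, -9.0, -5.0, -7.0)
```

The loss falls below log(2) only when the policy favors the chosen query over the rejected one more strongly than the reference does, which is what pushes the model toward the preferred SQL.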
Fine-tuning, SQL