Getting AI to write good SQL

1. rectang ◴[17 May 25 00:04 UTC] No.44010899[source]▶

> We will cover state-of-the-art [...] how we approach techniques that allows the system to offer virtually certified correct answers.

I don't need AI to generate perfect SQL, because I am never going to trust the output enough to copy/paste it — the risk of subtle semantic errors is too high, even if the code validates.

Instead, I find it helpful for AI to suggest approaches — after which I will manually craft the SQL, starting from scratch.

replies(4): >>44011204 #>>44011349 #>>44011379 #>>44011432 #

2. hosel ◴[17 May 25 01:00 UTC] No.44011204[source]▶

>>44010899 (TP) #

Really? In my experience it’s been pretty good (using Pydantic)! I read over before I execute it, but it’s never done anything malicious.

replies(3): >>44011259 #>>44011340 #>>44013480 #

3. rectang ◴[17 May 25 01:10 UTC] No.44011259[source]▶

>>44011204 #

I don't trust myself to craft a prompt in natural language which completely specifies my intent as codified with the precision of a programming language.

I also tend to turn to AI for advising me on difficult use cases, and most of the time it's for production code rather than one-offs. The easy cases, I just write myself because it's more mental effort to review code for subtle errors than it is to write it.

4. ◴[17 May 25 01:29 UTC] No.44011340[source]▶

>>44011204 #

5. ◴[17 May 25 01:30 UTC] No.44011349[source]▶

>>44010899 (TP) #

6. hsbauauvhabzb ◴[17 May 25 01:36 UTC] No.44011379[source]▶

>>44010899 (TP) #

Explain that to the average manager or junior engineer, both who don’t care about your desire to build well but not fast.

replies(2): >>44011448 #>>44011772 #

7. paulddraper ◴[17 May 25 01:48 UTC] No.44011432[source]▶

>>44010899 (TP) #

Hopefully your trust in yourself is warranted

replies(2): >>44011729 #>>44012314 #

8. noosphr ◴[17 May 25 01:51 UTC] No.44011448[source]▶

>>44011379 #

> So now that we brought down prod for a day the new rule is no AI sql without three humans signing off on any queries.

replies(1): >>44011475 #

9. Closi ◴[17 May 25 01:57 UTC] No.44011475{3}[source]▶

>>44011448 #

If that’s the scenario, I would be asking why the testing pipeline didn’t catch this rather than why was the AI SQL wrong.

replies(3): >>44011595 #>>44011821 #>>44014227 #

10. noosphr ◴[17 May 25 02:31 UTC] No.44011595{4}[source]▶

>>44011475 #

Because the testing pipeline isn't the real database.

Anyone that knows a database well can bring it down with a innocent looking statement that no one else will blink at.

replies(1): >>44020014 #

11. rectang ◴[17 May 25 03:01 UTC] No.44011729[source]▶

>>44011432 #

I embrace my fallibility, and enthusiastically pursue testing, code reviews, staging environments, and so on to minimize the mistakes that make it through to production.

It seems to me that this skeptical mindset is consonant with handling AI output with care.

12. rectang ◴[17 May 25 03:14 UTC] No.44011772[source]▶

>>44011379 #

It’s not true that I want to build “well but not fast” — I’m trying to add value, and both speed and reliability matter. My productivity is high and I don’t have trouble articulating why; my approach has generally (though not universally) been well received by management and colleagues.

13. fkyimeanit ◴[17 May 25 03:28 UTC] No.44011821{4}[source]▶

>>44011475 #

Because the testing pipeline was generated by AI, and code-reviewed by AI, reading a PR description generated by AI.

14. auggierose ◴[17 May 25 05:53 UTC] No.44012314[source]▶

>>44011432 #

You'd rather trust in AI than yourself?

replies(1): >>44012366 #

15. malthaus ◴[17 May 25 06:14 UTC] No.44012366{3}[source]▶

>>44012314 #

in writing good sql code? i definitely would

ai is not going to replace the senior sql expert with 20 years of battle experience in the short-term but support me who last dug into sql 15 years ago and needs to get a working sql query in a project. and ai usually does a better job than me copy pasting googled code in between quickly browsing through tutorials.

16. yahoozoo ◴[17 May 25 11:16 UTC] No.44013480[source]▶

>>44011204 #

What is the relevance of Pydantic with SQL?

17. dns_snek ◴[17 May 25 13:35 UTC] No.44014227{4}[source]▶

>>44011475 #

To offer a 3rd option - what testing pipeline? Incompetent managers aren't going to approve of developers "wasting their time" on writing high quality tests.

18. Closi ◴[18 May 25 09:06 UTC] No.44020014{5}[source]▶

>>44011595 #

Sure, but everyone knows humans end up bringing down the database too by writing an innocent looking test query nobody else blinks at, which is why you end up needing a testing strategy for ANY SQL before YOLO'ing into prod.