Getting AI to write good SQL

(cloud.google.com)

501 points richards | 2 comments | 16 May 25 21:10 UTC | HN request time: 0.56s | source

Show context

nashashmi ◴[16 May 25 22:31 UTC] No.44010349[source]▶

>>44009848 (OP) #

AI text to regex solutions would be incredibly handy.

replies(4): >>44010706 #>>44011846 #>>44012981 #>>44015104 #

RadiozRadioz ◴[16 May 25 23:28 UTC] No.44010706[source]▶

>>44010349 #

This comment appears frequently and always surprises me. Do people just... not know regex? It seems so foreign to me.

It's not like it's some obscure thing, it's absolutely ubiquitous.

Relatively speaking it's not very complicated, it's widely documented, has vast learning resources, and has some of the best ROI of any DSL. It's funny to joke that it looks like line noise, but really, there is not a lot to learn to understand 90% of the expressions people actually write.

It takes far longer to tell an AI what you want than to write a regex yourself.

replies(12): >>44010769 #>>44010791 #>>44010803 #>>44010854 #>>44010941 #>>44011236 #>>44011532 #>>44011584 #>>44011591 #>>44012097 #>>44012483 #>>44013224 #

eddd-ddde ◴[16 May 25 23:55 UTC] No.44010854[source]▶

>>44010706 #

I know regex. But I use it so sparingly that every time I need it I forgot again the character for word boundary, or the character for whitespace, or the exact incantation for negative lookahead. Is it >!? who knows.

A shortcut to type in natural language and get something I can validate in seconds is really useful.

replies(1): >>44010931 #

layer8 ◴[17 May 25 00:10 UTC] No.44010931[source]▶

>>44010854 #

How do you validate it if you don’t know the syntax? Or are you saying that looking up syntax –> semantics is significantly quicker than semantics –> syntax? Which I don’t find to be the case. What takes time is grokking the semantics in context, which you have to do in both cases.

replies(2): >>44010966 #>>44014538 #

tough ◴[17 May 25 00:16 UTC] No.44010966[source]▶

>>44010931 #

https://regex101.com/

replies(2): >>44011044 #>>44011120 #

layer8 ◴[17 May 25 00:43 UTC] No.44011120[source]▶

>>44010966 #

That doesn’t answer the question. By “validate”, I mean “prove to yourself that the regular expression is correct”. Much like with program code, you can’t do that by only testing it. You need to understand what the expression actually says.

replies(1): >>44013194 #

widdershins ◴[17 May 25 09:41 UTC] No.44013194[source]▶

>>44011120 #

Testing something is the best way to prove that it behaves correctly in all the cases you can think of. Relying on your own (fallible) understanding is dangerous.

Of course, there may be cases you didn't think of where it behaves incorrectly. But if that's true, you're just as likely to forget those cases when studying the expression to see "what it actually says". If you have tests, fixing a broken case (once you discover it) is easy to do without breaking the existing cases you care about.

So for me, getting an AI to write a regex, and writing some tests for it (possibly with AI help) is a reasonable way to work.

replies(1): >>44014320 #

1. layer8 ◴[17 May 25 13:48 UTC] No.44014320[source]▶

>>44013194 #

I don’t believe this is true. That’s why we do mathematical proofs, instead of only testing all the cases one can think of. It’s important to sanity-check one’s understanding with tests, but mere black-box testing is no substitute for the understanding.

replies(1): >>44015072 #

2. tough ◴[17 May 25 15:40 UTC] No.44015072[source]▶

>>44014320 (TP) #

Code is not perfect like math imho

libraries some times make weird choices

in theory theory and practice are the same, in practice not really

in the context of regex, you have to know which dialect and programming language version of regex you’re targeting for example. its not really universal how all libs/languages works

thus the need to test

↑