
559 points by Gricha | 1 comment
f311a (No.46232910)
I like to ask LLMs to find problems or improvements in 1-2 files. They are pretty good at finding bugs, but for general code improvements, 50-60% of the edits are trash. They add completely unnecessary stuff. If you ask them to improve pretty well-written code, they rarely say it's already good enough.

For example, in a functional-style codebase, they will try to rewrite everything into classes. I have to adjust the prompt with a list of changes I'm not interested in. And some inexperienced people are trying to learn better coding by studying these LLM edits...
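A minimal sketch of what that adjusted prompt can look like, assuming you assemble it in Python before sending it to whatever model you use (the constraint list and helper name are illustrative, not my actual setup):

    # Illustrative helper: pins down the existing style and lists change
    # categories the reviewer is not interested in, so the model doesn't
    # "improve" them away.
    EXCLUDED_CHANGES = [
        "Do not rewrite functions into classes; the codebase is deliberately functional.",
        "Do not rename identifiers or reshuffle imports.",
        "Do not add abstractions or config options for hypothetical future needs.",
    ]

    def build_review_prompt(code: str) -> str:
        rules = "\n".join(f"- {r}" for r in EXCLUDED_CHANGES)
        return (
            "Review the code below for bugs and concrete improvements.\n"
            f"Constraints:\n{rules}\n\n{code}"
        )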

pawelduda (No.46233237)
If you just ask it to find problems, it will do its best to find them, like a while loop with no exit condition. That's why I put a breaker in the prompt, which in this case would be "don't make any improvements if the positive impact is marginal". With that, I've mostly seen it do nothing and just summarize why, followed by some suggestions in case I still want to force the issue.
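A sketch of that breaker, again assuming a plain Python prompt builder (the exact wording and names are just one way to phrase it):

    # Illustrative "breaker" clause: gives the model an explicit way to
    # stop instead of inventing changes to satisfy the request.
    BREAKER = (
        "Only propose a change if it fixes a bug or has a clearly more than "
        "marginal positive impact. If nothing clears that bar, reply "
        "'No changes needed.' and briefly explain why."
    )

    def with_breaker(prompt: str) -> str:
        # Appended last so it reads as the final instruction.
        return f"{prompt}\n\n{BREAKER}"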
f311a (No.46233341)
I guess "marginal impact" is a pretty arbitrary metric for them, and it will differ on each run. I'll try it next time.

Another problem is that they try to add handling for cases that are never present in my data. I have to say explicitly that there is no need to generalize the handling. For example, my code handles PNG files, and they add JPG handling that never gets exercised.
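One way to head that off is stating the data invariants up front in the prompt; a sketch (the PNG case matches mine, the wording is illustrative):

    # Illustrative invariants block: tells the model which inputs are
    # guaranteed so it doesn't generalize handling to cases that never occur.
    DATA_INVARIANTS = (
        "Input invariants (treat as guaranteed):\n"
        "- Files are always PNG; JPG or other formats never appear.\n"
        "- Do not add branches or validation for inputs outside these "
        "invariants."
    )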