The author makes the excellent point that LLM-assisted coding still has a human bottleneck at code review, regardless of whether the issue at hand actually gets fixed.
Leaving aside the fact that this isn't really an LLM problem (we've always had tech debt due to cowboy devs and weak management or "commercial imperatives"):
I'd be interested to know whether any of the existing LLM Elo-style leaderboards mark for code quality in addition to issue fixing.
Code quality seems a particularly useful benchmark as models become more powerful in their surface abilities.
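For illustration only, here's a minimal sketch of what "marking for code quality" could look like alongside a pass/fail issue-fix signal. The metric (branching density via the stdlib `ast` module) and the weighting are my own assumptions, not how any actual leaderboard scores submissions:

```python
import ast

# Hypothetical scorer: blends a pass/fail issue-fix signal with a crude
# code-quality penalty based on branching density in the generated patch.
# This illustrates the idea only; it is not any real leaderboard's method.

BRANCH_NODES = (ast.If, ast.For, ast.While, ast.Try, ast.With, ast.BoolOp)

def branching_density(source: str) -> float:
    """Rough quality proxy: branch points per function definition."""
    tree = ast.parse(source)
    branches = sum(isinstance(node, BRANCH_NODES) for node in ast.walk(tree))
    funcs = sum(isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
                for node in ast.walk(tree))
    return branches / max(funcs, 1)

def leaderboard_score(tests_passed: bool, patch_source: str,
                      quality_weight: float = 0.3) -> float:
    """Blend issue fixing (did the tests pass?) with a quality penalty."""
    fix_score = 1.0 if tests_passed else 0.0
    quality_penalty = min(branching_density(patch_source) / 10.0, 1.0)
    return (1 - quality_weight) * fix_score - quality_weight * quality_penalty

if __name__ == "__main__":
    patch = "def fix(x):\n    if x is None:\n        return 0\n    return x + 1\n"
    print(leaderboard_score(tests_passed=True, patch_source=patch))
```

In practice you'd want review-calibrated signals (maintainability, duplication, test coverage of the patch) rather than a single complexity proxy, but even a crude second axis would distinguish "fixed the issue" from "fixed the issue cleanly".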