
466 points by 0x63_Problems | 9 comments
perrygeo ◴[] No.42138092[source]
> Companies with relatively young, high-quality codebases benefit the most from generative AI tools, while companies with gnarly, legacy codebases will struggle to adopt them. In other words, the penalty for having a ‘high-debt’ codebase is now larger than ever.

This mirrors my experience using LLMs on personal projects. They can provide good advice only to the extent that your project stays within the bounds of well-known patterns. As soon as your codebase gets a little bit "weird" (i.e., trying to do anything novel and interesting), the model chokes, starts hallucinating, and makes your job considerably harder.

Put another way, LLMs make the easy stuff easier but royally screw up the hard stuff. The gap does appear to be widening, not shrinking. They work best where we need them the least.

replies(24): >>42138267 #>>42138350 #>>42138403 #>>42138537 #>>42138558 #>>42138582 #>>42138674 #>>42138683 #>>42138690 #>>42138884 #>>42139109 #>>42139189 #>>42140096 #>>42140476 #>>42140626 #>>42140809 #>>42140878 #>>42141658 #>>42141716 #>>42142239 #>>42142373 #>>42143688 #>>42143791 #>>42151146 #
1. irrational ◴[] No.42138674[source]
I was recently assigned to work on a huge legacy ColdFusion backend service. I was very surprised at how useful AI was with the code. In my experience, it was even better than what I've seen with Python, Java, or TypeScript. The only explanation I can come up with is that there is so much legacy ColdFusion code out there, used to train Copilot and whatever AI JetBrains uses for code completion, that this is one of the languages they are best suited to assist with.
replies(4): >>42139225 #>>42139249 #>>42139393 #>>42139543 #
2. randomdata ◴[] No.42139225[source]
Perhaps it is the reverse: ColdFusion training sources are limited, so the model is more likely to converge on a homogeneous style?

Casually, we usually think of a programming language as being one thing, but in reality a programming language generally only specifies a syntax. All of the other features of a language emerge from the people using it. Because of that, two different people can end up speaking two completely different languages even while sharing the same syntax.

This is especially apparent when you watch someone who is familiar with programming in language X start learning language Y. You'll notice that, at least at first, they will still try to write their programs in language X using Y's syntax, instead of embracing language Y in all its glory. Now multiply that by the millions of developers who will touch code in a popular language like Python, Java, or TypeScript, and things end up all over the place.
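
For example, here is a hypothetical sketch of that effect in Python (the class and function names are illustrative, not taken from any real codebase): a Java-minded newcomer's version next to the idiomatic one.

    # "Java written in Python syntax": a class wrapper, explicit getter-style
    # methods, and index-based loops where none are needed.
    class NameFilter:
        def __init__(self, names):
            self.names = names

        def get_long_names(self):
            result = []
            for i in range(0, len(self.names)):
                if len(self.names[i]) > 5:
                    result.append(self.names[i].upper())
            return result

    # Idiomatic Python for the same task: a single comprehension, no class.
    def long_names(names):
        return [name.upper() for name in names if len(name) > 5]

Both are valid Python, but to a statistical model they read like two fairly different dialects.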

So while you might have a lot more code to train on overall, you need a lot more code for the LLM to be able to discern the different dialects that emerge out of the additional variety. Quantity doesn't imply quality.

replies(1): >>42139415 #
3. mdtancsa ◴[] No.42139249[source]
Similar experience with Perl scripts being rewritten into Go. Crazy good experience with Claude.
4. cpeterso ◴[] No.42139393[source]
But where did these companies get the ColdFusion code for their training data? Since ColdFusion is an old language used for backend services, how much ColdFusion code is open source and crawlable?
replies(2): >>42140959 #>>42141919 #
5. cpeterso ◴[] No.42139415[source]
I wonder what a language designed as a target for LLM-generated code would look like. What semantics and syntax would help the LLM generate code that is more likely to be correct and maintainable by humans?
replies(1): >>42143160 #
6. eqvinox ◴[] No.42139543[source]
That's great, but it's a sample size of 1, and judgments of AI utility are also prone to confirmation bias. If the AI stops providing useful output, you stop using it. It's like "what you're searching for is always in the last place you look": after recognizing the AI's limits, most people won't keep asking it to do things they've learned it can't do. But still, there's an area of things it does handle, and a (OK, fuzzy) boundary to its capabilities.

Basically, for any statement about AI helpfulness, you need to quantify how far it can help you. Otherwise, depending on your personality, it will likely read as either always a success (if you have a positive outlook) or always a failure (if you focus on the negative).

7. irrational ◴[] No.42140959[source]
That's a good question. I presume there is some way to check GitHub for how much code in each language is available on it.
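
For instance, a rough sketch using GitHub's public repository-search API and its language: qualifier (this only counts repositories, which is at best a proxy for how much code exists, and it assumes the "coldfusion" label matches GitHub's language tagging):

    import json
    import urllib.request

    # Rough proxy: count the public repositories GitHub tags with each language.
    # Unauthenticated search is heavily rate-limited, and repo counts say nothing
    # about how many lines of code were actually crawled for training.
    def repo_count(language: str) -> int:
        url = f"https://api.github.com/search/repositories?q=language:{language}"
        req = urllib.request.Request(url, headers={"Accept": "application/vnd.github+json"})
        with urllib.request.urlopen(req) as resp:
            return json.load(resp)["total_count"]

    for lang in ("coldfusion", "python", "java"):
        print(lang, repo_count(lang))
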
8. PeterisP ◴[] No.42141919[source]
I'm definitely assuming that they don't limit their training data to what is open source and crawlable.
9. eru ◴[] No.42143160{3}[source]
Perhaps something like COBOL? (Shudder.)