
181 points thunderbong | 11 comments
mycentstoo ◴[] No.45083181[source]
I believe choosing a well-known problem space in a well-known language certainly influenced a lot of the behavior. An AI's usefulness is strongly correlated with its training data, and there's no doubt been a significant amount of data about both this problem space and Python.

I’d love to see how this compares when either the problem space is different or the language/ecosystem is different.

It was a great read regardless!

replies(5): >>45083320 #>>45085533 #>>45086752 #>>45087639 #>>45092126 #
1. Insanity ◴[] No.45083320[source]
100% this. I tried Haskelling with LLMs and its performance was noticeably worse than with Go.

Although in fairness this was a year ago on GPT 3.5 IIRC

replies(6): >>45083408 #>>45083590 #>>45083706 #>>45085045 #>>45085275 #>>45085640 #
2. danielbln ◴[] No.45083408[source]
Post-training in all frontier models has improved significantly with respect to programming language support. Take Elixir, which LLMs could barely handle a year ago, but now support has gotten really good.
3. diggan ◴[] No.45083590[source]
> Although in fairness this was a year ago on GPT 3.5 IIRC

GPT-3.5 was impressive at the time, but today's SOTA (like GPT-5 Pro) is almost a night-and-day difference, both in terms of producing better code for a wider range of languages (I mostly do Rust and Clojure; it handles those fine now, but was awful with 3.5) and, more importantly, in terms of following your instructions in user/system prompts, so it's easier to get higher-quality code from it now, as long as you can put into words what "higher quality code" means for you.

4. r_lee ◴[] No.45083706[source]
I'm not sure I'd say "100% this" if I was talking about GPT 3.5...
replies(2): >>45084580 #>>45085309 #
5. verelo ◴[] No.45084580[source]
Yeah, 3.5 was good when it came out, but frankly anyone reviewing AI for coding who isn't using Sonnet 4.1, GPT-5, or equivalent really isn't aware of what they've been missing out on.
6. johnisgood ◴[] No.45085045[source]
I wrote some Haskell using Claude. It was great.
7. ocharles ◴[] No.45085275[source]
I write Haskell with Claude Code and it's gotten remarkably good recently. We have some code at work that uses STM to implement what is essentially a mutable state machine. I needed to split a state transition apart, and it did an admirable job; I had to intervene once or twice when it was going down a valid but undesirable approach. This almost-one-shot performance was already a productivity boost, but the result didn't quite build. What I find most impressive now is that the "fix" here is to literally have Claude run the build and see the errors. While GHC errors are verbose and not always the best, it got everything building in a few more iterations. When it later hit a test failure, I suggested we add a bit more logging, so it logged all state transitions, spotted the unexpected transition, and got the test passing. We really are a LONG way from 3.5 performance.
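
To make the pattern concrete, here is a minimal, hypothetical sketch of an STM-backed state machine with logged transitions, in the spirit of what the comment above describes. The states, events, and function names are illustrative assumptions, not the commenter's actual code:

    module Main where

    import Control.Concurrent.STM

    -- Hypothetical states and events, for illustration only.
    data State = Idle | Running | Stopped deriving (Show, Eq)
    data Event = Start | Stop             deriving (Show)

    -- Pure transition function; Nothing marks an event that is
    -- invalid in the current state.
    step :: State -> Event -> Maybe State
    step Idle    Start = Just Running
    step Running Stop  = Just Stopped
    step _       _     = Nothing

    -- Apply an event atomically and hand back (old, result) so the
    -- caller can log every attempted transition.
    transition :: TVar State -> Event -> IO (State, Maybe State)
    transition var ev = atomically $ do
      old <- readTVar var
      case step old ev of
        Just new -> writeTVar var new >> pure (old, Just new)
        Nothing  -> pure (old, Nothing)

    main :: IO ()
    main = do
      machine <- newTVarIO Idle
      -- The last event is invalid and shows up in the log instead of
      -- silently changing state.
      mapM_ (run machine) [Start, Stop, Start]
      where
        run var ev = do
          (old, res) <- transition var ev
          putStrLn $ case res of
            Just new -> show old ++ " --" ++ show ev ++ "--> " ++ show new
            Nothing  -> "unexpected event " ++ show ev ++ " in state " ++ show old

Logging the (old, new) pair on every attempted transition is what makes an unexpected transition, like the one in the anecdote above, easy to spot in test output.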
8. Insanity ◴[] No.45085309[source]
Yeah, that's a fair point. I had assumed it'd remain relatively similar, given that the training data would be smaller for languages like Haskell versus languages like Python and JavaScript.
9. computerex ◴[] No.45085640[source]
3.5 was a joke at coding compared to Sonnet 4.
replies(2): >>45086680 #>>45087723 #
10. Insanity ◴[] No.45086680[source]
Yup, fair point; it's been some time. Although vibe coding is more "miss" than "hit" for me.
11. pizza ◴[] No.45087723[source]
It's so thrilling that this is actually true in just a year