Most active commenters

diggan(8)
alganet(6)
dist-epoch(5)
dcminter(5)
danielbln(3)
kookamamie(3)
TZubiri(3)

Popular/hot comments

>>44403162 #

LLMs bring new nature of abstraction – up and sideways

(martinfowler.com)

1. oytis ◴[24 Jun 25 15:11 UTC] No.44367106[source]▶

I don't get his argument, and if it wasn't Martin Fowler I would just dismiss it. He admits himself that it's not an abstraction over previous activity as it was with HLLs, but rather a new activity altogether - that is prompting LLMs for non-deterministic outputs.

Even if we assume there is value in it, why should it replace (even if in part) the previous activity of reliably making computers do exactly what we want?

replies(2): >>44403162 #>>44403847 #

2. felineflock ◴[24 Jun 25 22:28 UTC] No.44371711[source]▶

>>44366904 (OP) #

It is a new nature of abstraction, not a new level.

UP: It lets us state intent in plain language, specs, or examples. We can ask the model to invent code, tests, docs, diagrams—tasks that previously needed human translation from intention to syntax.

BUT SIDEWAYS: Generation is a probability distribution over tokens. Outputs vary with sampling temperature, seed, context length, and even with identical prompts.

replies(2): >>44403418 #>>44403438 #

3. dtagames ◴[25 Jun 25 02:07 UTC] No.44373048[source]▶

>>44366904 (OP) #

I respect Martin Fowler greatly but those who, by their own admission, have not used current AI coding tools really don't have much to add regarding how they affect our work as developers.

I do hope he takes the time to get good with them!

replies(1): >>44403527 #

4. nmaley ◴[25 Jun 25 04:22 UTC] No.44373601[source]▶

>>44366904 (OP) #

I'm in the process of actually building LLM based apps at the moment, and Martin Fowler's comments are on the money. The fact is seemingly insignificant changes to prompts can yield dramatically different outcomes, and the odd new outcomes have all these unpredictable downstream impacts. After working with deterministic systems most of my career it requires a different mindset.

It's also a huge barrier to adoption by mainstream businesses, which are used to working to unambiguous business rules. If it's tricky for us developers it's even more frustrating to end users. Very often they end up just saying, f* it, this is too hard.

I also use LLM's to write code and for that they are a huge productivity boon. Just remember to test! But I'm noticing that use of LLM's in mainstream business applications lags the hype quite a bit. They are touted as panaceas, but like any IT technology they are tricky to implement. People always underestimate the effort necessary to get a real return, even with deterministic apps. With indeterministic apps it's an even bigger problem.

replies(1): >>44402970 #

5. CraigJPerry ◴[28 Jun 25 07:52 UTC] No.44402970[source]▶

>>44373601 #

Some failure modes can be annoying to test for. For example, if you exceed the model’s context window, nothing will happen in terms of errors or exceptions but the observable performance on the task will tank.

Counting tokens is the only reliable defence i found to this.

replies(1): >>44403060 #

6. danielbln ◴[28 Jun 25 08:15 UTC] No.44403060{3}[source]▶

>>44402970 #

If you exceed the context window the remote LLM endpoint will throw you an error which you probably want to catch, or rather you want to catch that before it happens and deal with it. Either way, it's not a silent error that goes unnoticed usually, what makes you think that?

replies(2): >>44403270 #>>44403434 #

7. dist-epoch ◴[28 Jun 25 08:38 UTC] No.44403162[source]▶

>>44367106 #

Because unreliably solving a harder problem with LLMs is much more valuable than reliably solving an easier problem without.

replies(4): >>44403214 #>>44403346 #>>44404165 #>>44407471 #

8. darkwater ◴[28 Jun 25 08:50 UTC] No.44403214{3}[source]▶

>>44403162 #

Which harder problems are LLMs going to (unreliably) solve in your opinion?

replies(1): >>44403853 #

9. CraigJPerry ◴[28 Jun 25 09:05 UTC] No.44403270{4}[source]▶

>>44403060 #

Interesting, the completion return object is documented but theres no error or exception field. In practice the only errors ive seen so far have been on the HTTP transport layer.

It would make sense to me for the chat context to raise an exception. Maybe i should read the docs further…

10. oytis ◴[28 Jun 25 09:25 UTC] No.44403346{3}[source]▶

>>44403162 #

OK, so we are having two classes of problems here - ones worth solving unreliably, and ones that are better solved without LLMs. Doesn't sound like a next level of abstraction to me

replies(2): >>44403871 #>>44404015 #

11. dcminter ◴[28 Jun 25 09:38 UTC] No.44403418[source]▶

>>44371711 #

Surely given an identical prompt with a clean context and the same seed the outputs will not vary?

replies(2): >>44403454 #>>44404213 #

12. diggan ◴[28 Jun 25 09:41 UTC] No.44403434{4}[source]▶

>>44403060 #

> If you exceed the context window the remote LLM endpoint will throw you an error which you probably want to catch

Not every endpoint works the same way, I'm pretty sure LM Studio's OpenAI-compatible endpoints will silently (from the clients perspective) truncate the context, rather than throw an error. It's up to the client to make sure the context fits in those cases.

OpenAI's own endpoints do show an error and refuses if you exceed the context length though. I think I've seen others use the "finish_reason" attribute too to signal the context length was exceeded, rather than setting an error status code on the response.

Overall, even "OpenAI-compatible" endpoints often aren't 100% faithful reproductions of the OpenAI endpoints, sadly.

replies(1): >>44403624 #

13. genidoi ◴[28 Jun 25 09:43 UTC] No.44403438[source]▶

>>44371711 #

This is too abstract and a concrete example of what this looks like in output is needed.

14. diggan ◴[28 Jun 25 09:45 UTC] No.44403454{3}[source]▶

>>44403418 #

+ temperature=0.0 would be needed for reproducible outputs. And even with that, if it's actually reproducible or not depends on the model/weights themselves, not all of them are even when all those things are static. And then finally depends on the implementation of the model architecture as well.

I think the tricky part is that we tend to think that prompts with similar semantic meaning will give the same outputs (like a human), while LLMs can give vastly different outputs if you have one spelling mistake for example, or used "!" instead of "?", the effect varies greatly per model.

replies(2): >>44403803 #>>44403989 #

15. alganet ◴[28 Jun 25 09:52 UTC] No.44403486[source]▶

>>44366904 (OP) #

> This evolution in non-determinism is unprecedented in the history of our profession.

Not actually true. Fuzzing and mutation testing have been here for a while.

replies(2): >>44403513 #>>44404354 #

16. diggan ◴[28 Jun 25 10:00 UTC] No.44403513[source]▶

>>44403486 #

I think the whole context of the article is "program with non-deterministic tools", while non-deterministic fuzzing and mutation testing is kind of isolated to "coming up with test cases", not something you constantly program side-by-side with, or even integrate into the (business-side) of the software project itself. That's how I've used fuzzing and mutation testing in the past at least, maybe others use it differently.

Otherwise yeah, there are a bunch of non-deterministic technologies, processes and workflows missing, like what Machine Learning folks been doing for decades, which is also software and non-deterministic, but also off-topic from context of the article, as I read it.

replies(1): >>44403633 #

17. diggan ◴[28 Jun 25 10:03 UTC] No.44403527[source]▶

>>44373048 #

> have not used current AI coding tools really don't have much to add regarding how they affect our work as developers

I dunno, sometimes it's helpful to learn about the perspectives of people who've watched something from afar as well, especially if they already have broad knowledge and context that is adjacent to the topic itself, and have lots of people around them deep in the trenches that they've discussed with.

A bit like historians still can provide valuable commentary on wars, even though they (probably) haven't participated in the wars themselves.

replies(1): >>44404350 #

18. danielbln ◴[28 Jun 25 10:24 UTC] No.44403624{5}[source]▶

>>44403434 #

That seems like terrible API design to just truncate without telling the caller. Anthropic, Google and OpenAI all will fail very loudly if you exceed the context window, and that's how it should be. But fair enough, this shouldn't happen anyway and the context should be actively handled before it blows up either way.

replies(2): >>44404066 #>>44404527 #

19. alganet ◴[28 Jun 25 10:26 UTC] No.44403633{3}[source]▶

>>44403513 #

I just have a problem with his use of the word "unprecedent".

This is not the first rodeo of our profession with non-determinism.

20. dcminter ◴[28 Jun 25 11:05 UTC] No.44403803{4}[source]▶

>>44403454 #

Hmm, I'm barely even a dabbler, but I'd assumed that the seed in question drove the (pseudo)randomness inherent in "temperature" - if not, what seed(s) do they use and why could one not set that/those too?

To your second part I wouldn't make that assumption - I can see how a non-technical person might, but surely programmers wouldn't? I've certainly produced very different output from that which I intended in boring old C with a mis-placed semi-colon after all!

replies(1): >>44404515 #

21. kookamamie ◴[28 Jun 25 11:15 UTC] No.44403847[source]▶

>>44367106 #

Funny, I dismiss the opinion based on the author in question.

replies(1): >>44403918 #

22. dist-epoch ◴[28 Jun 25 11:17 UTC] No.44403853{4}[source]▶

>>44403214 #

Anything which requires "common sense".

A contrived example: there are only 100 MB of disk space left, but 1 GB of logs to write. LLM discards 900 MB of logs and keeps only the most important lines.

Sure, you can nitpick this example, but it's the kind of edge case handling that LLMs can "do something resonable" that before required hard coding and special casing.

replies(1): >>44406838 #

23. dist-epoch ◴[28 Jun 25 11:22 UTC] No.44403871{4}[source]▶

>>44403346 #

I was thinking more along this line: you can solve unreliably 100% of the problem with LLMs, or solve reliably only 80% of the problem.

So you trade reliability to get to that extra 20% of hard cases.

24. Insanity ◴[28 Jun 25 11:35 UTC] No.44403918{3}[source]▶

>>44403847 #

Serious question - why? I know of the author but don’t see a reason to value his opinion on this topic more or less because of this.

(Attaching too much value to the person instead of the argument is more of an ‘argument from authority’)

replies(1): >>44404170 #

25. smokel ◴[28 Jun 25 11:49 UTC] No.44403989{4}[source]▶

>>44403454 #

> I think the tricky part is that we tend to think that prompts with similar semantic meaning will give the same outputs (like a human)

Trust me, this response would have been totally different if I were in a different mood.

26. smokel ◴[28 Jun 25 11:54 UTC] No.44404011[source]▶

>>44366904 (OP) #

Abstractions for high-level programming languages have always gone in multiple directions (or dimensions if you will). Operations in higher level languages abstract over multiple simpler operations in other languages, but they also allow for abstraction over human concepts, by introducing variable names for example. Variable names are irrelevant to a computer, but highly relevant to humans.

Languages are created to support both computers as well as humans. And to most humans, abstractions such as those presented by, say, Hibernate annotations, are as non-deterministic as can be. To the computer it is all the same, but that is increasingly becoming less relevant, given that software is growing and has to be maintained by humans.

So, yes, LLMs are interesting, but not necessarily that much of a game-changer when compared to the mess we are already in.

27. pydry ◴[28 Jun 25 11:55 UTC] No.44404015{4}[source]▶

>>44403346 #

The story of programming is not largely one of humans striving to be more reliable when programming but putting up better defenses against our own inherent unreliabilities.

When I watch juniors struggle they seem to think that it's because they dont think hard enough whereas it's usually because they didnt build enough infrastructure that would prevent them from needing to think too hard.

As it happens, when it comes to programming, LLM unreliabilities seem to align quite closely with ours so the same guardrails that protect against human programmers' tendencies to fuck up (mostly tests and types) work pretty well for LLMs too.

28. dist-epoch ◴[28 Jun 25 12:05 UTC] No.44404066{6}[source]▶

>>44403624 #

It's complicated, for example some models (o3) will throw an error if you set temperature.

What do you do if you want to support multiple models in your LLM gateway? Do you throw an error if a user sets temperature for o3, thus dumping the problem on them? Or just ignore it, but potentially creating confusion because temperature will seem to not work for some models?

replies(1): >>44404385 #

29. furyofantares ◴[28 Jun 25 12:20 UTC] No.44404165{3}[source]▶

>>44403162 #

I'm pretty deep into these things and have never had them solve a harder problem than I can solve. They just solve problems I can solve much, much faster.

Maybe that does add up to solving harder higher level real world problems (business problems) from a practical standpoint, perhaps that's what you mean rather than technical problems.

Or maybe you're referring to producing software which utilizes LLMs, rather than using LLMs to program software (which is what I think the blog post is about, but we should certainly discuss both.)

replies(1): >>44404503 #

30. kookamamie ◴[28 Jun 25 12:22 UTC] No.44404170{4}[source]▶

>>44403918 #

Let's just say I think a lot of damage was caused by their OOP evangelism back in the day.

replies(2): >>44404432 #>>44406920 #

31. furyofantares ◴[28 Jun 25 12:29 UTC] No.44404213{3}[source]▶

>>44403418 #

You can make these things deterministic for sure, and so you could also store prompts plus model details instead of code if you really wanted to. Lots of reasons this would be a very very poor choice but you could do it.

I don't think that's how you should think about these things being non-deterministic though.

Let's call that technical determinism, and then introduce a separate concept, practical determinism.

What I'm calling practical determinism is your ability as the author to predict (determine) the results. Two different prompts that mean the same thing to me will give different results, and my ability to reason about the results from changes to my prompt is fuzzy. I can have a rough idea, I can gain skill in this area, but I can't gain anything like the same precision as I have reasoning about the results of code I author.

32. TZubiri ◴[28 Jun 25 12:50 UTC] No.44404350{3}[source]▶

>>44403527 #

I agree, I don't use coding tools, "except to ask for a script to chatgpt every once in a while". But I experience it by reviewing and detecting LLM generated code by consultants and juniors. It's easy to ask them for the prompts for example, but when they use autocompletion based LLMs, it's really hard to distinguish source from target code.

33. TZubiri ◴[28 Jun 25 12:51 UTC] No.44404354[source]▶

>>44403486 #

Right, in testing, but not in the compiler chain

replies(1): >>44404505 #

34. danielbln ◴[28 Jun 25 12:55 UTC] No.44404385{7}[source]▶

>>44404066 #

I'm a big fan of fail early and fail loudly.

35. diggan ◴[28 Jun 25 13:03 UTC] No.44404432{5}[source]▶

>>44404170 #

You don't think the damage was done by the people who religiously follow whatever loudmouths says? Those are the people I'd stop listening to, rather than ignoring what an educator says when sharing their perspective.

Don't get me wrong, I feel like Fowler is wrong about some things too, and wouldn't follow what he says as dogma, but I don't think I'd attribute companies going after the latest fad as his fault.

replies(2): >>44404971 #>>44406359 #

36. dist-epoch ◴[28 Jun 25 13:16 UTC] No.44404503{4}[source]▶

>>44404165 #

> solve a harder problem than I can solve

If you've never done web-dev, and want to create an web-app, where does that fall? In principle you could learn web-dev in 1 week/month, so technically you could do it.

> maybe you're referring to producing software which utilizes LLMs

but yes, this is what I meant, outsourcing "business logic" to an LLM instead of trying to express it in code.

37. alganet ◴[28 Jun 25 13:16 UTC] No.44404505{3}[source]▶

>>44404354 #

I don't understand what you mean. Can you elaborate on your perception of what a "compiler chain" is and the supposed LLM role in it?

replies(1): >>44411787 #

38. diggan ◴[28 Jun 25 13:18 UTC] No.44404515{5}[source]▶

>>44403803 #

> Hmm, I'm barely even a dabbler, but I'd assumed that the seed in question drove the (pseudo)randomness inherent in "temperature" - if not, what seed(s) do they use and why could one not set that/those too?

Implementations and architectures are different enough that it's hard to say "It's like X" in all cases. Last time I tried to achieve 100% reproducible outputs, which obviously includes hard-coding various seeds, I remember not getting reproducible outputs unless setting temperature to 0, I think this was with Qwen2 or Qwq used via Huggingface's Transformers library, but cannot find the exact details now.

Then in other cases, like the hosted OpenAI models, they straight up say "temperature to 0 makes them mostly deterministic", but I'm not exactly sure why they are unable to offer endpoints with determinism.

> I can see how a non-technical person might, but surely programmers wouldn't?

When talking even with developers about prompting and LLMs, there is still quite a few people who are surprised that "You are a helpful assistant." would lead to different outputs than "You are a helpful assistant!". I think if you're a programmer or not matters less, more about understanding how the LLMs actually work in order to understand that.

replies(1): >>44407315 #

39. diggan ◴[28 Jun 25 13:20 UTC] No.44404527{6}[source]▶

>>44403624 #

> That seems like terrible API design to just truncate without telling the caller

Agree, confused me a lot the first time I encountered it.

It would be great if implementations/endpoints could converge, but with OpenAI moving to the Responses API rather than ChatCompletion, yet the rest of the ecosystem seemingly still implementing ChatCompletion with various small differences (like how to do structured outputs), it feels like it's getting further away, not closer...

40. bgwalter ◴[28 Jun 25 13:54 UTC] No.44404714[source]▶

>>44366904 (OP) #

How many bandwagons has this guy jumped on? Now he says that LLMs will be the new high level programming languages but also that he listens to colleagues and hasn't really tried them yet.

I suppose he is aiming for a new book and speaker fees from the LLM industrial complex.

41. somewhereoutth ◴[28 Jun 25 14:26 UTC] No.44404934[source]▶

>>44366904 (OP) #

> As we learn to use LLMs in our work, we have to figure out how to live with this non-determinism. This change is dramatic, and rather excites me. I'm sure I'll be sad at some things we'll lose, but there will also things we'll gain that few of us understand yet. This evolution in non-determinism is unprecedented in the history of our profession.

The whole point of computers is that they were deterministic, such that any effective method can be automated - leaving humans to do the non-deterministic (and hopefully more fun) stuff.

Why do we want to break this up-to-now hugely successful symbiosis?

42. kookamamie ◴[28 Jun 25 14:30 UTC] No.44404971{6}[source]▶

>>44404432 #

Perhaps. Then again, advocating things like Singleton as anything beyond a gloriefied global variable is pretty high on my BS list.

An example: https://martinfowler.com/bliki/StaticSubstitution.html

replies(1): >>44405576 #

43. stpedgwdgfhgdd ◴[28 Jun 25 15:01 UTC] No.44405163[source]▶

>>44366904 (OP) #

I’m programming software development workflows for Claude in plain english (custom commands). The non-deterministic is indeed a (tiny!) bit of a problem, but just tell Claude to improve the command so next time it won’t make the same mistake. One time it added an implementation section to the command. Pretty cool

This is the big game changer: we have a programming environment where the program can improve itself. That is something Fortran couldn’t do.

replies(1): >>44405208 #

44. stpedgwdgfhgdd ◴[28 Jun 25 15:07 UTC] No.44405208[source]▶

>>44405163 #

(bit off topic, but I wished someone told me this a few months ago; for regular programming generation, use TDD and a type safe language like Go. (Fast compile times and excellent testing support). Don’t aim for the magical huge waterfall prompt)

45. diggan ◴[28 Jun 25 15:49 UTC] No.44405576{7}[source]▶

>>44404971 #

> gloriefied global variable is pretty high on my BS list

Say you have a test that is asserting the output of some code, and that code is using a global variable of some kind, how do you ensure you can have tests that are using different values for that global variable and it all works? You'd need to be able to change it during tests somehow.

Personally, I think a lot of the annoying parts of programming go away when you use a more expressive language (like Clojure), including this one. But for other languages, you might need to work around the limitations of the language and then approaches like using Singletons might make more sense.

At the same time, Fowlers perspective is pretty much always in the context of "I have this piece of already written code I need to make slightly better", obviously the easy way is to not have global variables in the first place, but when working with legacy code you do stumble upon one or three non-optimal conditions.

46. alganet ◴[28 Jun 25 17:15 UTC] No.44406359{6}[source]▶

>>44404432 #

You need to understand that Mr. Fowler works for a consultancy.

LLMs sound great for consultants. A messy hyped technology that you can charge to pretend to fix? Jackpot.

All things these consultancies eventually promote are learnings they had with their own clients.

The OOP patterns he described in the past likely came from observing real developers while being in this consultant role, and _trying_ to document how they overcame typical problems of the time.

I have a feeling that the real people with skin on the game (not consultants) that came up with that stuff would describe it in much simpler terms.

Similarly, it is likely that some of these posts are based on real experience but "consultancified" (made vague and more complex than it needs to be).

replies(1): >>44407367 #

47. sarchertech ◴[28 Jun 25 18:10 UTC] No.44406838{5}[source]▶

>>44403853 #

In that example something simple like log the errors, or log the first error of the same type per 5 minute block had some percent chance of solving 100% of the problem.

And it’s not just this specific problem. I don’t think letting an LLM handle edge cases is really ever an appropriate use case in production.

I’d much rather the system just fail so that someone will fix it. Imagine a world where at every level instead of failing and halting, everything error just got bubbled up to an LLM that tried to do something reasonable.

Talk about emergent behavior, or more likely catastrophic cascading failures.

I can kind of see your point if you’re talking about a truly hopeless scenario. Like some imaginary autonomous spacecraft that is going to crash into the sun, so in a last ditch effort the autopilot turns over the controls to an LLM.

But even in that scenario we have to have some way of knowing that we truly are in a hopeless scenario. Maybe it just appears that way and the LLM makes it worse.

Or maybe the LLM decides to pilot it into another spacecraft to reduce velocity.

My point is there aren’t many scenarios where “do something reasonable 90% of the time, but do something insane the other 10% of the time” is better than do nothing.

I’ve been using LLMs at work and my gut feeling saying I’m getting some productivity boost, but I’m not even certain of that because I have also spent time chasing subtle bugs that I wouldn’t have introduced myself. I think I’m going to need to see the results of some large well designed studies and several years of output before I really feel confident saying one way or the other.

48. Disposal8433 ◴[28 Jun 25 18:23 UTC] No.44406920{5}[source]▶

>>44404170 #

His Refactoring book was a good thing at the time. But it ends there, he should have tried to program instead of writing all the other books that made no sense.

49. dcminter ◴[28 Jun 25 19:10 UTC] No.44407315{6}[source]▶

>>44404515 #

Oh, well that's super interesting, thanks; I guess some side effect of the high degree of parallelism? Anyway, I guess I need to do a bit more than dabble.

> I think if you're a programmer or not matters less, more about understanding how the LLMs actually work in order to understand that.

Sounds like I need to understand them better then as I merely had different misaprehensions than those. More reading for me...

50. dcminter ◴[28 Jun 25 19:18 UTC] No.44407367{7}[source]▶

>>44406359 #

I'm a bit too lazy to check, but didn't he leave thoughtworks?

Apropos of nothing I saw him speak once at a corporate shindig and I didn't get the impression that he enjoyed it very much. Some of the engineering management were being super weird about him being a (very niche) famous person too...

replies(1): >>44408005 #

51. ◴[28 Jun 25 19:31 UTC] No.44407471{3}[source]▶

>>44403162 #

52. alganet ◴[28 Jun 25 20:41 UTC] No.44408005{8}[source]▶

>>44407367 #

https://martinfowler.com/aboutMe.html

> [...] I work for Thoughtworks [...]

> [...] I don't come up with original ideas, but do a pretty good job of recognizing and packaging the ideas of others [...]

> [...] I see my main role as helping my colleagues to capture and promulgate what we've learned about software development to help our profession improve. We've always believed that this openness helps us find clients, recruit the best people, and help our clients succeed. [...]

So, we should read him as such. It's a consultant, trying to capture what successful teams do. Sometimes succeeding, sometimes failing.

replies(1): >>44412862 #

53. gdubs ◴[28 Jun 25 23:27 UTC] No.44409031[source]▶

>>44366904 (OP) #

The Star Trek computer – I grew up with TNG – is the example I always think of. They ask it in human language to pull up some data, create a visualization, run an analysis. You get the sense that people also still write programs in the future in a more manual way – but, done well these LLMs are the building blocks for that more conversational way of getting the computer to do what you want it to do.

A lot of the complaints that come up on Hacker News are around the idea that a piece of code needs to be elegantly crafted "Just so" for a particular purpose. An efficient algorithm, a perfectly correct program. (Which, sorry but – have you seen most of the software in the world?)

And that's all well and good – I like the craft too. I'm proud of some very elegant code I've written.

But, the writing is on the wall – this is another turning point in computing similar to the personal computer. People scoffed at that too. "Why would regular people want a computer? Their programs will be awful!"

54. TZubiri ◴[29 Jun 25 10:06 UTC] No.44411787{4}[source]▶

>>44404505 #

A C compiler outputs x86 or ARM or whatever assembly. C is the source, x86 is the target code.

Javascript is source code that might be interpreted or might output html target code (by Dom manipulation)

Typescript compiles to javascript.

Now javascript is both source and target code. If you upload javascript code that was generated by ts to your repo and you leave out your ts, that's bad.

Similarly, an LLM has english (or any natural language) as it's source code and typescript (or whatever programming language) as its target code. You shouldn't upload your target code to your repo, and you shouldn't consider it source code.

It's interesting that the compiler in this case is non deterministic, but it doesn't change the fact that the prompts are source code, the vibecode is target code.

I have a repo that showcases this

https://github.com/TZubiri/keyboard-transpositions-checker

replies(1): >>44413849 #

55. dcminter ◴[29 Jun 25 13:10 UTC] No.44412862{9}[source]▶

>>44408005 #

Yeah, seems I was misremembering - looks like he just doesn't do talks any more.

56. alganet ◴[29 Jun 25 15:24 UTC] No.44413849{5}[source]▶

>>44411787 #

Read his text more carefully:

> I can't just store my prompts in git and know that I'll get the same behavior each time

He's not on this idea of using english as source code. He explicitly acknowledges that it doesn't work that way (although he's vague in what _actually_ would replace this).

In summary, he's not talking about english as source code.

It _could_ be that someone else figures out how to use english as authoritative source, but that's not what he's talking about.

In that sense, he's talking about using LLMs as the IDE, tooling. It's not that different from using mutation testing (not something I would commit to the repo), and I stand by my original statement that this is not "unprecedent" as it seems.

↑