Tools: Code Is All You Need

(lucumr.pocoo.org)

pclowes ◴[03 Jul 25 13:16 UTC] No.44454741[source]▶

>>44453688 (OP) #

Directionally I think this is right. Most LLM usage at scale tends to fill the gaps between two hardened interfaces. The reliability comes not from the LLM inference and generation but from the interfaces themselves, which only allow certain configurations to work with them.

LLM output is often coerced back into something more deterministic such as types, or DB primary keys. The value of the LLM is determined by how well your existing code and tools model the data, logic, and actions of your domain.

In some ways I view LLMs today a bit like 3D printers, both in terms of hype and in terms of utility. They excel at quickly connecting parts, much like rapid prototyping with 3D-printed parts. For reliability and scale you want either the LLM or an engineer to replace the printed/inferred connector with something durable and deterministic (metal/code) that is cheap and fast to run at scale.

Additionally, there was a minute during the 3D printer Gartner hype cycle when there were notions that we would all just print substantial amounts of consumer goods, when in reality the high-utility use cases are much narrower. There is a parallel here to LLM usage. While LLMs are extremely useful, we cannot rely on LLMs to generate or infer our entire operational reality or even engage meaningfully with it without some sort of pre-existing digital modeling as an anchor.
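To make the "coerced back into something more deterministic" point concrete, here is a minimal sketch of that kind of coercion step. It is illustrative only: the `Action` enum, the `KNOWN_ORDER_IDS` set (a stand-in for a real primary-key lookup), and the `coerce` helper are hypothetical names, not anything from the linked post or a particular library.

```python
import json
from enum import Enum


class Action(Enum):
    # Hypothetical closed set of actions the surrounding code supports.
    REFUND = "refund"
    ESCALATE = "escalate"
    CLOSE = "close"


# Stand-in for a database primary-key lookup (assumption for this sketch).
KNOWN_ORDER_IDS = {101, 102, 103}


def coerce(raw: str) -> tuple[Action, int]:
    """Force free-form model output into a typed action and a known key."""
    data = json.loads(raw)  # raises a ValueError subclass on invalid JSON
    # Enum lookup raises ValueError for anything outside the closed set.
    action = Action(str(data["action"]).strip().lower())
    order_id = int(data["order_id"])
    if order_id not in KNOWN_ORDER_IDS:
        raise ValueError(f"unknown order id: {order_id}")
    return action, order_id
```

Anything the model emits that does not map onto an existing action or key is rejected, so the reliability comes from the interface rather than from the generation.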

replies(4): >>44455110 #>>44455475 #>>44455505 #>>44456514 #

1. whiplash451 ◴[03 Jul 25 14:36 UTC] No.44455505[source]▶

>>44454741 #

Interesting take but too bearish on LLMs in my opinion.

LLMs have already found large-scale usage (deep research, translation) which makes them more ubiquitous today than 3D printers ever will or could have been.

replies(7): >>44455662 #>>44455664 #>>44456263 #>>44456415 #>>44456476 #>>44456575 #>>44458961 #

2. benreesman ◴[03 Jul 25 14:51 UTC] No.44455664[source]▶

>>44455505 (TP) #

What we call an LLM today (by which almost everyone means an autoregressive language model from the Generative Pretrained Transformer family tree, though BERTs are still doing important work, believe that) is actually an offshoot of neural machine translation.

This isn't (intentionally at least) mere HN pedantry: they really do act like translation tools in a bunch of observable ways.

And while they have recently crossed the threshold into "yeah, I'm always going to have a gptel buffer open now" territory at the extreme high end, their utility outside of the really specific, totally non-generalizing code lookup gizmo usecase remains a claim unsupported by robust profits.

There is a hole in the ground with something between 100 billion and a trillion dollars sunk into it, and so far it has about 20B in revenue (not profit) going into it annually.

AI is going to be big (it was big ten years ago).

LLMs? Look more and more like the Metaverse every day as concerns the economics.

replies(2): >>44455935 #>>44456578 #

3. dingnuts ◴[03 Jul 25 14:51 UTC] No.44455662[source]▶

>>44455505 (TP) #

And yet you didn't provide a single reference link! Every case of LLM usage that I've seen claimed about those things has been largely a lie -- guess you won't take the opportunity to be the first to present a real example. Just another rumor.

replies(1): >>44455788 #

4. whiplash451 ◴[03 Jul 25 15:03 UTC] No.44455788[source]▶

>>44455662 #

My reference is the daily usage of chatgpt around me (outside of tech circles).

I don’t want to sound like a hard-core LLM believer. I get your point and it’s fair.

I just wanted to point out that the current usage of chatgpt is a lot broader than that of 3D printers even at the peak hype of it.

replies(1): >>44455851 #

5. dingnuts ◴[03 Jul 25 15:10 UTC] No.44455851{3}[source]▶

>>44455788 #

Outside of tech circles it looks like NFTs: people following hype using tech they don't understand, which will be popular until the downsides we're aware of, and that they are ignorant of, have consequences, and then the market will reflect the shift in opinion.

replies(4): >>44455972 #>>44455975 #>>44457079 #>>44457364 #

6. rapind ◴[03 Jul 25 15:18 UTC] No.44455935[source]▶

>>44455664 #

> There is a hole in the ground with something between 100 billion and a trillion dollars sunk into it, and so far it has about 20B in revenue (not profit) going into it annually.

This is a concern for me. I'm using claude-code daily and find it very useful, but I'm expecting the price to continue getting jacked up. I do want to support Anthropic, but they might eventually need to cross a price threshold where I bail. We'll see.

I expect at some point the more open models and tools will catch up when the expensive models like ChatGPT plateau (assuming they do plateau). Then we'll find out if these valuations measure up to reality.

Note to the Hypelords: It's not perfect. I need to read every change and intervene often enough. "Vibe coding" is nonsense as expected. It is definitely good though.

replies(3): >>44456502 #>>44457037 #>>44457552 #

7. whiplash451 ◴[03 Jul 25 15:21 UTC] No.44455972{4}[source]▶

>>44455851 #

I see it differently: people are switching to chatgpt like they switched to google back in 2005 (from whatever alternative existed back then)

And I mean random people, not tech circles

It’s very different from NFTs in that respect

8. basch ◴[03 Jul 25 15:21 UTC] No.44455975{4}[source]▶

>>44455851 #

No way.

Everybody under a certain age is using ChatGPT where they were once using search and friendship/expertise. It’s the number 1 app in the App Store. Copilot use in the enterprise is so seamless, you just talk to PowerPoint or Outlook and it formulates what you were supposed to make or write.

It’s not a fad, it is a paradigm change.

People don’t need to understand how it works for it to work.

replies(2): >>44458465 #>>44459185 #

9. datameta ◴[03 Jul 25 15:49 UTC] No.44456263[source]▶

>>44455505 (TP) #

Without trying to take away from your assertion, I think it is worthwhile to mention that part of this phenomenon is the unavoidable matter of meatspace being expensive and dataspace being intangibly present everywhere.

10. deadbabe ◴[03 Jul 25 16:01 UTC] No.44456415[source]▶

>>44455505 (TP) #

Large-scale usage in niche domains is still small scale overall.

11. kibwen ◴[03 Jul 25 16:06 UTC] No.44456476[source]▶

>>44455505 (TP) #

No, 3D printers are the backbone of modern physical prototyping. They're far more important to today's global economy than LLMs are, even if you don't have the vantage point to see it from your sector. That might change in the future, but snapping your fingers to wink LLMs out of existence would change essentially nothing about how the world works today; it would be a non-traumatic non-event. There just hasn't been time to integrate them into any essential processes.

replies(1): >>44456633 #

12. benreesman ◴[03 Jul 25 16:09 UTC] No.44456502{3}[source]▶

>>44455935 #

Vibe coding is nonsense, and it's really kind of uncomfortable to realize that a bunch of people you had tons of respect for are either ignorant or dishonest/bought enough to say otherwise. There's a cold wind blowing and the bunker-building crowd, well let's just say I won't shed a tear.

You don't stock antibiotics and bullets in a survival compound because you think that's going to keep out a paperclip optimizer gone awry. You do that in the forlorn hope that when the guillotines come out that you'll be able to ride it out until the Nouveau Regime is in a negotiating mood. But they never are.

13. nativeit ◴[03 Jul 25 16:16 UTC] No.44456575[source]▶

>>44455505 (TP) #

[citation needed]

14. sebzim4500 ◴[03 Jul 25 16:16 UTC] No.44456578[source]▶

>>44455664 #

>LLMs? Look more and more like the Metaverse every day as concerns the economics.

ChatGPT has 800M+ weekly active users. How is that comparable to the Metaverse in any way?

replies(2): >>44456905 #>>44467623 #

15. whiplash451 ◴[03 Jul 25 16:21 UTC] No.44456633[source]▶

>>44456476 #

> snapping your fingers to wink LLMs out of existence would change essentially nothing about how the world works today

One could have said the same thing about Google in 2006

replies(1): >>44457064 #

16. benreesman ◴[03 Jul 25 16:44 UTC] No.44456905{3}[source]▶

>>44456578 #

I said as concerns the economics. It's clearly more popular than the Oculus or whatever, but it's still a money bonfire and shows no signs of changing on that front.

replies(2): >>44457858 #>>44463069 #

17. juped ◴[03 Jul 25 16:56 UTC] No.44457037{3}[source]▶

>>44455935 #

I'm just taking advantage and burning VCs' money on useful but not world-changing tools while I still can. We'll come out of it with consumer-level okay tools even if they don't reach the levels of Claude today, though.

18. kibwen ◴[03 Jul 25 16:59 UTC] No.44457064{3}[source]▶

>>44456633 #

No, not even close. By 2006 all sorts of load-bearing infrastructure was relying on Google (e.g. Gmail). Today LLMs are still on the edge of important systems, rather than underlying those systems.

replies(1): >>44457492 #

19. jrm4 ◴[03 Jul 25 17:02 UTC] No.44457079{4}[source]▶

>>44455851 #

Not even remotely in the same universe; the difference is ChatGPT is actually having an impact. People are incorporating it day-to-day in a way that NFTs never stood much of a chance of matching.

20. retsibsi ◴[03 Jul 25 17:33 UTC] No.44457364{4}[source]▶

>>44455851 #

Even if the most bearish predictions turn out to be correct, the comparison of LLMs to NFTs is a galaxy-spanning stretch.

NFTs are about as close to literally useless as it gets, and that was always obvious; 99% of the serious attention paid to them came from hustlers and speculators.

LLMs, for all their limitations, are already good at some things and useful in some ways. Even in the areas where they are (so far) too unreliable for serious use, they're not pure hype and bullshit; they're doing things that would have seemed like magic 10 years ago.

21. johnsmith1840 ◴[03 Jul 25 17:46 UTC] No.44457492{4}[source]▶

>>44457064 #

Things like BERT are a load bearing structure in data science pipelines.

I assume there are a massive number of LLM analysis pipelines out there.

I suppose it depends on whether you consider nondeterministic DS/ML pipelines "load-bearing" or not. Most are not using LLMs though.

3D-printed parts are regularly used beyond prototyping though, as tooling costs for a small company can be higher than just printing metal 3D parts. So I do somewhat agree, but the loss of productivity in software prototyping would be a massive hit if LLMs vanished.

22. strgcmc ◴[03 Jul 25 17:52 UTC] No.44457552{3}[source]▶

>>44455935 #

As a thought-exercise -- assume models continue to improve, whereas "using claude-code daily" is something you choose to do because it's useful, but is not yet at the level of "absolute necessity, can't imagine work without it". What if it does reach that level of absolute necessity?

- Is your demand inelastic at that point, if having claude-code becomes effectively required, to sustain your livelihood? Does pricing continue to increase, until it's 1%/5%/20%/50% of your salary (because hey, what's the alternative? if you don't pay, then you won't keep up with other engineers and will just lose your job completely)?

- But if tools like claude-code become such a necessity, wouldn't enterprises be the ones paying? Maybe, but maybe like health-insurance in America (a uniquely dystopian thing), your employer may pay some portion of the premiums, but they'll also pass some costs to you as the employee... Tech salaries have been cushy for a while now, but we might be entering a "K-shaped" inflection point --> if you are an OpenAI elite researcher, then you might get a $100M+ offer from Meta; but if you are an average dev doing average enterprise CRUD, maybe your wages will be suppressed because the small cabal of LLM providers can raise prices and your company HAS to pay, which means you HAVE to bear the cost (or else what? you can quit and look for another job, but who's hiring?)

This is a pessimistic take of course (and vastly oversimplified / too cynical). A more positive outcome might be, that increasing quality of AI/LLM options leads to a democratization of talent, or a blossoming of "solo unicorns"... personally I have toyed with calling this, something like a "techno-Amish utopia", in the sense that Amish people believe in self-sufficiency and are not wholly-resistant to technology (it's actually quite clever, what sorts of technology they allow for themselves or not), so what if we could take that further?

If there was a version of that Amish-mentality of loosely-federated self-sufficient communities (they have newsletters! they travel to each other! but they largely feed themselves, build their own tools, fix their own fences, etc.!), where engineers + their chosen LLM partner could launch companies from home, manage their home automation / security tech, run a high-tech small farm, live off-grid from cheap solar, use excess electricity to Bitcoin mine if they choose to, etc.... maybe there is actually a libertarian world that can arise, where we are no longer as dependent on large institutions to marshal resources, deploy capital, scale production, etc., if some of those things are more in-reach for regular people in smaller communities, assisted by AI. This of course assumes that the cabal of LLM model creators can be broken, and that you don't need to pay for Claude if the cheaper open-source-ish Llama-like alternative is good enough.

replies(1): >>44457825 #

23. rapind ◴[03 Jul 25 18:20 UTC] No.44457825{4}[source]▶

>>44457552 #

Well my business doesn't rely on AI as a competitive advantage, at least not yet anyways. So as it stands, if claude got 100x as effective, but cost 100x more, I'm not sure I could justify the cost because my market might just not be large enough. Which means I can either ditch it (for an alternative if one exists) or expand into other markets... which is appealing but a huge change from what I'm currently doing.

As usual, the answer is "it depends". I guarantee though that I'll at least start looking at alternatives when there's a huge price hike.

Also I suspect that a 100x improvement (if even possible) wouldn't just cost 100 times as much, but probably 100,000+ times as much. I also suspect that an improvement of 100x will be hyped as an improvement of 1,000x at least :)

Regardless, AI is really looking like a commodity to me. While I'm thankful for all the investment that got us here, I doubt anyone investing this late in the game at these inflated numbers is going to see a long term return (other than ponzi selling).

24. threetonesun ◴[03 Jul 25 18:24 UTC] No.44457858{4}[source]▶

>>44456905 #

LLMs as we know them via ChatGPT were a way to disrupt the search monopoly Google had for so many years. And my guess is the reason Google was in no rush to jump into that market was because they knew the economics of it sucked.

replies(1): >>44459428 #

25. lotsoweiners ◴[03 Jul 25 19:37 UTC] No.44458465{5}[source]▶

>>44455975 #

> It’s the number 1 app in the App Store.

When I checked the iOS App Store just now, something called Love Island USA is the #1 free app. Kinda makes you think….

replies(1): >>44528619 #

26. skeeter2020 ◴[03 Jul 25 20:37 UTC] No.44458961[source]▶

>>44455505 (TP) #

The author is not bearish on LLMs at all; this post is about using LLMs and code vs. LLMs with autonomous tools via MCP. An example from your set would be translation. The author says you'll get better results if you do something like ask an LLM to translate documents, review the proposed approach, ask it to review its work and maybe ask another LLM to validate the results than if you say "you've got 10K documents in English, and these tools - I speak French".

27. dingnuts ◴[03 Jul 25 21:05 UTC] No.44459185{5}[source]▶

>>44455975 #

I know it's popular; that doesn't mean it's not a fad. Consequences take time. It's easy to use but once you get burned in a serious way by the bot that's still wrong 20% of the time, you'll become more reluctant to put your coin in the slot machine.

Maybe if the AI companies start offering refunds for wrong answers, then the price per token might not be such a scam.

28. benreesman ◴[03 Jul 25 21:44 UTC] No.44459428{5}[source]▶

>>44457858 #

Right, and inb4 ads on ChatGPT to stop the bleeding. That's the default outcome at this point: quantize it down gradually to the point where it can be ad supported.

You can just see the scene from the Sorkin film where Fidji is saying to Altman: "It's time to monetize the site."

"We don't even know what it is yet, we know that it is cool."

29. sebzim4500 ◴[04 Jul 25 10:01 UTC] No.44463069{4}[source]▶

>>44456905 #

I suppose in that sense it is more like the early days of social media, where there were huge numbers of users but no one was sure how to monetize it properly.

In this case though I think the ChatGPT product line is profitable, albeit not enough to cover the R&D costs of OpenAI.

30. player1234 ◴[04 Jul 25 20:27 UTC] No.44467623{3}[source]▶

>>44456578 #

I can give away 800M+ of anything for free. How many of these users are willing to pay OpenAI enough for full ROI and profits on top?

replies(1): >>44471525 #

31. sebzim4500 ◴[05 Jul 25 10:02 UTC] No.44471525{4}[source]▶

>>44467623 #

>I can give away 800M+ of anything for free

No you can't, be serious. 10% of the global population is using their service, you can't just pretend that isn't impressive.

There are a lot of free websites, they do not have 800M users.

32. basch ◴[11 Jul 25 05:15 UTC] No.44528619{6}[source]▶

>>44458465 #

It's back to #1. The show is towards the end of the season and people need to vote for their favorite couples. It'll drop off the chart soon. ChatGPT will not.

A little surprised to see Threads at 4 though.
