As was discussed in a subthread on HN a few weeks ago, the key to developing successful LLM applications is going to be figuring out how to put in the necessary business-specific guardrails, with a fallback to a human in the loop.
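Concretely, that guardrail-plus-fallback loop can be pretty small. A rough sketch of the shape it might take (llm, checks, and escalate are placeholders for whatever model call, business-rule validators, and human review queue you already have, not any particular library):

    from dataclasses import dataclass
    from typing import Callable, Optional

    @dataclass
    class Draft:
        text: str
        violations: list[str]

    def generate_with_guardrails(prompt: str,
                                 llm: Callable[[str], str],
                                 checks: list[Callable[[str], Optional[str]]],
                                 escalate: Callable[[Draft], str],
                                 max_retries: int = 2) -> str:
        # Generate, validate against business rules, retry with feedback,
        # and hand off to a human if the output still doesn't pass.
        draft = llm(prompt)
        for attempt in range(max_retries + 1):
            violations = [msg for check in checks
                          if (msg := check(draft)) is not None]
            if not violations:
                return draft
            if attempt < max_retries:
                draft = llm(prompt + "\n\nYour previous answer broke these rules:\n"
                            + "\n".join("- " + v for v in violations))
        # Still failing after retries: escalate instead of shipping a bad answer.
        return escalate(Draft(text=draft, violations=violations))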
The difference is that humans eventually learn. We accept that someone who joins a team will be net-negative for the first few days, weeks, or even months. But if they kept making the same mistakes that were pointed out in their first code review, the way LLMs do, we would eventually fire them.
Worst case, if the model doesn't pick this up on its own, you can simply "teach" it by updating the prompt to watch for that specific mistake (similar to how we often add a regression test when we catch a bug).
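The "add a test when you catch a bug" analogy translates almost directly: keep a running list of rules the model has violated and fold them into every future prompt. A rough sketch, with a hypothetical review_rules.json as the store:

    import json
    from pathlib import Path

    RULES_FILE = Path("review_rules.json")  # hypothetical store of lessons learned

    def load_rules() -> list[str]:
        return json.loads(RULES_FILE.read_text()) if RULES_FILE.exists() else []

    def add_rule(rule: str) -> None:
        # Record a mistake a reviewer caught, like adding a regression test for a bug.
        rules = load_rules()
        if rule not in rules:
            rules.append(rule)
            RULES_FILE.write_text(json.dumps(rules, indent=2))

    def build_system_prompt(base: str) -> str:
        rules = load_rules()
        if not rules:
            return base
        return base + "\n\nKnown mistakes to avoid:\n" + "\n".join("- " + r for r in rules)

    # After a reviewer flags a repeated mistake:
    # add_rule("Never quote an amount without its currency code.")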
But it need not even be that cumbersome. Even weaker models do surprisingly well with broad guidelines. Case in point: https://news.ycombinator.com/item?id=42150769