rhubarbtree:
I find this rather oddly phrased.

LLMs hallucinate because they are language models. They are stochastic models of language. They model language, not truth.

If “truthy” responses are common in the training set for a given prompt, you're more likely to get something useful as output. It feels like we stumbled into that property, decided the models are useful as information-retrieval tools, and now use RL to reinforce that useful behaviour. But it's still a (biased) language model.
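
To make the "stochastic model of language" point concrete, here's a toy sketch in Python (invented tokens and probabilities, not from any real model): the model samples a continuation from a probability distribution, so a likely-but-false option still comes out some fraction of the time.

    import numpy as np

    # Toy next-token distribution for some prompt, e.g.
    # "The capital of Australia is". Probabilities are made up
    # purely for illustration.
    tokens = ["Canberra", "Sydney", "Melbourne"]
    probs = [0.6, 0.3, 0.1]

    rng = np.random.default_rng(0)
    # Sampling knows nothing about truth, only about likelihood:
    # the wrong answer "Sydney" is emitted roughly 30% of the time.
    for _ in range(10):
        print(rng.choice(tokens, p=probs))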

I don’t think that’s how humans work. There’s more to it. We need a model of language, but it’s not sufficient to explain our mental mechanisms. We have other ways of thinking than generating language fragments.

Trying to eliminate cases where a stochastic model the size of an LLM gives “undesirable” or “untrue” responses seems rather odd.

didibus:
I agree with everything you said except:

> Trying to eliminate cases where a stochastic model the size of an LLM gives “undesirable” or “untrue” responses seems rather odd.

Take it back to what it is, like you say: a predictive model. The work of any ML scientist is to iterate on the model and push its accuracy on unseen data, so it makes sense to tune it to lower the rate of predictive errors. And because perfect predictive accuracy is rarely achievable, you have to make judgment calls between precision and recall, which, in the case of LLMs, directly affects how often the model hallucinates versus how often it stays silent or overly cautious.
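
A toy sketch of that precision/recall dial (synthetic confidence scores and invented numbers, nothing model-specific): let the model abstain whenever its confidence falls below a threshold. Raising the threshold makes the answers it does give more reliable, at the cost of answering less often.

    import numpy as np

    rng = np.random.default_rng(0)
    n = 10_000

    # Hypothetical per-answer confidence scores, calibrated so that
    # an answer with confidence c is correct with probability c.
    confidence = rng.uniform(0, 1, n)
    correct = rng.uniform(0, 1, n) < confidence

    # The model only answers when confidence clears the threshold;
    # otherwise it abstains ("stays silent or overly cautious").
    for threshold in (0.0, 0.5, 0.9):
        answered = confidence >= threshold
        precision = correct[answered].mean()  # fraction of given answers that are right
        answer_rate = answered.mean()         # fraction of questions answered at all
        print(f"threshold={threshold:.1f}  "
              f"precision={precision:.2f}  answer_rate={answer_rate:.2f}")

With these toy numbers, a threshold of 0.0 gives roughly 0.50 precision at a 1.00 answer rate, while 0.9 gives roughly 0.95 precision but answers only about 10% of the time. Tuning a model to hallucinate less is choosing a point on that curve.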

rubatuga:
But we're getting into the limits of knowledge and what is true/untrue. A stochastic model will be wrong sometimes.
didibus:
Of course, 100% prediction accuracy cannot be achieved.

I just mean that if you're a team of ML scientists, you don't go: we got 76% accuracy, let's close up shop, mail in your resignations, job done.

From that angle, it's not odd at all that the team keeps working and now sees whether it can push past 76%.