
277 points simianwords | 2 comments
amelius ◴[] No.45149170[source]
They hallucinate because it's an ill-defined problem with two conflicting use cases:

1. If I tell it the first two lines of a story, I want the LLM to complete the story. This requires hallucination: it has to make things up, because the story has to be original.

2. If I ask it a question, I want it to reply with facts. It should not make up stuff.

LMs were originally designed for (1) because researchers thought that (2) was out of reach. But it turned out that, without any fundamental changes, LMs could do a little bit of (2), and since that discovery things have improved, though not to the point that hallucination has disappeared or is under control.
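The point that one mechanism serves both use cases can be illustrated with temperature sampling. This is a toy sketch, not how any particular vendor configures its models: the same next-token scores yield near-deterministic "answer" behavior at low temperature and varied "storytelling" behavior at high temperature.

```python
import math
import random

def sample_next_token(logits, temperature=1.0):
    """Sample a token index from raw scores with temperature scaling.

    Low temperature -> near-deterministic, answer-like output;
    high temperature -> more varied, story-like output.
    (Toy example; real LLMs do this over ~100k-token vocabularies.)
    """
    if temperature <= 0:
        # Greedy decoding: always pick the highest-scoring token.
        return max(range(len(logits)), key=lambda i: logits[i])
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    weights = [e / total for e in exps]
    return random.choices(range(len(logits)), weights=weights)[0]

# Same model scores, two decoding regimes:
logits = [2.0, 1.0, 0.1]
greedy = sample_next_token(logits, temperature=0)     # always index 0
creative = sample_next_token(logits, temperature=2.0)  # any index possible
```

Nothing in the sampler distinguishes "made up" from "factual" tokens; both regimes draw from the same learned distribution, which is the root of the tension between the two use cases.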

wavemode ◴[] No.45149354[source]
Indeed - as Rebecca Parsons puts it, all an LLM knows how to do is hallucinate. Users just tend to find some of these hallucinations useful, and some not.
fumeux_fume ◴[] No.45149593[source]
In the article, OpenAI defines hallucinations as "plausible but false statements generated by language models." So clearly that's not all LLMs know how to do. I don't think Parsons is working from a useful or widely agreed-upon definition of hallucination, which leads to these "hot takes" that just clutter and muddy the conversation around how to reduce hallucinations and produce more useful models.
mcphage ◴[] No.45149738[source]
LLMs don’t know the difference between true and false, or that there even is a difference between true and false, so I think it’s OpenAI whose definition is not useful. As for widely agreed upon, well, I’m assuming the purpose of this post is to try and reframe the discussion.
hodgehog11 ◴[] No.45149873[source]
If an LLM outputs a statement, that statement is by definition either true or false, and we can determine which. Whether the LLM "knows" is irrelevant. The OpenAI definition is useful because it implies hallucination is something that can be logically avoided.

> I’m assuming the purpose of this post is to try and reframe the discussion

It's to establish a meaningful and practical definition of "hallucinate" so we can actually make some progress. If everything is a hallucination, as the other comments seem to suggest, then the term is a tautology and of no use to us.

username223 ◴[] No.45154382{3}[source]
"Logically avoided?"

OpenAI has a machine that emits plausible text. They're trying to argue that "emitting plausible text" is the hard problem, and "modeling the natural world, human consciousness, society, etc." is the easy one.

hodgehog11 ◴[] No.45156039[source]
Hmm, I don't see where they have suggested this, could you point to where this is? If they do argue for this, then I would also disagree with them.

Modelling those things is a separate problem from emitting plausible text, and pursuing one is not necessarily beneficial to the other. It seems more sensible to pursue separate models for each of these tasks.