
223 points benkaiser | 1 comments
jsenn ◴[] No.42545081[source]
Has there been any serious study of exactly how LLMs store and retrieve memorized sequences? There are so many interesting basic questions here.

Does verbatim completion of a bible passage look different from generation of a novel sequence in interesting ways? How many sequences of this length do they memorize? Do the memorized ones roughly correspond to things humans would find important enough to memorize, or do LLMs memorize just as much SEO garbage as they do bible passages?

replies(2): >>42546473 #>>42547984 #
suprjami ◴[] No.42547984[source]
LLMs do not store and retrieve sequences. LLMs are not databases. LLMs are not predictable state machines. Understand how these things work.

They take the input context and predict the next token, append it to the context, feed the whole thing back in to predict the token after that, and repeat until the chosen next token is the stop token.

If they produce something that looks like a retrieved sequence, it's because that set of tokens happened to be the most probable continuation given their training data. And at any nonzero sampling temperature, regenerating from exactly the same input has a non-zero chance of producing different output.
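That decode loop can be sketched in a few lines of Python. Note that `next_token_logits` here is a hypothetical stand-in for a real model's forward pass (it just hard-prefers a memorized phrase), not an actual LLM API:

```python
import math
import random

VOCAB = ["In", "the", "beginning", "God", "created", "<stop>"]

def softmax(logits, temperature=1.0):
    # Turn raw scores into a probability distribution.
    if temperature == 0:
        # Greedy decoding: put all probability mass on the argmax.
        best = max(range(len(logits)), key=lambda i: logits[i])
        return [1.0 if i == best else 0.0 for i in range(len(logits))]
    exps = [math.exp(l / temperature) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]

def next_token_logits(context):
    # Hypothetical stand-in for a trained model: strongly prefer
    # the next word of one "memorized" phrase, then the stop token.
    phrase = ["In", "the", "beginning", "God", "created"]
    target = phrase[len(context)] if len(context) < len(phrase) else "<stop>"
    return [5.0 if tok == target else 0.0 for tok in VOCAB]

def generate(temperature=0.0, seed=None):
    rng = random.Random(seed)
    context = []
    while True:
        probs = softmax(next_token_logits(context), temperature)
        token = rng.choices(VOCAB, weights=probs)[0]
        if token == "<stop>":
            return context
        context.append(token)  # feed the output back in as context
```

At temperature 0 this loop is deterministic and always emits the same phrase; at higher temperatures the same input can yield different outputs run to run, which is the non-determinism described above.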

replies(2): >>42548033 #>>42549432 #
jsenn ◴[] No.42549432[source]
Sure, and human brains aren’t databases either, but it’s sometimes reasonable to say that we “store” and “retrieve” knowledge. All models are wrong but some are useful.

The question I’m asking is: how does this work in an LLM? How exactly do their weights encode (seemingly) the entire Bible such that they can recreate long passages verbatim from a prompt that likely doesn’t appear anywhere in the training data (e.g. some vague description of a particular passage)?
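A deliberately tiny illustration of how "weights" can reproduce a passage verbatim without storing it as a string (this is a trigram count table, not a transformer, so it only gestures at the idea): the training text is reduced to per-context next-word counts, the string itself is discarded, and greedy prediction still reconstructs the passage from a two-word prompt.

```python
from collections import Counter, defaultdict

TRAINING_TEXT = "in the beginning god created the heaven and the earth"

# "Training": count which word follows each two-word context.
# The counts play the role of weights; after this loop the model
# never consults the original string again.
counts = defaultdict(Counter)
toks = TRAINING_TEXT.split()
for a, b, c in zip(toks, toks[1:], toks[2:]):
    counts[(a, b)][c] += 1

def complete(prompt, n_tokens):
    """Greedy next-token prediction using only the learned counts."""
    out = prompt.split()
    for _ in range(n_tokens):
        options = counts.get((out[-2], out[-1]))
        if not options:
            break  # unseen context: nothing to predict
        out.append(options.most_common(1)[0][0])
    return " ".join(out)
```

Here `complete("in the", 8)` regenerates the whole training sentence. Real LLMs are vastly more complicated (distributed representations rather than an explicit table), which is exactly why the "how" in the question above is interesting, but the same principle applies: verbatim recall can fall out of next-token statistics without any literal stored copy.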