
223 points benkaiser | 2 comments
ChuckMcM ◴[] No.42544992[source]
Interesting that it takes an LLM with 405 BILLION parameters to accurately recall text from a document of slightly less than 728 THOUSAND words (not quite six decimal orders of magnitude smaller, but still).
replies(3): >>42545195 #>>42545228 #>>42545577 #
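The scale gap in the comment above is easy to check with a few lines of Python (a sketch: the 405 billion figure is the commenter's stated parameter count, and 728 thousand the stated word count of the document):

```python
import math

params = 405_000_000_000  # parameter count cited in the comment (405 billion)
words = 728_000           # approximate word count of the document

ratio = params / words                 # parameters per word of source text
orders = math.log10(ratio)             # gap in decimal orders of magnitude

print(f"parameters per word: {ratio:,.0f}")
print(f"orders of magnitude: {orders:.2f}")
```

This works out to roughly 556,000 parameters per word, a gap of about 5.7 decimal orders of magnitude.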
1. Sabinus ◴[] No.42545195[source]
I don't think it's necessarily about the parameter count so much as the amount of training material about the Bible relative to the rest of the training data: higher-parameter models can retain more Bible information even when a larger proportion of their training covers other topics.
replies(1): >>42545225 #
2. ChuckMcM ◴[] No.42545225[source]
I would be interested to hear your thoughts on what a parameter in an LLM represents.