A non-anthropomorphized view of LLMs

(addxorrol.blogspot.com)

477 points zdw | 1 comments | 06 Jul 25 22:26 UTC | HN request time: 0.298s | source

Show context

Al-Khwarizmi ◴[07 Jul 25 07:19 UTC] No.44487564[source]▶

I have the technical knowledge to know how LLMs work, but I still find it pointless to not anthropomorphize, at least to an extent.

The language of "generator that stochastically produces the next word" is just not very useful when you're talking about, e.g., an LLM that is answering complex world modeling questions or generating a creative story. It's at the wrong level of abstraction, just as if you were discussing an UI events API and you were talking about zeros and ones, or voltages in transistors. Technically fine but totally useless to reach any conclusion about the high-level system.

We need a higher abstraction level to talk about higher level phenomena in LLMs as well, and the problem is that we have no idea what happens internally at those higher abstraction levels. So, considering that LLMs somehow imitate humans (at least in terms of output), anthropomorphization is the best abstraction we have, hence people naturally resort to it when discussing what LLMs can do.

replies(18): >>44487608 #>>44488300 #>>44488365 #>>44488371 #>>44488604 #>>44489139 #>>44489395 #>>44489588 #>>44490039 #>>44491378 #>>44491959 #>>44492492 #>>44493555 #>>44493572 #>>44494027 #>>44494120 #>>44497425 #>>44500290 #

grey-area ◴[07 Jul 25 07:28 UTC] No.44487608[source]▶

>>44487564 #

On the contrary, anthropomorphism IMO is the main problem with narratives around LLMs - people are genuinely talking about them thinking and reasoning when they are doing nothing of that sort (actively encouraged by the companies selling them) and it is completely distorting discussions on their use and perceptions of their utility.

replies(13): >>44487706 #>>44487747 #>>44488024 #>>44488109 #>>44489358 #>>44490100 #>>44491745 #>>44493260 #>>44494551 #>>44494981 #>>44494983 #>>44495236 #>>44496260 #

cmenge ◴[07 Jul 25 07:43 UTC] No.44487706[source]▶

>>44487608 #

I kinda agree with both of you. It might be a required abstraction, but it's a leaky one.

Long before LLMs, I would talk about classes / functions / modules like "it then does this, decides the epsilon is too low, chops it up and adds it to the list".

The difference I guess it was only to a technical crowd and nobody would mistake this for anything it wasn't. Everybody know that "it" didn't "decide" anything.

With AI being so mainstream and the math being much more elusive than a simple if..then I guess it's just too easy to take this simple speaking convention at face value.

EDIT: some clarifications / wording

replies(4): >>44488265 #>>44488849 #>>44489378 #>>44489702 #

flir ◴[07 Jul 25 09:11 UTC] No.44488265[source]▶

>>44487706 #

Agreeing with you, this is a "can a submarine swim" problem IMO. We need a new word for what LLMs are doing. Calling it "thinking" is stretching the word to breaking point, but "selecting the next word based on a complex statistical model" doesn't begin to capture what they're capable of.

Maybe it's cog-nition (emphasis on the cog).

replies(9): >>44488292 #>>44488690 #>>44489190 #>>44489381 #>>44489974 #>>44491127 #>>44491731 #>>44495034 #>>44497480 #

LeonardoTolstoy ◴[07 Jul 25 10:21 UTC] No.44488690[source]▶

>>44488265 #

What does a submarine do? Submarine? I suppose you "drive" a submarine which is getting to the idea: submarines don't swim because ultimately they are "driven"? I guess the issue is we don't make up a new word for what submarines do, we just don't use human words.

I think the above poster gets a little distracted by suggesting the models are creative which itself is disputed. Perhaps a better term, like above, would be to just use "model". They are models after all. We don't make up a new portmanteau for submarines. They float, or drive, or submarine around.

So maybe an LLM doesn't "write" a poem, but instead "models a poem" which maybe indeed take away a little of the sketchy magic and fake humanness they tend to be imbued with.

replies(7): >>44488901 #>>44489424 #>>44489509 #>>44490723 #>>44490885 #>>44491594 #>>44492786 #

j0057 ◴[07 Jul 25 17:40 UTC] No.44492786[source]▶

>>44488690 #

A submarine is a boat and boats sail.

replies(2): >>44493077 #>>44496482 #

TimTheTinker ◴[07 Jul 25 18:07 UTC] No.44493077[source]▶

>>44492786 #

An LLM is a stochastic generative model and stochastic generative models ... generate?

replies(1): >>44493630 #

1. LeonardoTolstoy ◴[07 Jul 25 19:06 UTC] No.44493630[source]▶

>>44493077 #

And we are there. A boat sails, and a submarine sails. A model generates makes perfect sense to me. And saying chatgpt generated a poem feels correct personally. Indeed a model (e.g. a linear regression) generates predictions for the most part.

↑