A non-anthropomorphized view of LLMs

(addxorrol.blogspot.com)
475 points by zdw | 3 comments
Al-Khwarizmi No.44487564
I have the technical background to know how LLMs work, but I still find it pointless not to anthropomorphize, at least to an extent.

The language of "generator that stochastically produces the next word" is just not very useful when you're talking about, e.g., an LLM that is answering complex world-modeling questions or generating a creative story. It's at the wrong level of abstraction, just as if you were discussing a UI events API in terms of zeros and ones, or voltages in transistors. Technically accurate, but useless for reaching any conclusion about the high-level system.

We need a higher abstraction level to talk about higher-level phenomena in LLMs as well, and the problem is that we have no idea what happens internally at those higher abstraction levels. So, given that LLMs somehow imitate humans (at least in terms of output), anthropomorphization is the best abstraction we have, and people naturally resort to it when discussing what LLMs can do.

replies(18): >>44487608 #>>44488300 #>>44488365 #>>44488371 #>>44488604 #>>44489139 #>>44489395 #>>44489588 #>>44490039 #>>44491378 #>>44491959 #>>44492492 #>>44493555 #>>44493572 #>>44494027 #>>44494120 #>>44497425 #>>44500290 #
amdivia No.44493555
I beg to differ.

Anthropomorphizing might blind us to solutions to existing problems. Perhaps, instead of trying to come up with the correct prompt for an LLM, there exists a string of words (not necessarily ones that make sense) that will get the LLM to a better position to answer a given question.

When we anthropomorphize, we inherently ignore certain parts of how LLMs work and imagine parts that don't even exist.
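To make that concrete, here is a minimal sketch of what such a non-anthropomorphic approach could look like: a blind random search over prompt prefixes, scored purely by measured accuracy on a tiny eval set, with no requirement that the winning prefix read as sensible English. Everything in it is a hypothetical placeholder (query_llm, the eval pairs, the token pool); it's an illustration of the idea, not a real harness.

    import random

    # Placeholder for an actual model call; not a real API.
    def query_llm(prompt: str) -> str:
        raise NotImplementedError("plug in a real model call here")

    # Tiny, made-up eval set of (question, expected substring) pairs.
    EVAL_SET = [
        ("What is the capital of Australia?", "Canberra"),
        ("What is 17 * 23?", "391"),
    ]

    # Candidate tokens for the prefix; they don't have to form sensible English,
    # because the search only cares about the measured score.
    TOKEN_POOL = ["Let's", "think", "step", "##", "verify:", "carefully",
                  "<sep>", "answer", "::", "precisely"]

    def score(prefix: str) -> float:
        hits = 0
        for question, expected in EVAL_SET:
            reply = query_llm(f"{prefix}\n{question}")
            hits += expected.lower() in reply.lower()
        return hits / len(EVAL_SET)

    def random_search(n_trials: int = 50) -> str:
        # Keep whichever prefix scores best, starting from the empty prefix.
        best_prefix, best_score = "", score("")
        for _ in range(n_trials):
            candidate = " ".join(random.choices(TOKEN_POOL, k=random.randint(2, 6)))
            s = score(candidate)
            if s > best_score:
                best_prefix, best_score = candidate, s
        return best_prefix

Published prompt-optimization methods tend to use gradient-guided or evolutionary search rather than blind sampling, but the point stands: nothing in the loop requires a mental model of the LLM "understanding" the prefix.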

replies(1): >>44493581 #
1. meroes No.44493581
> there exists a string of words (not necessarily ones that make sense) that will get the LLM to a better position to answer

Exactly. The opposite is also true: you might supply more clarifying information to the LLM, the kind that would help any human answer, and it actually degrades the LLM's output.
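If you want to see whether that is happening on your own tasks, a simple A/B check is enough: score the same questions with and without the extra "clarifying" context. As above, query_llm and the eval pairs are hypothetical placeholders; this is a sketch of the measurement, not a real benchmark.

    # Does extra "clarifying" context help or hurt? A/B it.
    def query_llm(prompt: str) -> str:
        raise NotImplementedError("plug in a real model call here")

    # Made-up eval pairs: (question, substring expected in a correct answer).
    EVAL_SET = [
        ("How many days are in a leap year?", "366"),
        ("What gas do plants absorb during photosynthesis?", "carbon dioxide"),
    ]

    # The kind of background a human would find helpful, or at least harmless.
    EXTRA_CONTEXT = ("For background: I'm writing a quiz app, I care about "
                     "correctness, and please keep answers short.")

    def accuracy(with_context: bool) -> float:
        hits = 0
        for question, expected in EVAL_SET:
            prompt = f"{EXTRA_CONTEXT}\n{question}" if with_context else question
            hits += expected.lower() in query_llm(prompt).lower()
        return hits / len(EVAL_SET)

    # If accuracy(True) < accuracy(False), the "helpful" context degraded the output.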

replies(1): >>44493895 #
2. mvieira38 No.44493895
This is frequently the case IME, especially with chat interfaces. One or two bad messages and you derail the quality of the rest of the conversation.
replies(1): >>44494533 #
3. lawlessone No.44494533
You can also just throw in words to bias it towards certain outcomes. The same applies to image generators, of course.