Most active commenters
  • godelski(7)
  • Tijdreiziger(5)
  • scarface_74(4)
  • fenomas(3)
  • donkeybeer(3)

←back to thread

A non-anthropomorphized view of LLMs

(addxorrol.blogspot.com)
475 points zdw | 33 comments | | HN request time: 2.27s | source | bottom
Show context
Al-Khwarizmi ◴[] No.44487564[source]
I have the technical knowledge to know how LLMs work, but I still find it pointless to not anthropomorphize, at least to an extent.

The language of "generator that stochastically produces the next word" is just not very useful when you're talking about, e.g., an LLM that is answering complex world modeling questions or generating a creative story. It's at the wrong level of abstraction, just as if you were discussing an UI events API and you were talking about zeros and ones, or voltages in transistors. Technically fine but totally useless to reach any conclusion about the high-level system.

We need a higher abstraction level to talk about higher level phenomena in LLMs as well, and the problem is that we have no idea what happens internally at those higher abstraction levels. So, considering that LLMs somehow imitate humans (at least in terms of output), anthropomorphization is the best abstraction we have, hence people naturally resort to it when discussing what LLMs can do.

replies(18): >>44487608 #>>44488300 #>>44488365 #>>44488371 #>>44488604 #>>44489139 #>>44489395 #>>44489588 #>>44490039 #>>44491378 #>>44491959 #>>44492492 #>>44493555 #>>44493572 #>>44494027 #>>44494120 #>>44497425 #>>44500290 #
grey-area ◴[] No.44487608[source]
On the contrary, anthropomorphism IMO is the main problem with narratives around LLMs - people are genuinely talking about them thinking and reasoning when they are doing nothing of that sort (actively encouraged by the companies selling them) and it is completely distorting discussions on their use and perceptions of their utility.
replies(13): >>44487706 #>>44487747 #>>44488024 #>>44488109 #>>44489358 #>>44490100 #>>44491745 #>>44493260 #>>44494551 #>>44494981 #>>44494983 #>>44495236 #>>44496260 #
1. fenomas ◴[] No.44488109[source]
When I see these debates it's always the other way around - one person speaks colloquially about an LLM's behavior, and then somebody else jumps on them for supposedly believing the model is conscious, just because the speaker said "the model thinks.." or "the model knows.." or whatever.

To be honest the impression I've gotten is that some people are just very interested in talking about not anthropomorphizing AI, and less interested in talking about AI behaviors, so they see conversations about the latter as a chance to talk about the former.

replies(4): >>44488326 #>>44489402 #>>44489673 #>>44492369 #
2. latexr ◴[] No.44488326[source]
Respectfully, that is a reflection of the places you hang out in (like HN) and not the reality of the population.

Outside the technical world it gets much worse. There are people who killed themselves because of LLMs, people who are in love with them, people who genuinely believe they have “awakened” their own private ChatGPT instance into AGI and are eschewing the real humans in their lives.

replies(2): >>44488412 #>>44489321 #
3. fenomas ◴[] No.44488412[source]
Naturally I'm aware of those things, but I don't think TFA or GGP were commenting on them so I wasn't either.
4. Xss3 ◴[] No.44489321[source]
The other day a good friend of mine with mental health issues remarked that "his" chatgpt understands him better than most of his friends and gives him better advice than his therapist.

It's going to take a lot to get him out of that mindset and frankly I'm dreading trying to compare and contrast imperfect human behaviour and friendships with a sycophantic AI.

replies(2): >>44493792 #>>44495382 #
5. positron26 ◴[] No.44489402[source]
Most certainly the conversation is extremely political. There are not simply different points of view. There are competitive, gladiatorial opinions ready to ambush anyone not wearing the right colors. It's a situation where the technical conversation is drowning.

I suppose this war will be fought until people are out of energy, and if reason has no place, it is reasonable to let others tire themselves out reiterating statements that are not designed to bring anyone closer to the truth.

replies(1): >>44493823 #
6. scarface_74 ◴[] No.44489673[source]
Wait until a conversation about “serverless” comes up and someone says there is no such thing because there are servers somewhere as if everyone - especially on HN -doesn’t already know that.
replies(1): >>44491614 #
7. Tijdreiziger ◴[] No.44491614[source]
Why would everyone know that? Not everyone has experience in sysops, especially not beginners.

E.g. when I first started learning webdev, I didn’t think about ‘servers’. I just knew that if I uploaded my HTML/PHP files to my shared web host, then they appeared online.

It was only much later that I realized that shared webhosting is ‘just’ an abstraction over Linux/Apache (after all, I first had to learn about those topics).

replies(2): >>44492437 #>>44495406 #
8. Wowfunhappy ◴[] No.44492369[source]
As I write this, Claude Code is currently opening and closing various media files on my computer. Sometimes it plays the file for a few seconds before closing it, sometimes it starts playback and then seeks to a different position, sometimes it fast forwards or rewinds, etc.

I asked Claude to write a E-AC3 audio component so I can play videos with E-AC3 audio in the old version of QuickTime I really like using. Claude's decoder includes the ability to write debug output to a log file, so Claude is studying how QuickTime and the component interact, and it's controlling QuickTime via Applescript.

Sometimes QuickTime crashes, because this ancient API has its roots in the classic Mac OS days and is not exactly good. Claude reads the crash logs on its own—it knows where they are—and continues on its way. I'm just sitting back and trying to do other things while Claude works, although it's a little distracting that something else is using my computer at the same time.

I really don't want to anthropomorphize these programs, but it's just so hard when it's acting so much like a person...

replies(1): >>44495063 #
9. scarface_74 ◴[] No.44492437{3}[source]
I am saying that most people who come on HN and say “there is no such thing as serverless and there are servers somewhere” think they are sounding smart when they are adding nothing to the conversation.

I’m sure you knew that your code was running on computers somewhere even when you first started and wasn’t running in a literal “cloud”.

It’s about as tiring as people on HN who know just a little about LLMs thinking they are sounding smart when they say they are just advanced autocomplete. Both responses are just as unproductive

replies(1): >>44493048 #
10. Tijdreiziger ◴[] No.44493048{4}[source]
> I’m sure you knew that your code was running on computers somewhere even when you first started and wasn’t running in a literal “cloud”.

Meh, I just knew that the browser would display HTML if I wrote it, and that uploading the HTML files made them available on my domain. I didn’t really think about where the files went, specifically.

Try asking an average high school kid how cloud storage works. I doubt you’ll get any further than ‘I make files on my Google Docs and then they are saved there’. This is one step short of ‘well, the files must be on some system in some data center’.

I really disagree that “people who come on HN and say “there is no such thing as serverless and there are servers somewhere” think they are sounding smart when they are adding nothing to the conversation.” On the contrary, it’s an invitation to beginning coders to think about what the ‘serverless’ abstraction actually means.

replies(2): >>44496405 #>>44496463 #
11. bonoboTP ◴[] No.44493792{3}[source]
It's surprisingly common on reddit that people talk about "my chatgpt", and they don't always seem like the type who are "in a relationship" with the bot or unlocking the secrets of the cosmos with it, but still they write "my chatgpt" and "your chatgpt". I guess the custom prompt and the available context does customize the model for them in some sense, but I suspect they likely have a wrong mental model of how this customization works. I guess they imagine it as their own little model being stored on file at OpenAI and as they interact with it, it's being shaped by it, and each time they connect, their model is retrieved from the cloud storage and they connect to it or something.
12. bonoboTP ◴[] No.44493823[source]
If this tech is going to be half as impactful as its proponents predict, then I'd say it's still under-politicized. Of course the politics around it doesn't have to be knee-jerk mudslinging, but it's no surprise that politics enters the picture when the tech can significantly transform society.
replies(1): >>44496330 #
13. godelski ◴[] No.44495063[source]
Would it help you to know that trial and error is a common tactic by machines? Yes, humans do it too, but that doesn't mean the process isn't mechanical. In fact, in computing we might call this a "brute force" approach. You don't have to cover the entire search space to brute force something, and it certainly doesn't mean you can't have optimization strategies and need to grid search (e.g. you can use Bayesian methods, multi-armed bandit approaches, or a whole world of things).

I would call "fuck around and find out" a rather simple approach. It is why we use it! It is why lots of animals use it. Even very dumb animals use it. Though, we do notice more intelligent animals use more efficient optimization methods. All of this is technically hypothesis testing. Even a naive grid search. But that is still in the class of "fuck around and find out" or "brute force", right?

I should also mention two important things.

1) as a human we are biased to anthropomorphize. We see faces in clouds. We tell stories of mighty beings controlling the world in an effort to explain why things happen. This is anthropomorphization of the universe itself!

2) We design LLMs (and many other large ML systems) to optimize towards human preference. This reinforces an anthropomorphized interpretation.

The reason for doing this (2) is based on a naive assumption[0]: If it looks like a duck, swims like a duck, and quacks like a duck, then it *probably* is a duck. But the duck test doesn't rule out a highly sophisticated animatronic. It's a good rule of thumb, but wouldn't it also be incredibly naive to assume that it *is* a duck? Isn't the duck test itself entirely dependent on our own personal familiarity with ducks? I think this is important to remember and can help combat our own propensity for creating biases.

[0] It is not a bad strategy to build in that direction. When faced with many possible ways to go, this is a very reasonable approach. The naive part is if you assume that it will take you all the way to making a duck. It is also a perilous approach because you are explicitly making it harder for you to evaluate. It is, in the fullest sense of the phrase, "metric hacking."

replies(1): >>44495413 #
14. lelanthran ◴[] No.44495382{3}[source]
> The other day a good friend of mine with mental health issues remarked that "his" chatgpt understands him better than most of his friends and gives him better advice than his therapist.

The therapist thing might be correct, though. You can send a well-adjusted person to three renowned therapists and get three different reasons for why they need to continue sessions.

No therapist ever says "Congratulations, you're perfectly normal. Now go away and come back when you have a real problem." Statistically it is vanishingly unlikely that every person who ever visited a therapist is in need of a second (more more) visit.

The main problem with therapy is a lack of objectivity[1]. When people talk about what their sessions resulted in, it's always "My problem is that I'm too perfect". I've known actual bullies whose therapist apparently told them that they are too submissive and need to be more assertive.

The secondary problem is that all diagnosis is based on self-reported metrics of the subject. All improvement is equally based on self-reported metrics. This is no different from prayer.

You don't have a medical practice there; you've got an Imam and a sophisticated but still medically-insured way to plead with thunderstorms[2]. I fail to see how an LLM (or even the Rogerian a-x doctor in Emacs) will do worse on average.

After all, if you're at a therapist and you're doing most of the talking, how would an LLM perform worse than the therapist?

----------------

[1] If I'm at a therapist, and they're asking me to do most of the talking, I would damn well feel that I am not getting my moneys worth. I'd be there primarily to learn (and practice a little) whatever tools they can teach me to handle my $PROBLEM. I don't want someone to vent at, I want to learn coping mechanisms and mitigation strategies.

[2] This is not an obscure reference.

replies(1): >>44497017 #
15. godelski ◴[] No.44495406{3}[source]
I think they fumbled with wording but I interpreted them as meaning "audience of HN" and it seems they confirmed.

We always are speaking to our audience, right? This is also what makes more general/open discussions difficult (e.g. talking on Twitter/Facebook/etc). That there are many ways to interpret anything depending on prior knowledge, cultural biases, etc. But I think it is fair that on HN we can make an assumption that people here are tech savvy and knowledgeable. We'll definitely overstep and understep at times, but shouldn't we also cultivate a culture where it is okay to ask and okay to apologize for making too much of an assumption?

I mean at the end of the day we got to make some assumptions, right? If we assume zero operating knowledge then comments are going to get pretty massive and frankly, not be good at communicating with a niche even if better at communicating with a general audience. But should HN be a place for general people? I think no. I think it should be a place for people interested in computers and programming.

replies(1): >>44501664 #
16. Wowfunhappy ◴[] No.44495413{3}[source]
It wasn't a simple brute force. When Claude was working this morning, it was pretty clearly only playing a file when it actually needed to see packets get decoded, otherwise it would simply open and close the document. Similarly, it would only seek or fast forward when it was debugging specific issues related to those actions. And it even "knew" which test files to open for specific channel layouts.

Yes this is still mechanical in a sense, but then I'm not sure what behavior you wouldn't classify as mechanical. It's "responding" to stimuli in logical ways.

But I also don't quite know where I'm going with this. I don't think LLMs are sentient or something, I know they're just math. But it's spooky.

replies(1): >>44495544 #
17. godelski ◴[] No.44495544{4}[source]

  > It wasn't a simple brute force.
I think you misunderstood me.

"Simple" is the key word here, right? You agree that it is still under the broad class of "brute force"?

I'm not saying Claude is naively brute forcing. In fact, with lack of interpretibility of these machines it is difficult to say what kind of optimization it is doing and how complex that it (this was a key part tbh).

My point was to help with this

  > I really don't want to anthropomorphize these programs, but it's just so hard when it's acting so much like a person...
Which requires you to understand how some actions can be mechanical. You admitted to cognitive dissonance (something we all do and I fully agree is hard not to do) and wanting to fight it. We're just trying to find some helpful avenues to do so.

  > It's "responding" to stimuli in logical ways.
And so too can a simple program, right? A program can respond to user input and there is certainly a logic path it will follow. Our non-ML program is likely going to have a deterministic path (there is still probabilistic programming...), but that doesn't mean it isn't logic, right?

But the real question here, which you have to ask yourself (constantly) is "how do I differentiate a complex program that I don't understand from a conscious entity?" I guarantee you that you don't have the answer (because no one does). But isn't that a really good reason to be careful about anthropomorphizing it?

That's the duck test.

How do you determine if it is a real duck or a highly sophisticated animatronic?

If you anthropomorphize, you rule out the possibility that it is a highly sophisticated animatronic and you *MUST* make the assumption that you are not only an expert, but a perfect, duck detector. But simultaneously we cannot rule out that it is a duck, right? Because, we aren't a perfect duck detector *AND* we aren't an expert in highly sophisticated animatronics (especially of the duck kind).

Remember, there are not two answers to every True-False question, there are three. Every True-False question either has an answer of "True", "False", or "Indeterminate". So don't naively assume it is binary. We all know the Halting Problem, right? (also see my namesake or quantum physics if you want to see such things pop up outside computing)

Though I agree, it can be very spooky. But that only increases the importance of trying to develop mental models that help us more objectively evaluate things. And that requires "indeterminate" be a possibility. This is probably the best place to start to combat the cognitive dissonance.

replies(1): >>44497330 #
18. positron26 ◴[] No.44496330{3}[source]
Go politicize it on Reddit, preferably on a political sub and not a tech sub. On this forum, I would like to expect a lot more intelligent conversation.
19. scarface_74 ◴[] No.44496405{5}[source]
If we can’t count on people on Hacker News to know that code runs on computers, what is this forum for?
replies(1): >>44501642 #
20. fenomas ◴[] No.44496463{5}[source]
> an invitation to beginning coders to think about

If that's how it's phrased, and it's in a spot where that's on-topic, then obviously nobody would mind.

This subthread is talking about cases where there's a technical conversation going on and somebody derails it to argue about terminology.

21. medvezhenok ◴[] No.44497017{4}[source]
Yup, this problem is why I think all therapists should ideally know behavioral genetics and evolutionary psychology (there is at least a plausibly objective measure there which is dissonance between the ancestral environment in which the brain developed and the modern day environment. And at least some amount of psychological problems can be explained by it).

I am a fan of the « Beat Your Genes » podcast, and while some of the prescriptions can be a bit heavy handed, most feel intuitively right. It’s approaching human problems as intelligent mammal problems, as opposed to something in a category of its own.

22. donkeybeer ◴[] No.44497330{5}[source]
I have no idea why some people take so much offense to rhe fact humans are just another machine, there's no reason why another machine can't surpass it here as in all other aveneus machines have already. Many of the reasons people give for llms not being conscious are just as applicable to humans too.
replies(2): >>44497561 #>>44498411 #
23. grey-area ◴[] No.44497561{6}[source]
Absolutely possible (I’d say even likely) for humans to be surpassed by machines who have better recall and storage already.

I’m highly skeptical this will happen with llms though, their output is superficially convincing but without depth and creativity.

24. godelski ◴[] No.44498411{6}[source]
I don't think the question is if humans are a machine or not but rather what is meant by machine. Most people interpret it as meaning deterministic and thus having no free will. That's probably not what you're trying to convey so might not be the best word to use.

But the question is what is special about the human machine? What is special about the animal machine? These are different from all the machines we have built. Is it complexity? Is it indeterministic? Is it more? Certainly these machines have feelings, and we need to account for them when interacting with them.

Though we're getting well off topic from determining if a duck is a duck or is a machine (you know what I mean by this word and that I don't mean a normal duck)

replies(1): >>44508550 #
25. Tijdreiziger ◴[] No.44501642{6}[source]
Is Hacker News only for experienced coders who are already familiar with the inner workings of the ‘cloud’?

I’d like to think that this forum is also a place for the proverbial high school kid, who’s just learned JavaScript and deployed their first site to Vercel using their school Chromebook, to learn a thing or two from the greybeards.

replies(1): >>44504107 #
26. Tijdreiziger ◴[] No.44501664{4}[source]
See my reply here: https://news.ycombinator.com/item?id=44501642
replies(1): >>44502400 #
27. godelski ◴[] No.44502400{5}[source]
I did. I referenced it too
replies(1): >>44502728 #
28. Tijdreiziger ◴[] No.44502728{6}[source]
That’s impossible, because your reply was posted 19 hours ago, while the one I linked was posted 1 hour ago :)
replies(1): >>44505029 #
29. scarface_74 ◴[] No.44504107{7}[source]
No. But it isn’t too much to expect people to know that code runs on computers and that the “cloud” is made up of computers running in a server room somewhere
30. godelski ◴[] No.44505029{7}[source]
I'm a time traveler! Jeremy Bearimy baby

Joking aside, I read too quickly. I got my wires crossed when I responded, mixing up who was who. My bad >.<

But I do agree with scarface and disagree with you. Let me try to respond to this directly. There's a lot to unpack here, but I do ask that you actually read the whole thing. There is nuance here and I think it is important.

  > I’d like to think that this forum is also a place for the proverbial high school kid ... to learn a thing or two from the greybeards.
I agree that HN is *also* this place.

But I still do not believe that means we need to assume non-expertise.

Think of HN as a place that "greybeards" (or more accurately, experts. Because only few have gray beards) hang out, but there is no gatekeeping. We don't check your credentials when you come in nor do we ask you to pass any tests of skill. It's open to all. But this is still the place "experts" hang out. Because we don't check for credentials, we'll treat noobs as peers. Why are you saying this is bad?

Anyone is welcome to sit at the "adult table", but that means having adult conversations. Right? It'd be pretty... childish... for a child to sit at the adult table and expect everyone to start talking about kid stuff.

It's okay if newbies come in and don't understand what is being discussed. In fact, being confused is the very first step to learning! I'll put it this way: the first year (maybe 2) of my PhD I was just reading papers and had no idea what was going on. I had to work and work to understand. Had to ask lots of questions to lots of people (consequently getting over the fear of feeling dumb as well as the fear of asking questions). Then, at some point in time I realized I do know what's going on and being discussed. This is a critical skill to becoming a graybeard. You'll constantly have to wade through waters where you're in well over your head.

It is learning through immersion.

We should help noobs. I frequently say "you can't have wizards without noobs." I don't want to gatekeep and I do actually think we should help the noobs. I need this to be clear[1]

BUT that doesn't mean we should change our conversations between ourselves. To do so would destroy the very reason we come here. There are so few places on the internet where you can talk and operate under the assumption that the other person is reasonable well informed about tech. Frankly, many of those places get destroyed because they get dominated by noobs who change the average level of conversation. While we don't want to kick out noobs, it is *THEIR PREROGATIVE* to ask for help and ask for people to elaborate. There's no shame in this. It's the exact same thing we expect from another expert! It is treating noobs equally. And frankly, if people do make fun of the noobs or treat them disrespectfully I'll gladly downvote, flag them, and likely chastise them. Such a response is rather common around here too (which is what makes it welcoming to noobs).

Ultimately, unless we start credential checking (aka gatekeeping) we have 2 options:

  - Treat everyone as experts
  - Treat everyone as noobs
If we have to modify our language and explain every subtle nuanced detail, well... why would I come here? I'm already a fairly verbose person, and I don't want to write textbooks. I don't expect people to read textbooks either!

I don't come to HN to teach. Nor do I want to come here to be lectured. I would find it insulting if the presumption was that I was a noob.

I come to talk with my peers. Some are direct peers, with expertise in my domain, and some are not. I happily ask questions to those with expertise in other domains and so should noobs. But unless someone makes a pretty egregious assumption (e.g. a very niche subject), then pretty much nobody is going to say something about it. That's perfectly okay. Frankly, being comfortable with not knowing and asking for someone to elaborate is one of the, if not *THE*, most important skill required to become a graybeard. You can't know everything, even about a highly specific domain. There's infinite depth and infinite breadth.

So if you don't know, just ask. It's okay. That's really the only way we can both have expert communities AND not gatekeep.

So I ask you:

Where can g̶r̶a̶y̶b̶e̶a̶r̶d̶s̶ experts go to hang out? Specifically, to hang out with other experts.

TLDR:

If you walk into a biker bar, don't chastise someone who assumes you know something about motorcycles.

[0] https://news.ycombinator.com/item?id=44492437

[1] I even taught a lot during my PhD and was a rather popular TA. The reason being that I am more than happy to help and even would extend my office hours to make sure students got their questions answered. A class is formed through a partnership, not a dictatorship.

31. donkeybeer ◴[] No.44508550{7}[source]
What is indeterminism here? I am not sure the question having or not having free will has any impact of how to make human machines. We are just as in the dark about the future, if we have free will or not. I am not certain of any physical problem in which free will or lack of it plays a role. I could be wrong. So its probably an interesting question but rather pointless.

Even with the everyday machines and programs we have, we can make it behave based on random input taken for example from physical noise. It doesn't suddenly make it a special or different type of machine.

replies(1): >>44515911 #
32. godelski ◴[] No.44515911{8}[source]
I think you missed my point. I agree, being a machine doesn't mean you don't have free will. I agree, free will is orthogonal to being (or not being) a machine.

But that's not what my comment was about.

My comment was about *what the average person interprets*.

You asked why people take offense to being called a machine, and I'm trying to explain that. But to understand this we have to understand that there isn't a singular objective way to interpret statements. We can agree that language is fuzzy, right?

So let me try to translate, again.

You say: "People are machines"

(Many) People hear: "People are mechanical automata, running pre-defined routines"

I hear you, this is not what you are trying to communicate. That's not what you want them to hear. But if you want them to hear what you actually mean it is very helpful to understand that some people will hear something different.

Why do they hear the other thing? Because they don't have intimate familiarity with machines and how general that word is. *You have a better understanding of what a machine is than most people.* That's likely the cause for miscommunication.

When they think of a machine they think of things like a car, a computer, a blender, a TV, an oven, or a multitude of other similar things. Even if some of these use probabilistic programming, the average person is not going to know what probabilistic programming even is. They just see something mechanical. Deterministic.

I'm sure you know this, but it is worth reiterating. Communication has 3 main components: What you intend to communicate, the words/gestures/etc you use to communicate, and what the other person hears. Unfortunately (fortunately?) we can't communicate telepathically, so don't forget that the person you're talking to can have a reasonable interpretation that is significantly different from what you intended to say.

replies(1): >>44516027 #
33. donkeybeer ◴[] No.44516027{9}[source]
Oh of course. I feel I should have been clearer that I meant among technical persons, not average randoms.

When talking about people who are not mathematicians or computer scientists, on average, yes absolutely they hear something like that when told humans are machines.