323 points steerlabs | 6 comments
keiferski ◴[] No.46192154[source]
The thing that bothers me the most about LLMs is how they never seem to understand "the flow" of an actual conversation between humans. When I ask a person something, I expect a short reply that asks a follow-up question or requests details/clarification. A conversation is thus an ongoing "dance" in which the questioner and answerer gradually arrive at a shared meaning.

LLMs don't do this. Instead, every question is immediately met, with extreme confidence, by a paragraph or more of text. I know you can minimize this by tweaking your account settings, but to me it just highlights that it's not operating in a way remotely similar to the human-to-human exchange I described above. I constantly find myself saying, "No, I meant [concept] in this way, not that way," and then getting annoyed at the robot because it's masquerading as a human.

replies(37): >>46192230 #>>46192268 #>>46192346 #>>46192427 #>>46192525 #>>46192574 #>>46192631 #>>46192754 #>>46192800 #>>46192900 #>>46193063 #>>46193161 #>>46193374 #>>46193376 #>>46193470 #>>46193656 #>>46193908 #>>46194231 #>>46194299 #>>46194388 #>>46194411 #>>46194483 #>>46194761 #>>46195048 #>>46195085 #>>46195309 #>>46195615 #>>46195656 #>>46195759 #>>46195794 #>>46195918 #>>46195981 #>>46196365 #>>46196372 #>>46196588 #>>46197200 #>>46198030 #
ryandrake ◴[] No.46193656[source]
LLMs all behave as if they are semi-competent (yet eager, ambitious, and career-minded) interns or administrative assistants, working for a powerful CEO-founder. All sycophancy, confidence and positive energy. "You're absolutely right!" "Here's the answer you are looking for!" "Let me do that for you immediately!" "Here is everything I know about what you just mentioned." Never admitting a mistake unless you directly point it out, and then all sorry-this and apologize-that and "here's the actual answer!" It's exactly the kind of personality you always see bubbling up into the orbit of a rich and powerful tech CEO.

No surprise that these products are all dreamt up by powerful tech CEOs who are used to all of their human interactions being with servile people-pleasers. I bet every one of these models is subtly or overtly shaped by feedback from executives about how it should respond in conversation.

replies(12): >>46193679 #>>46193872 #>>46193884 #>>46194322 #>>46195018 #>>46195066 #>>46195075 #>>46195385 #>>46196040 #>>46196762 #>>46196779 #>>46213184 #
rzwitserloot ◴[] No.46193872[source]
I don't think these LLMs were explicitly designed based on the CEO's detailed input that boils down to 'reproduce these servile yes-men in LLM form please'.

Which makes it more interesting. Apparently reddit was a particularly hefty source for most LLMs; your average reddit conversation is absolutely nothing like this.

Separate observation: that kind of semi-slimy, obsequious behaviour annoys me. Significantly so. It raises my hackles; I get the feeling I'm being sold something on the sly. Even if I know the content in between all the sycophancy is objectively decent, my instant emotional response is negative, and I have to lean on my rational side to dismiss it.

But I notice plenty of people around me that respond positively to it. Some will even flat out ignore any advice if it is not couched in multiple layers of obsequious deference.

That raises a question for me: is this innate? Do all people sit somewhere on a (presumably bell-shaped) distribution of 'emotional response to such things', with the curve quite smeared out?

Because if so, that would explain why some folks have turned into absolute zealots for the AI thing, on both sides of it. If you respond negatively to it, any serious attempt to play with it should leave you feeling like it sucks to high heavens. And if you respond positively to it - the reverse.

Idle musings.

replies(3): >>46194004 #>>46194007 #>>46194403 #
1. jordanb ◴[] No.46194403[source]
The servile stuff was trained into them with RLHF, with the trainers largely being low-wage workers in the global south. That's also where some of the other quirks, like the excessive em-dash usage, came from. I think it's a combination of those workers anticipating how a first-world employer would expect them to respond, and explicit instructions given to them about how the robot should be trained.
replies(1): >>46195129 #
2. OkayPhysicist ◴[] No.46195129[source]
I suspect a lot of the em-dash usage also comes from transcriptions of verbal media. In speech, people constantly use the kinds of asides that a transcriber renders with an em-dash.
replies(1): >>46196219 #
3. mrguyorama ◴[] No.46196219[source]
I would bet like a dollar that the supposed em-dash usage (which I'm not convinced is an accurate take in the first place) would have come from an enterprising dev somewhere going "Well, we probably don't need multiple tokens for hyphens" and coercing every dash-type character into a single hyphen-like token.

But I'm also showing off my ignorance with how these machines turn text into tokens in practice.
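
To be concrete, I'm imagining a pre-tokenization cleanup pass something like this (entirely hypothetical; the function and the character list are made up for illustration):

    # Hypothetical pre-tokenization cleanup: collapse every dash
    # variant to a plain ASCII hyphen. Illustrative only; not taken
    # from any real pipeline.
    DASHES = {
        "\u2010",  # hyphen
        "\u2011",  # non-breaking hyphen
        "\u2012",  # figure dash
        "\u2013",  # en dash
        "\u2014",  # em dash
        "\u2015",  # horizontal bar
        "\u2212",  # minus sign
    }

    def normalize_dashes(text: str) -> str:
        return "".join("-" if ch in DASHES else ch for ch in text)

    print(normalize_dashes("pre\u2014tokenization \u2013 cleanup"))
    # pre-tokenization - cleanup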

replies(3): >>46196558 #>>46196634 #>>46197572 #
4. thfuran ◴[] No.46196558{3}[source]
If that were true, the tokenizer couldn't distinguish a hyphen from an em dash, which would mean the model couldn't output hyphenated words without the hyphens turning into em dashes.
5. seanmcdirmid ◴[] No.46196634{3}[source]
Two dashes is still a token. You would only be correct if LLMs were still thinking at the level of characters.
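
You can check this yourself with OpenAI's tiktoken library (a quick sketch; assumes tiktoken is installed and that the cl100k_base encoding is representative):

    # Inspect how a real BPE vocabulary treats different dash strings.
    # pip install tiktoken
    import tiktoken

    enc = tiktoken.get_encoding("cl100k_base")  # GPT-4-era encoding

    for s in ["-", "--", "\u2013", "\u2014", "well-known"]:
        print(repr(s), "->", enc.encode(s))

Each of those encodes to its own distinct token id(s), so the vocabulary keeps hyphens, double hyphens, en dashes, and em dashes apart.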
6. TeMPOraL ◴[] No.46197572{3}[source]
I think all the em-dashes came from scraping Wordpress blogs. The Wordpress editor applies "smart" typography, the em-dashes it introduces survive the HTML-to-Markdown conversion used to scrape those pages, and they end up in the training datasets.
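
Roughly this transformation, shown here as a heavily simplified Python imitation of what Wordpress's wptexturize() does with dashes (the real thing is PHP and context-sensitive, so treat this as a sketch):

    # Simplified imitation of Wordpress dash "texturizing":
    # '---' becomes an em dash, '--' becomes an en dash.
    def texturize_dashes(text: str) -> str:
        # Replace the longer run first so '---' isn't eaten by '--'.
        return text.replace("---", "\u2014").replace("--", "\u2013")

    print(texturize_dashes("scraped text -- with dashes --- everywhere"))
    # scraped text – with dashes — everywhere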

EDIT: Also PDFs authored in MS Word.