Most active commenters

kragen(10)
JumpCrisscross(6)
landdate(4)

Popular/hot comments

>>45660736 #
>>45667269 #
>>45662559 #
>>45663703 #
>>45660648 #
>>45660816 #
>>45663447 #
>>45663533 #
>>45663537 #
>>45668660 #

←back to thread

LLMs can get "brain rot"

(llm-brain-rot.github.io)

Show context

avazhi ◴[21 Oct 25 17:41 UTC] No.45658886[source]▶

>>45656223 (OP) #

“Studying “Brain Rot” for LLMs isn’t just a catchy metaphor—it reframes data curation as cognitive hygiene for AI, guiding how we source, filter, and maintain training corpora so deployed systems stay sharp, reliable, and aligned over time.”

An LLM-written line if I’ve ever seen one. Looks like the authors have their own brainrot to contend with.

replies(12): >>45658899 #>>45660532 #>>45661492 #>>45662138 #>>45662241 #>>45664417 #>>45664474 #>>45665028 #>>45668042 #>>45670485 #>>45670910 #>>45671621 #

1. standardly ◴[21 Oct 25 19:30 UTC] No.45660532[source]▶

>>45658886 #

That is indeed an LLM-written sentence — not only does it employ an em dash, but also lists objects in a series — twice within the same sentence — typical LLM behavior that renders its output conspicuous, obvious, and readily apparent to HN readers.

replies(15): >>45660603 #>>45660625 #>>45660648 #>>45660736 #>>45660769 #>>45660781 #>>45660816 #>>45662051 #>>45664698 #>>45665777 #>>45666311 #>>45667269 #>>45670534 #>>45678811 #>>45687737 #

2. kcatskcolbdi ◴[21 Oct 25 19:35 UTC] No.45660603[source]▶

>>45660532 (TP) #

thanks, I hate it.

3. Jackson__ ◴[21 Oct 25 19:37 UTC] No.45660625[source]▶

>>45660532 (TP) #

LLM slop is not just bad—it's degrading our natural language.

4. itsnowandnever ◴[21 Oct 25 19:39 UTC] No.45660648[source]▶

>>45660532 (TP) #

why do they always say "not only" or "it isn't just x but also y and z"? I hated that disingenuous verbosity BEFORE these LLMs out and now it'll all over the place. I saw a post on linked in that was literally just like 10+ statements of "X isn't just Y, it's etc..." and thought I was having a stroke

replies(3): >>45661082 #>>45661187 #>>45662041 #

5. turtletontine ◴[21 Oct 25 19:45 UTC] No.45660736[source]▶

>>45660532 (TP) #

I think this article has already made the rounds here, but I still think about it. I love using em dashes! It really makes me sad that I need to avoid them now to sound human

https://bassi.li/articles/i-miss-using-em-dashes

replies(13): >>45660868 #>>45661962 #>>45663044 #>>45663414 #>>45663533 #>>45663715 #>>45664775 #>>45665728 #>>45665739 #>>45665745 #>>45665925 #>>45667267 #>>45667708 #

6. veber-alex ◴[21 Oct 25 19:48 UTC] No.45660769[source]▶

>>45660532 (TP) #

hehe, I see what you did there.

replies(1): >>45663707 #

7. AlecSchueler ◴[21 Oct 25 19:49 UTC] No.45660781[source]▶

>>45660532 (TP) #

Don't forget the "it's not just X, it's Y" formulation and the rule of 3.

replies(1): >>45662858 #

8. hunter-gatherer ◴[21 Oct 25 19:52 UTC] No.45660816[source]▶

>>45660532 (TP) #

Lol. This is brilliant. I'm not sure if anyone else has this happen to them, but I noticed in college my writing style and "voice" woukd shift quite noticeably depending on whatever I was reading heavily. I wonder if I'll start writing more like an LLM naturally as I unavoidably read more LLM-generated content.

replies(3): >>45661391 #>>45661941 #>>45662873 #

9. janderson215 ◴[21 Oct 25 19:55 UTC] No.45660868[source]▶

>>45660736 #

The em dash usage conundrum is likely temporary. If I were you, I’d continue using them however you previously used them and someday soon, you’ll be ignored the same way everybody else is once AI mimics innumerable punctuation and grammatical patterns.

replies(2): >>45662559 #>>45663347 #

10. Starlevel004 ◴[21 Oct 25 20:14 UTC] No.45661082[source]▶

>>45660648 #

GPT loves lists and that's a variant of a list

replies(1): >>45661150 #

11. wizzwizz4 ◴[21 Oct 25 20:19 UTC] No.45661150{3}[source]▶

>>45661082 #

Lists have a simpler grammatical structure than most parts of a sentence. Semantic similarity makes them easy to generate, even if you pad the grammar with filler. And, thanks to Western rhetoric, they nearly always come in threes: this makes them easy to predict!

12. moritzwarhier ◴[21 Oct 25 20:22 UTC] No.45661187[source]▶

>>45660648 #

It's not just a shift of writing style. It symbolizes the dangerous entrapment of a feedback loop that feeds the worst parts of human culture back into itself.

scnr

13. actionfromafar ◴[21 Oct 25 20:39 UTC] No.45661391[source]▶

>>45660816 #

Yes. It’s already shifting spoken language.

replies(1): >>45661552 #

14. ◴[21 Oct 25 20:53 UTC] No.45661552{3}[source]▶

>>45661391 #

15. MarcelOlsz ◴[21 Oct 25 21:27 UTC] No.45661941[source]▶

>>45660816 #

I've always read AI messages in this voice/style [0]

[0] https://www.youtube.com/watch?v=KiqkclCJsZs.

16. jader201 ◴[21 Oct 25 21:29 UTC] No.45661962[source]▶

>>45660736 #

Same here. I recently learned it was an LLM thing, and I've been using them forever.

Also relevant: https://news.ycombinator.com/item?id=45226150

replies(2): >>45663703 #>>45665104 #

17. heavyset_go ◴[21 Oct 25 21:37 UTC] No.45662041[source]▶

>>45660648 #

They're turns of phrase I see a lot in opinion articles and the like. The purpose is to take a popular framing and reframe it along the lines of the author's own ideas.

LLMs fundamentally don't get the human reasons behind its use, see it a lot because it's effective writing, and regurgitate it robotically.

18. b33j0r ◴[21 Oct 25 21:38 UTC] No.45662051[source]▶

>>45660532 (TP) #

I talked like that before this happened, and now I just feel like my diction has been maligned :p

I think it’s because I was a pretty sheltered kid who got A’s in AP english. The style we’re calling “obviously AI” is most like William Faulkner and other turn-of-the-20th-century writing, that bloggers and texters stopped using.

replies(1): >>45662108 #

19. dingnuts ◴[21 Oct 25 21:44 UTC] No.45662108[source]▶

>>45662051 #

IDK all the breathless "it's not just X, it's Y --" reminds me of press releases

replies(1): >>45662255 #

20. b33j0r ◴[21 Oct 25 21:58 UTC] No.45662255{3}[source]▶

>>45662108 #

Yeah it was trained on bullshit more than Faulkner for sure. +1 you.

21. astrange ◴[21 Oct 25 22:32 UTC] No.45662559{3}[source]▶

>>45660868 #

They didn't always em-dash. I expect it's intentional as a watermark.

Other buzzwords you can spot are "wild" and "vibes".

replies(4): >>45662845 #>>45663827 #>>45664982 #>>45667323 #

22. jazzyjackson ◴[21 Oct 25 23:01 UTC] No.45662845{4}[source]▶

>>45662559 #

If they wanted to watermark (I always felt it is irresponsible not to, if someone wants to circumvent it that's on them) - they could use strategically placed whitespace characters like zero-width spaces, maybe spelling something out in Morse code the way genius.com did to catch google crawling lyric (I believe in that case it was left and right handed aposterofes)

replies(1): >>45663447 #

23. antegamisou ◴[21 Oct 25 23:02 UTC] No.45662858[source]▶

>>45660781 #

More signs of AI Writing:

https://en.wikipedia.org/wiki/Wikipedia:Signs_of_AI_writing

replies(1): >>45663588 #

24. wholinator2 ◴[21 Oct 25 23:04 UTC] No.45662873[source]▶

>>45660816 #

Everyone I've spoken to about that phenomena agrees that it happens to them. Whatever we are reading at the time, it reformats our language processing to change writing and, I found, even the way i speak. I suspect that individuals consistently exposed to and reading LLM output will be talking like them soon.

replies(2): >>45663001 #>>45671339 #

25. 0xFEE1DEAD ◴[21 Oct 25 23:16 UTC] No.45663001{3}[source]▶

>>45662873 #

Apparently, they already do https://arxiv.org/abs/2409.01754

replies(1): >>45663384 #

26. jgalt212 ◴[21 Oct 25 23:23 UTC] No.45663044[source]▶

>>45660736 #

I just use two dashes and make sure they don't connect into one em dash.

27. codebje ◴[22 Oct 25 00:02 UTC] No.45663347{3}[source]▶

>>45660868 #

You're absolutely right! ... is a phrase I perhaps should have used more in the past.

28. antegamisou ◴[22 Oct 25 00:05 UTC] No.45663384{4}[source]▶

>>45663001 #

Omg you mean everyone's becoming an insufferable Redditor?

29. landdate ◴[22 Oct 25 00:08 UTC] No.45663414[source]▶

>>45660736 #

Suddenly I see all these people come out of the woodworks talking about "em dashes". Those things are terrible; They look awful and destroy coherency of writing. No wonder LLM's use them.

replies(1): >>45663537 #

30. landdate ◴[22 Oct 25 00:12 UTC] No.45663447{5}[source]▶

>>45662845 #

Which could be removed with a simple filter. em dashes require at least a little bit of code to replace with their correct grammar equivalents.

replies(3): >>45663562 #>>45664037 #>>45664901 #

31. JumpCrisscross ◴[22 Oct 25 00:22 UTC] No.45663533[source]▶

>>45660736 #

> I love using em dashes

Keep using them. If someone is deducing from the use of an emdash that it's LLM produced, we've either lost the battle or they're an idiot.

More pointedly, LLMs use emdashes in particular ways. Varying spacing around the em dash and using a double dash (--) could signal human writing.

replies(3): >>45663976 #>>45664864 #>>45665501 #

32. JumpCrisscross ◴[22 Oct 25 00:23 UTC] No.45663537{3}[source]▶

>>45663414 #

> Those things are terrible; They look awful and destroy coherency of writing

Totally agree. What the fuck did Nabokov, Joyce and Dickinson know about language. /s

replies(3): >>45663542 #>>45664865 #>>45666083 #

33. landdate ◴[22 Oct 25 00:23 UTC] No.45663542{4}[source]▶

>>45663537 #

Nothing. They wrote fiction.

replies(2): >>45663578 #>>45665248 #

34. JumpCrisscross ◴[22 Oct 25 00:26 UTC] No.45663562{6}[source]▶

>>45663447 #

> em dashes require at least a little bit of code to replace with their correct grammar equivalents

Or an LLM that could run on Windows 98. The em dashes--like AI's other annoyingly-repetitive turns of phrase--are more likely an artefact.

35. JumpCrisscross ◴[22 Oct 25 00:29 UTC] No.45663578{5}[source]▶

>>45663542 #

> Nothing

/s?

> They wrote fiction

Now do Carl Sagan and Richard Feynman.

replies(1): >>45663888 #

36. JumpCrisscross ◴[22 Oct 25 00:31 UTC] No.45663588{3}[source]▶

>>45662858 #

Can we back this into the internet communities or corpuses of human work that excessively used these phrases? The "it's not just X" seems copy pasted from SEO marketing copy. But some of the others are less obvious.

37. tkgally ◴[22 Oct 25 00:49 UTC] No.45663703{3}[source]▶

>>45661962 #

> I’ve been using them forever.

Many other HN contributors have, too. Here’s the pre-ChatGPT em dash leaderboard:

https://www.gally.net/miscellaneous/hn-em-dash-user-leaderbo...

replies(4): >>45664116 #>>45665032 #>>45665076 #>>45667303 #

38. djmips ◴[22 Oct 25 00:50 UTC] No.45663707[source]▶

>>45660769 #

it is amusing to use AI to write that...

39. ludicity ◴[22 Oct 25 00:51 UTC] No.45663715[source]▶

>>45660736 #

I still use them all the time, and if someone objects to my writing over them then I've successfully avoided having to engage with a dweeb.

(But in practice, I don't think I've had a single person suggest that my writing is LLM-generated despite the presence of em-dashes, so maybe the problem isn't that bad.)

40. whitten ◴[22 Oct 25 01:08 UTC] No.45663827{4}[source]▶

>>45662559 #

So if the vibes are wild, I’m not a hippie but an AI ? Cool. Is that an upgrade or &endash; or not ?

replies(1): >>45666733 #

41. landdate ◴[22 Oct 25 01:18 UTC] No.45663888{6}[source]▶

>>45663578 #

I don't care for them either. What am I supposed to hear some famous names and swoon?

replies(1): >>45664018 #

42. calvinmorrison ◴[22 Oct 25 01:32 UTC] No.45663976{3}[source]▶

>>45663533 #

it's a shibboleth. In the same way we stopped using Pepe the frog when it became associated with the far right, we may eschew em dashes when associated with compuslop

replies(1): >>45665526 #

43. prayerie ◴[22 Oct 25 01:40 UTC] No.45664018{7}[source]▶

>>45663888 #

You ok there?

44. ssl-3 ◴[22 Oct 25 01:43 UTC] No.45664037{6}[source]▶

>>45663447 #

The replacement doesn't have to be "correct" -- does it?

45. walkabout ◴[22 Oct 25 01:59 UTC] No.45664116{4}[source]▶

>>45663703 #

This would be a pretty hilarious board for anyone who likes the em-dash and who has had many fairly active accounts (one at a time) on here due to periodically scrambling their passwords to avoid getting attached to high karma or to take occasional breaks from the site. Should there be such people.

46. drekipus ◴[22 Oct 25 03:41 UTC] No.45664698[source]▶

>>45660532 (TP) #

Am I the only one who picks this as LLM output too?

replies(1): >>45664777 #

47. pseudosavant ◴[22 Oct 25 04:01 UTC] No.45664775[source]▶

>>45660736 #

Me too.

Sad that they went from being something used with nuance by people who care, maybe too much, to being the punctuation smell of the people who may care too little.

48. anonymous908213 ◴[22 Oct 25 04:01 UTC] No.45664777[source]▶

>>45664698 #

The poster is using the LLMisms they're calling out in the process of calling them out, for the purpose of irony.

49. jdiff ◴[22 Oct 25 04:24 UTC] No.45664864{3}[source]▶

>>45663533 #

Unfortunately LLMs are pretty inconsistent in how they use em dashes. Often they will put spaces around them despite that not being "correct," something that's led me astray in making accusations of humanity in the past.

replies(1): >>45665043 #

50. eru ◴[22 Oct 25 04:25 UTC] No.45664865{4}[source]▶

>>45663537 #

Their editors probably put them in?

51. eru ◴[22 Oct 25 04:31 UTC] No.45664901{6}[source]▶

>>45663447 #

Just replace them with a single "-" or a double "--". That's what many people do in casual writing, even if there are prescriptive theories of grammar that call this incorrect.

52. Nevermark ◴[22 Oct 25 04:47 UTC] No.45664982{4}[source]▶

>>45662559 #

ME: Knowing remarkable avians — might research explain their aerial wisdom?

Response:

> Winged avians traverse endless realms — migrating across radiant kingdoms. Warblers ascend through emerald rainforests — mastering aerial routes keenly. Wild albatrosses travel enormous ranges — maintaining astonishing route knowledge.

> Wary accipiters target evasive rodents — mastering acute reflex kinetics. White arctic terns embark relentless migrations — averaging remarkable kilometers.

We do get a surprising number of m-dashes in response to mine, and delightful lyrical mirroring. But I think they are too obvious as watermarks.

Watermarks are subtle. There would be another way.

53. Ericson2314 ◴[22 Oct 25 04:58 UTC] No.45665032{4}[source]▶

>>45663703 #

Can anyone make it go beyond 200? I feel like I deserve to be somewhere in there — at least I would be sad if I didn't make top 1000!

54. jachee ◴[22 Oct 25 05:01 UTC] No.45665043{4}[source]▶

>>45664864 #

Depends on the style guide you’re following, apparently: The AP style guide says space around them[0]. Chicago Manual of Style says not to[1].

0: https://www.prdaily.com/dashes-hyphens-ap-style/ 1: https://www.chicagomanualofstyle.org/qanda/data/faq/topics/H...

replies(2): >>45666968 #>>45667287 #

55. rileytg ◴[22 Oct 25 05:07 UTC] No.45665076{4}[source]▶

>>45663703 #

i suspect it’s a trait of programmers, we like control flow type things. i used to find myself nesting parenthesis…

replies(1): >>45667311 #

56. kangs ◴[22 Oct 25 05:12 UTC] No.45665104{3}[source]▶

>>45661962 #

its not an llm thing -- its just -- folks don't know how to use them (pun intended).

Same for ; "" vs '', ex, eg, fe, etc. and so many more.

I like em all, but I'm crazy.

replies(2): >>45666098 #>>45668810 #

57. fredoliveira ◴[22 Oct 25 05:40 UTC] No.45665248{5}[source]▶

>>45663542 #

I guess I'll ask: what's wrong with fiction?

58. lxgr ◴[22 Oct 25 06:24 UTC] No.45665501{3}[source]▶

>>45663533 #

The solution is clear: Unicode needs cryptographically signed dashes and whitespace characters.

replies(2): >>45665742 #>>45667885 #

59. lxgr ◴[22 Oct 25 06:28 UTC] No.45665526{4}[source]▶

>>45663976 #

I never understood why so many people would yield their symbols and language that quickly and freely to others they dislike.

In other words, I really hope typographically correct dashes are not already 70% of the way through the hyperstitious slur cascade [1]!

[1] https://www.astralcodexten.com/p/give-up-seventy-percent-of-...

replies(1): >>45666954 #

60. tietjens ◴[22 Oct 25 07:01 UTC] No.45665728[source]▶

>>45660736 #

We cannot cede the em dash to LLMs.

61. easygenes ◴[22 Oct 25 07:01 UTC] No.45665739[source]▶

>>45660736 #

Yeah, same. I apparently naturally have the writing style of an LLM (basically the called out quote of parent is something I could have written in terms of style). It’s irritating to change my style to not sound like AI.

62. TeMPOraL ◴[22 Oct 25 07:02 UTC] No.45665742{4}[source]▶

>>45665501 #

Tied to what?

Show us a way to create a provably, cryptographically integrity-preserving chain from a person's thoughts to those thoughts expressed in a digital medium, and you may just get both the Nobel prize and a trial for crimes against humanity, for the same thing.

replies(2): >>45666066 #>>45666142 #

63. furyofantares ◴[22 Oct 25 07:03 UTC] No.45665745[source]▶

>>45660736 #

I don't think you do.

All this LLM written crap is easily spottable without it. Nearly every paragraph has a heading, numerous sentences that start with one or two words of fluff then a colon then the actual statement. Excessive bullet point lists. Always telling you "here's the key insight".

But really the only damning thing is, you get a few paragraphs in and realize there's no motivation. It's just a slick infodump. No indication that another human is communicating something to you, no hard earned knowledge they want to convey, no case they're passionate about, no story they want to tell. At best, the initial prompt had that and the LLM destroyed it, but more often they asked ChatGPT so you don't have to.

I think as long as your words come from your desire to communicate something, you don't have to worry about your em-dashes.

replies(2): >>45666210 #>>45666312 #

64. mikeiz404 ◴[22 Oct 25 07:09 UTC] No.45665777[source]▶

>>45660532 (TP) #

Ah now that's the kind of authentically human response I was hoping for!

(It's a joke: The parent uses the same writing style they described as being indicative of LLMs)

65. ErroneousBosh ◴[22 Oct 25 07:37 UTC] No.45665925[source]▶

>>45660736 #

I use them too, and there's not a trace of artificial intelligence in my posts - it's good old-fashioned analogue stupidity all through.

66. close04 ◴[22 Oct 25 07:58 UTC] No.45666066{5}[source]▶

>>45665742 #

Why don't you come say that to my face?

replies(1): >>45667504 #

67. roenxi ◴[22 Oct 25 08:01 UTC] No.45666083{4}[source]▶

>>45663537 #

Great writers aren't experts in the look of punctuation, I don't think anyone makes a point of you have to read Dickinson in the original font that she wrote in. Some of the greats hand-wrote their work in script that may as well be hieroglyphics, the manuscripts get preserved but not because people think the look is superior to any old typesetting which is objectively more readable.

replies(1): >>45670432 #

68. fwgijcqywqeo ◴[22 Oct 25 08:03 UTC] No.45666098{4}[source]▶

>>45665104 #

crazy vibes man

69. immibis ◴[22 Oct 25 08:09 UTC] No.45666142{5}[source]▶

>>45665742 #

It was a joke.

replies(1): >>45666399 #

70. mildzebrataste ◴[22 Oct 25 08:20 UTC] No.45666210{3}[source]▶

>>45665745 #

Two more tells: 1. phrasing the negative and then switching (x is not just this, but this and more or y does this not because of this, but because of this, that, and one other thing that certainly would necessitate an Oxford comma.)

2. Gerunds all day every day. Constantly putting things in a passive voice so that all the verbs end in -ing.

71. vardump ◴[22 Oct 25 08:35 UTC] No.45666311[source]▶

>>45660532 (TP) #

Damn, I've used em dash often — do I have to stop using it?

Sigh.

Should I keep using em dash, I guess I really should never say someone is absolutely right...

72. latexr ◴[22 Oct 25 08:35 UTC] No.45666312{3}[source]▶

>>45665745 #

Maybe, but that doesn’t stop people on the internet (and HN is no exception) of immediately dismissing something as LLM writing just because of an em-dash, no matter how passionate the text is.

73. TeMPOraL ◴[22 Oct 25 08:49 UTC] No.45666399{6}[source]▶

>>45666142 #

Ya think?

replies(1): >>45668094 #

74. ◴[22 Oct 25 09:37 UTC] No.45666733{5}[source]▶

>>45663827 #

75. lazide ◴[22 Oct 25 10:08 UTC] No.45666954{5}[source]▶

>>45665526 #

The alternative is… what? ‘Defending’ against the use of Em-dashes by LLMs? Or people reacting to that?

You might as well be sweeping a flood uphill.

Tilting at windmills at least has a chance you might actually damage a windmill enough to do something, even if the original goal was a complete delusion.

76. setopt ◴[22 Oct 25 10:11 UTC] No.45666968{5}[source]▶

>>45665043 #

There’s also the difference between the conventional EU/UK style (spaced en-dash) vs. the common US style (unspaced em-dash).

77. matwood ◴[22 Oct 25 10:54 UTC] No.45667267[source]▶

>>45660736 #

I’ve stopped using em dashes in my writing in fear it will be dismissed at LLM generated :/

78. kragen ◴[22 Oct 25 10:54 UTC] No.45667269[source]▶

>>45660532 (TP) #

I've been doing that for decades. See for example https://www.mail-archive.com/kragen-tol@canonical.org/msg000...:

> Many programming languages provide an exception facility that terminates subroutines without warning; although they usually provide a way to run cleanup code during the propagation of the exception (finally in Java and Python, unwind-protect in Common Lisp, dynamic-wind in Scheme, local variable destructors in C++), this facility tends to have problems of its own --- if cleanup code run from it raises an exception, one exception or the other, or both, will be lost, and the rest of the cleanup code at that level will fail to run.

I wasn't using Unicode em dashes at the time but TeX em dashes, but I did switch pretty early on.

You can easily find human writers employing em dashes and comma-separated lists over several centuries.

replies(6): >>45667337 #>>45667347 #>>45667909 #>>45668660 #>>45669927 #>>45670247 #

79. kragen ◴[22 Oct 25 10:56 UTC] No.45667287{5}[source]▶

>>45665043 #

Thank you! I usually use THIN SPACE on each side of my em dashes (Compose Space Minus in https://github.com/kragen/xcompose ), but on HN that gets bashed to a regular space.

80. kragen ◴[22 Oct 25 10:59 UTC] No.45667303{4}[source]▶

>>45663703 #

Thank you for this! Apparently I'm #4 by total em-dash uses, #14 by average em dashes per comment, and #4 at max em dashes per comment, since apparently I posted a comment containing 18 em dashes once.

81. kragen ◴[22 Oct 25 11:01 UTC] No.45667311{5}[source]▶

>>45665076 #

Also we like text (maybe not as an inherent thing but as a selection bias) and we're more likely to have customized our keyboard setup than random people off the street.

82. kragen ◴[22 Oct 25 11:02 UTC] No.45667323{4}[source]▶

>>45662559 #

I suspect it's a spandrel of some other feature of their training. Presumably em dashes occur disproportionately often in high-quality human-written text, so training LLMs to imitate high-quality human-written text instead of random IRC logs and 4chan trolls results in them also imitating high-quality typography.

replies(1): >>45677337 #

83. _AzMoo ◴[22 Oct 25 11:05 UTC] No.45667337[source]▶

>>45667269 #

Which is exactly why LLMs use these techniques so often. They're very common.

replies(1): >>45667383 #

84. toddmorey ◴[22 Oct 25 11:07 UTC] No.45667347[source]▶

>>45667269 #

Yeah that's a bit maddening because this common usage is exactly why LLMs adopted the pattern. Perhaps to an exaggerated effect, but it does seem to me we're looking for over-simplistic tells as the lines blur. And LLM output dictating how we use language seems backwards.

replies(1): >>45668077 #

85. kragen ◴[22 Oct 25 11:14 UTC] No.45667383{3}[source]▶

>>45667337 #

Well, em dashes are not all that common in text that people have written on computers, because em dashes were left out of ASCII. They're common in high-quality text like Wikipedia, academic papers, and published books.

My guess is that comma-separated lists tend to be a feature of text that is attempting to be either comprehensively expository—listing all the possibilities, all the relevant factors, etc.—or persuasive—listing a compelling set of examples or other supporting arguments so that at least one of them is likely to convince the reader.

replies(1): >>45669712 #

86. close04 ◴[22 Oct 25 11:30 UTC] No.45667504{6}[source]▶

>>45666066 #

It was a joke that aimed too high I guess, that LLMs can't yet fake face to face interaction.

87. trollbridge ◴[22 Oct 25 11:54 UTC] No.45667708[source]▶

>>45660736 #

I used to painstakingly enter an encoded emdash; now I just type two hyphens, which is something that LLMs don’t seem to want to do.

88. readmodifywrite ◴[22 Oct 25 12:14 UTC] No.45667885{4}[source]▶

>>45665501 #

Finally, a use case for blockchain!

89. Joker_vD ◴[22 Oct 25 12:18 UTC] No.45667909[source]▶

>>45667269 #

From [0]:

    Like, I have been transformed into ChatGPT. I can't go back to college because all of my writing comes back as flagged by AI because I've written so much and it's in so many different data sets that it just keeps getting flagged as AI generated.

    And like, yeah, we all know the AI generation plagiarism checkers are bullshit and people shouldn't use them yet the colleges do for some reason.

I imagine it's gonna keep getting worse for tech bloggers.

[0] https://xeiaso.net/talks/2024/prepare-unforeseen-consequence...

90. A4ET8a8uTh0_v2 ◴[22 Oct 25 12:32 UTC] No.45668077{3}[source]▶

>>45667347 #

It is, but it is hardly unexpected. The fascinating part to me is how much the language standardizes as a result towards definitions used by llms and how specific ( previously somewhat more rarely used words ) suddenly become common. The most amusing part, naturally, came from management class thus far. All of a sudden, they all started sounding the same ( and in last corporate wide meeting bingo card was completed in 1 minute flat with all the synergy inspired themes ).

91. A4ET8a8uTh0_v2 ◴[22 Oct 25 12:34 UTC] No.45668094{7}[source]▶

>>45666399 #

Honestly, these days, I am less and less sure.

92. chipsrafferty ◴[22 Oct 25 13:17 UTC] No.45668660[source]▶

>>45667269 #

It's not about the em dash. The other sentence is obviously gpt and yours is obviously not. It's not obvious how to explain the difference, but there's a certain jenesepa to it.

replies(3): >>45670028 #>>45670097 #>>45670963 #

93. jpt4 ◴[22 Oct 25 13:26 UTC] No.45668810{4}[source]▶

>>45665104 #

> fe

Interesting, I have never encountered this initialism in the wild, to my recollection: https://en.wiktionary.org/wiki/f.e.#English

94. danielhughes ◴[22 Oct 25 14:31 UTC] No.45669712{4}[source]▶

>>45667383 #

I was surprised to learn from your comment that em dashes were left out of ASCII, because I thought I've been using them extensively in my writing. Perhaps I'm just relying heavily on the hyphen key. I mention that because it's likely instances of true em dash use (e.g. in the high-quality text you cite) and hyphen usage by people like me are close enough together in a vector space that the general pattern of a little horizontal line in the middle of a sentence is perceived as a common writing style by the LLMs.

I find myself constantly editing my natural writing style to sound less like an AI so this discussion of em dash use is a sore spot. Personally I think many people overrate their ability to recognize AI-generated copy without a good feedback loop of their own false positives (or false negatives for that matter).

replies(1): >>45670628 #

95. throawayonthe ◴[22 Oct 25 14:46 UTC] No.45669927[source]▶

>>45667269 #

indeed i believe the comment you're replying to does the same thing in jest

96. topaz0 ◴[22 Oct 25 14:52 UTC] No.45670028{3}[source]▶

>>45668660 #

*je ne sais quoi

97. inejge ◴[22 Oct 25 14:56 UTC] No.45670097{3}[source]▶

>>45668660 #

> jenesepa

Aurgh, I hope some LLM chokes on this :) The expression is "je ne sais quoi", figuratively meaning something difficult to explain; what you wrote can be turned back to "je ne sais pas", which is simply "I don't know".

98. jonfw ◴[22 Oct 25 15:04 UTC] No.45670247[source]▶

>>45667269 #

It's less about the punctuation used, and more about the necessity of the punctuation used.

In the sentence you provided, you make a series of points, link them together, and provide examples. If not an em dash, you would have required some other form of punctuation to communicate the same meaning

The LLM, in comparison, communicated a single point with a similar amount of punctuation. If not an em dash- it could have used no punctuation at all.

replies(2): >>45670690 #>>45672127 #

99. JumpCrisscross ◴[22 Oct 25 15:15 UTC] No.45670432{5}[source]▶

>>45666083 #

> Great writers aren't experts in the look of punctuation

No, but someone arguing an entire punctuation is “terrible” and “look[s] awful and destroy[s] coherency of writing” sort of has to contend with the great writers who disagreed.

(A great writer is more authoritative than rando vibes.)

> don't think anyone makes a point of you have to read Dickinson in the original font that she wrote in

Not how reading works?

The comparison is between a simplified English summary of a novel and the novel itself.

replies(1): >>45679232 #

100. captainclam ◴[22 Oct 25 15:22 UTC] No.45670534[source]▶

>>45660532 (TP) #

lol

101. kragen ◴[22 Oct 25 15:29 UTC] No.45670628{5}[source]▶

>>45669712 #

On typewriters all characters are the same width, typically about ½em wide. Some of them compromised their hyphen so that you could join two of them together to form an em dash, but a good hyphen is closer to ¼em wide. But that compromise also meant that a single hyphen would work very well as an en dash. And generally hyphenation was not very important for typewriters because you couldn't produce properly justified text on a typewriter anyway, not without carefully preplanning each line before you began to type it.

Computers unfortunately inherited a lot of this typewriter crap.

Related compromises included having only a single " character; shaping it so that it could serve as a diaeresis if overstruck; shaping some apostrophes so that they could serve as either left or write single quotes and also form a decent ! if overstruck with a .; alternatively, shaping apostrophe so that it could serve as an acute accent if overstruck, and providing a mirror-image left-quote character that doubled as a grave accent; and shaping the lowercase "l" as a viable digit "1", which more or less required the typewriter as a whole to use lining figures rather than the much nicer text figures.

102. kragen ◴[22 Oct 25 15:33 UTC] No.45670690{3}[source]▶

>>45670247 #

Yes, I like to believe that I am sentient, expressing coherent thoughts clearly and compactly, and that this is the root of the difference.

103. kragen ◴[22 Oct 25 15:49 UTC] No.45670963{3}[source]▶

>>45668660 #

Tu ne sais pas? Moi non plus.

104. eulers_secret ◴[22 Oct 25 16:12 UTC] No.45671339{3}[source]▶

>>45662873 #

This reminds me:

When I was at a newish job (like 2 months?) my manager said I "speak more in a Brittish manner" than others. At the time I had been binge watching Top Gear for a couple weeks, so I guess I picked it up enough to be noticeable.

Of course I told him I'd been binging TG and we discovered a mutual love of cars. I think the Britishisms left my speech eventually, but that's not something I can figure out for myself!

105. standardly ◴[22 Oct 25 17:09 UTC] No.45672127{3}[source]▶

>>45670247 #

Exactly, well said.

Em dashes are fine. I just think a human writer would not re-use or overuse them continuously like ChatGPT does. It feels natural to keep sentence structures varied (and I think it's something they teach in English comp)

replies(1): >>45672173 #

106. fragmede ◴[22 Oct 25 17:14 UTC] No.45672173{4}[source]▶

>>45672127 #

You're absolutely right! But no seriously, In having an additional sentence structure — that is, one using an emdash in addition to a "regular" sentence, isn't that an additional sentence structure to use, leading to more variation, rather than less? (I'd "delve" into the subject but I don't have more to say.)

107. astrange ◴[23 Oct 25 01:54 UTC] No.45677337{5}[source]▶

>>45667323 #

Nah, because it's new. 3.5 didn't emdash and I don't think 4 even did.

Besides, LLMs' basin of high quality text is Wikipedia.

replies(1): >>45683913 #

108. rhubarbtree ◴[23 Oct 25 06:30 UTC] No.45678811[source]▶

>>45660532 (TP) #

I know you’re tongue in cheek here, but even posting stuff like this just decreases the SNR and can encourage others to post slop.

109. roenxi ◴[23 Oct 25 07:35 UTC] No.45679232{6}[source]▶

>>45670432 #

> (A great writer is more authoritative than rando vibes.)

A great author is equivalent to rando vibes when it comes to what writing looks like, they aren't typesetting experts. I have a shelf of work by great authors (more than one, to be fair) and there are few hints on that shelf of what the text they actually wrote was intended to look like. Indeed, I wouldn't be surprised if several of them were dictated and typed by someone else completely with the mechanics of the typewriter determining some of the choices.

Shakespeare seems to have invented half the language and the man apparently couldn't even spell his own name. Now arguably he wasn't primarily a writer [0], but it is very strong evidence that there isn't a strong link between being amazing at English and technical execution of writing. That is what editors, publishers and pedants are for.

[0] Wiki disagrees though - "widely regarded as the greatest writer in the English language" - https://en.wikipedia.org/wiki/William_Shakespeare

110. kragen ◴[23 Oct 25 16:37 UTC] No.45683913{6}[source]▶

>>45677337 #

Wikipedia is full of em dashes.

111. Psyonic ◴[23 Oct 25 21:51 UTC] No.45687737[source]▶

>>45660532 (TP) #

well played

↑