←back to thread

358 points tkgally | 1 comments | | HN request time: 0.211s | source

The use of the em dash (—) now raises suspicions that a text might have been AI-generated. Inspired by a suggestion from dang [1], I created a leaderboard of HN users according to how many of their posts before November 30, 2022—that is, before the release of ChatGPT—contained em dashes. Dang himself comes in number 2—by a very slim margin.

Credit to Claude Code for showing me how to search the HN database through Google BigQuery and for writing the HTML for the leaderboard.

[1] https://news.ycombinator.com/item?id=45053933

Show context
userbinator ◴[] No.45071871[source]
I suspect they are generated via "autocorrect", the same way as "smart (more like stupid) quotes" and other characters that tend to cause a great deal of frustration should they find their way into source code. It would be interesting to see how many users regularly make posts containing non-ASCII characters.
replies(5): >>45071883 #>>45071891 #>>45071897 #>>45071898 #>>45072027 #
1. southwindcg ◴[] No.45072027[source]
I use Autokey. I've added a bunch of occasionally-used HTML entities and Unicode characters so I don't need to go hunting for them.