←back to thread

358 points tkgally | 1 comments | | HN request time: 0s | source

The use of the em dash (—) now raises suspicions that a text might have been AI-generated. Inspired by a suggestion from dang [1], I created a leaderboard of HN users according to how many of their posts before November 30, 2022—that is, before the release of ChatGPT—contained em dashes. Dang himself comes in number 2—by a very slim margin.

Credit to Claude Code for showing me how to search the HN database through Google BigQuery and for writing the HTML for the leaderboard.

[1] https://news.ycombinator.com/item?id=45053933

Show context
PUSH_AX ◴[] No.45072409[source]
It might be more fun to see users who’s emdash usage increased after the release.
replies(6): >>45072489 #>>45072739 #>>45072901 #>>45073019 #>>45073342 #>>45074025 #
idiotsecant ◴[] No.45073019[source]
Even more interesting is the likely increase in emdash usage by those not using an LLM, but merely imitating the writing they see subconsciously. There was a evidence that chatgpt is shifting the frequency of use of some uncommon words and phrases amongst non-users.
replies(1): >>45073327 #
sebastiennight ◴[] No.45073327[source]
Oh really? We should definitely delve into this.
replies(2): >>45073473 #>>45077944 #
1. JdeBP ◴[] No.45073473{3}[source]
You'll need to delve into history back quite a number of years. (-:

* https://news.ycombinator.com/item?id=18439869