Grok: Searching X for "From:Elonmusk (Israel or Palestine or Hamas or Gaza)"

(simonwillison.net)

724 points simonw | 3 comments | 11 Jul 25 00:22 UTC

throwaway439080 ◴[11 Jul 25 06:32 UTC] No.44528967[source]▶

Kind of amazing the author just takes everything at face value and doesn't even consider the possibility that there's a hidden layer of instructions. Elon likes to meddle with Grok whenever the mood strikes him, leading to Grok's sudden interest in Nazi topics such as South African "white genocide" and calling itself MechaHitler. Pretty sure that stuff is not in the instructions Grok will tell the user about.

replies(2): >>44529021 #>>44530253 #

1. invalidusernam3 ◴[11 Jul 25 06:40 UTC] No.44529021[source]▶

>>44528967 #

The "MechaHitler" thing is particularly obvious in my opinion; it aligns so closely with Musk's weird trying-to-be-funny thing that he does.

There's basically no way an LLM would come up with a name for itself that it consistently uses unless it's extensively referred to by that name in the training data (which is almost definitely not the case here for public data since I doubt anyone on Earth has ever referred to Grok as "MechaHitler" prior to now) or it's added in some kind of extra system prompt. The name seems very obviously intentional.

replies(2): >>44529155 #>>44529505 #

2. orbital-decay ◴[11 Jul 25 07:05 UTC] No.44529155[source]▶

>>44529021 (TP) #

Most LLMs, even pretty small ones, easily come up with creative names like that, depending on the prompt/conversation route.

3. zarwv ◴[11 Jul 25 08:03 UTC] No.44529505[source]▶

>>44529021 (TP) #

Grok was just repeating and expanding on things. Someone either said MechaHitler or mentioned Wolfenstein. If Grok searches Yandex and X, he's going to get quite a lot of crazy ideas. Someone tricked him with a fake article about a woman with a Jewish name saying bad things about flood victims.