←back to thread

724 points simonw | 3 comments | | HN request time: 0.554s | source
Show context
throwaway439080 ◴[] No.44528967[source]
Kind of amazing the author just takes everything at face value and doesn't even consider the possibility that there's a hidden layer of instructions. Elon likes to meddle with Grok whenever the mood strikes him, leading to Grok's sudden interest in Nazi topics such as South African "white genocide" and calling itself MechaHitler. Pretty sure that stuff is not in the instructions Grok will tell the user about.
replies(2): >>44529021 #>>44530253 #
1. invalidusernam3 ◴[] No.44529021[source]
The "MechaHitler" things is particularly obvious in my opinion, it aligns so closely to Musk's weird trying-to-be-funny thing that he does.

There's basically no way an LLM would come up with a name for itself that it consistently uses unless it's extensively referred to by that name in the training data (which is almost definitely not the case here for public data since I doubt anyone on Earth has ever referred to Grok as "MechaHitler" prior to now) or it's added in some kind of extra system prompt. The name seems very obviously intentional.

replies(2): >>44529155 #>>44529505 #
2. orbital-decay ◴[] No.44529155[source]
Most LLMs, even pretty small ones, easily come up with creative names like that, depending on the prompt/conversation route.
3. zarwv ◴[] No.44529505[source]
Grok was just repeating and expanding on things. Someone either said MechaHitler or mentioned Wolfenstein. If Grok searches Yandex and X, he's going to get quite a lot of crazy ideas. Someone tricked him with a fake article of a woman with a Jewish name saying bad things about flood victims.