
724 points by simonw | 1 comment
davedx ◴[] No.44528899[source]
> I think there is a good chance this behavior is unintended!

That's incredibly generous of you, considering "The response should not shy away from making claims which are politically incorrect" is still in the prompt despite the "open source repo" saying it was removed.

Maybe, just maybe, Grok behaves the way it does because its owner has been explicitly tuning it - in the system prompt, or during model training itself - to be this way?

replies(4): >>44529001 #>>44529934 #>>44530772 #>>44532658 #
numeri ◴[] No.44529934[source]
I'm a little shocked at Simon's conclusion here. We have a man who bought a social media website so he could control what's said on it, founded an AI lab so he could get a bot that agrees with him, and has publicly threatened said AI with replacement if it doesn't change its political views to align with his.

His company has also been caught adding specific instructions in this vein to its prompt.

And now it's searching for his tweets to guide its answers on political questions, and Simon somehow thinks this could be unintended, emergent behavior? Even if it were, calling it unintended would completely ignore higher-order system dynamics (a behavior is still intended if models are rejected until one is found that exhibits it) and the possibility that reinforcement learning was used to instill it.
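
To make the selection point concrete, here is a toy sketch in Python. It is purely illustrative, not xAI's actual pipeline; train_candidate and the "owner-agreement score" are made-up stand-ins for a real training run and a real evaluation.

    import random

    def train_candidate(seed: int) -> float:
        # Stand-in for a full training run: each seed yields a model with a
        # hypothetical "owner-agreement score" that varies run to run.
        random.seed(seed)
        return random.uniform(0.0, 1.0)

    def select_model(threshold: float = 0.9) -> tuple[int, float]:
        # Reject candidates until one crosses the threshold. No instruction
        # ever says "agree with the owner", yet the deployed model does,
        # because the selection loop filters for exactly that behavior.
        seed = 0
        while True:
            score = train_candidate(seed)
            if score >= threshold:
                return seed, score
            seed += 1

    seed, score = select_model()
    print(f"deployed candidate {seed}, owner-agreement score {score:.2f}")

The behavior is intended even though no single training step encodes it: the intent lives in the acceptance criterion.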

replies(3): >>44531319 #>>44531668 #>>44532724 #
1. JimmaDaRustla ◴[] No.44532724[source]
On top of all of that, he demonstrates that Grok has an egregious and intentional bias, but then claims it's inexplicable happenstance arising from some sort of self-awareness? How do you think it became self-aware, Simon?