←back to thread

724 points simonw | 1 comments | | HN request time: 0.214s | source
Show context
joshstrange ◴[] No.44532182[source]
> I think there is a good chance this behavior is unintended!

Ehh, given the person we are talking about (Elon) I think that's a little naive. They wouldn't need to add it in the system prompt, they could have just fine-tuned it and rewarded it when it tried to find Elon's opinion. He strikes me as the type of person who would absolutely do that given stories about him manipulating Twitter to "fix" his dropping engagement numbers.

This isn't fringe/conspiracy territory, it would be par for the course IMHO.

replies(1): >>44532309 #
simonw ◴[] No.44532309[source]
If I was Elon and I decided that Grok should search my tweets any time it needs to answer something controversial, I would also make sure it didn't say "Searching X for from:elonmusk" right there in the UI every time it did that.
replies(2): >>44532750 #>>44533502 #
joshstrange ◴[] No.44532750[source]
I don't want to be rude, I quite enjoy your work but:

If I was Elon and I decided that I wanted to go full fascist then I wouldn't do a nazi salute at the inauguration.

But I get what you are saying and you aren't wrong but also people can make mistakes/bugs, we might see Grok "stop" searching for that but who knows if it's just hidden or if it actually will stop doing it. Elon has just completely burned any "Here is an innocent explanation"-cred in my book, assuming the worst seems to be the safest course of action.

replies(1): >>44533430 #
1. simonw ◴[] No.44533430[source]
Personally I don't think "we trained our model to search for Elon's opinion on things even though we didn't mean to" is a particularly innocent explanation. It strikes at the heart of the credibility of the organization.