It really strains credulity to say that a Musk-owned AI model that answers controversial questions by looking up what his Twitter profile says was completely out of the blue. Unless they can somehow show this wasn't built into the training process, I don't see anyone taking this model seriously for its intended use, besides maybe the sycophants who badly need a summary of Elon Musk's tweets.
So in that sense, Grok and Gemini aren't that far apart, just at opposite ends of the extreme.
Apparently it's very hard to create an AI that behaves in a balanced way. Not too woke, and not too racist.
Well, it's hard to build things we don't even understand ourselves, especially when it comes to highly subjective topics. What is "woke" for one person is "basic humanity" for another, and "extremism" for yet another, and the same goes for most things.
If the model can output subjective text, then the model will be biased in some way, I think.
Musk said "stop making it sound woke"; after retraining it and changing the fine-tuning dataset, it was still sounding woke. After he fired a bunch more researchers, I suspect they thought "why not make it search what Musk thinks?" Boom, it passes the woke test now.
That's not an emergent behaviour, that's almost certainly deliberate. If someone manages to extract the prompt, you'll get confirmation.