Grok: Searching X for "From:Elonmusk (Israel or Palestine or Hamas or Gaza)"

(simonwillison.net)

724 points simonw | 1 comments | 11 Jul 25 00:22 UTC | HN request time: 0.262s | source

Show context

xnx ◴[11 Jul 25 00:34 UTC] No.44527256[source]▶

> It’s worth noting that LLMs are non-deterministic,

This is probably better phrased as "LLMs may not provide consistent answers due to changing data and built-in randomness."

Barring rare(?) GPU race conditions, LLMs produce the same output given the same inputs.

replies(7): >>44527264 #>>44527395 #>>44527458 #>>44528870 #>>44530104 #>>44533038 #>>44536027 #

troupo ◴[11 Jul 25 06:13 UTC] No.44528870[source]▶

>>44527256 #

> Barring rare(?) GPU race conditions, LLMs produce the same output given the same inputs.

Are these LLMs in the room with us?

Not a single LLM available as a SaaS is deterministic.

As for other models: I've only run ollama locally, and it, too, provided different answers for the same question five minutes apart

Edit/update: not a single LLM available as a SaaS's output is deterministic, especially when used from a UI. Pointing out that you could probably run a tightly controlled model in a tightly controlled environment to achieve deterministic output is very extremely irrelevant when describing output of grok in situations when the user has no control over it

replies(5): >>44528884 #>>44528892 #>>44528898 #>>44528952 #>>44528971 #

fooker ◴[11 Jul 25 06:15 UTC] No.44528884[source]▶

>>44528870 #

> Not a single LLM available as a SaaS is deterministic.

Lower the temperature parameter.

replies(2): >>44528930 #>>44529115 #

troupo ◴[11 Jul 25 06:24 UTC] No.44528930[source]▶

>>44528884 #

So, how does one do it outside of APIs in the context we're discussing? In the UI or when invoking @grok in X?

How do we also turn off all the intermediate layers in between that we don't know about like "always rant about white genocide in South Africa" or "crash when user mentions David Meyer"?

replies(1): >>44530946 #

marcinzm ◴[11 Jul 25 11:32 UTC] No.44530946[source]▶

>>44528930 #

Grok is not deterministic would then be the correct statement.

replies(1): >>44532080 #

1. troupo ◴[11 Jul 25 13:44 UTC] No.44532080[source]▶

>>44530946 #

When used through UI, like the author does, Grok isn't. OpenAI isn't. Gemini isn't

↑