Most active commenters

Grok: Searching X for "From:Elonmusk (Israel or Palestine or Hamas or Gaza)"

(simonwillison.net)

Show context

xnx ◴[11 Jul 25 00:34 UTC] No.44527256[source]▶

> It’s worth noting that LLMs are non-deterministic,

This is probably better phrased as "LLMs may not provide consistent answers due to changing data and built-in randomness."

Barring rare(?) GPU race conditions, LLMs produce the same output given the same inputs.

replies(7): >>44527264 #>>44527395 #>>44527458 #>>44528870 #>>44530104 #>>44533038 #>>44536027 #

msgodel ◴[11 Jul 25 00:35 UTC] No.44527264[source]▶

>>44527256 #

I run my local LLMs with a seed of one. If I re-run my "ai" command (which starts a conversation with its parameters as a prompt) I get exactly the same output every single time.

replies(2): >>44527284 #>>44527453 #

1. xnx ◴[11 Jul 25 00:38 UTC] No.44527284[source]▶

>>44527264 #

Yes. This is what I was trying to say. Saying "It’s worth noting that LLMs are non-deterministic" is wrong and should be changed in the blog post.

replies(3): >>44527462 #>>44528765 #>>44529031 #

2. boroboro4 ◴[11 Jul 25 01:11 UTC] No.44527462[source]▶

>>44527284 (TP) #

You’re correct in batch size 1 (local is one), but not in production use case when multiple requests get batched together (and that’s how all the providers do this).

With batching matrix shapes/request position in them aren’t deterministic and this leads to non deterministic results, regardless of sampling temperature/seed.

replies(1): >>44527524 #

3. unsnap_biceps ◴[11 Jul 25 01:22 UTC] No.44527524[source]▶

>>44527462 #

Isn't that true only if the batches are different? If you run exactly the same batch, you're back to a deterministic result.

If I had a black box api, just because you don't know how it's calculated doesn't mean that it's non-deterministic. It's the underlaying algorithm that determines that and a LLM is deterministic.

replies(1): >>44527543 #

4. boroboro4 ◴[11 Jul 25 01:25 UTC] No.44527543{3}[source]▶

>>44527524 #

Providers never run same batches because they mix requests between different clients, otherwise GPUs are gonna be severely underutilized.

It’s inherently non deterministic because it reflects the reality of having different requests coming to the servers at the same time. And I don’t believe there are any realistic workarounds if you want to keep costs reasonable.

Edit: there might be workarounds if matmul algorithms will give stronger guarantees then they are today (invariance on rows/columns swap). Not an expert to say how feasible it is, especially in quantized scenario.

5. TheDong ◴[11 Jul 25 05:51 UTC] No.44528765[source]▶

>>44527284 (TP) #

> Saying "It’s worth noting that LLMs are non-deterministic" is wrong and should be changed in the blog post.

Every person in this thread understood that Simon meant "Grok, ChatGPT, and other common LLM interfaces run with a temperature>0 by default, and thus non-deterministically produce different outputs for the same query".

Sure, he wrote a shorter version of that, and because of that y'all can split hairs on the details ("yes it's correct for how most people interact with LLMs and for grok, but _technically_ it's not correct").

The point of English blog posts is not to be a long wall of logical prepositions, it's to convey ideas and information. The current wording seems fine to me.

The point of what he was saying was to caution readers "you might not get this if you try to repro it", and that is 100% correct.

replies(2): >>44529058 #>>44530499 #

6. DemocracyFTW2 ◴[11 Jul 25 06:42 UTC] No.44529031[source]▶

>>44527284 (TP) #

"Non-deterministic" in the sense that a dice roll is when you don't know every parameter with ultimate precision. On one hand I find insistence on the wrongness on the phrase a bit too OCD, on the other I must agree that a very simple re-phrasing like "appears {non-deterministic|random|unpredictable} to an outside observer" would've maybe even added value even for less technically-inclined folks, so yeah.

7. root_axis ◴[11 Jul 25 06:49 UTC] No.44529058[source]▶

>>44528765 #

Still, the statement that LLMs are non-deterministic is incorrect and could mislead some people who simply aren't familiar with how they work.

Better phrasing would be something like "It's worth noting that LLM products are typically operated in a manner that produces non-deterministic output for the user"

replies(2): >>44529211 #>>44529618 #

8. Veen ◴[11 Jul 25 07:14 UTC] No.44529211{3}[source]▶

>>44529058 #

Simon would be less engaging if he caveated every generalisation in that way. It’s one of the main reasons academic writing is often tedious to read.

9. antonvs ◴[11 Jul 25 08:25 UTC] No.44529618{3}[source]▶

>>44529058 #

> It's worth noting that LLM products are typically operated in a manner that produces non-deterministic output for the user

Or you could abbreviate this by saying “LLMs are non-deterministic.” Yes, it requires some shared context with the audience to interpret correctly, but so does every text.

10. msgodel ◴[11 Jul 25 10:23 UTC] No.44530499[source]▶

>>44528765 #

My temperature is set higher than zero as well. That doesn't make them nondeterministic.

replies(1): >>44531909 #

11. saagarjha ◴[11 Jul 25 13:29 UTC] No.44531909{3}[source]▶

>>44530499 #

I would hope that your temperature is set higher than zero.

↑