Thanks for pointing out the elephant in the room with LLMs.
The basic design is non-deterministic. Trying to extract "facts" or "truth" or "accuracy" is an exercise in futility.
My thesis isn't that we can stop the hallucination (non-determinism), but that we can bound it.
If we wrap the generation in hard assertions (e.g., assert response.price > 0), we turn 'probability' into 'manageable software engineering.' The generation remains probabilistic, but the acceptance criteria become binary and deterministic.
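Concretely, something like this minimal sketch (call_llm is a hypothetical stand-in for whatever provider SDK you use, and the JSON shape with a "price" field is assumed for illustration):

    import json

    def call_llm(prompt: str) -> str:
        """Hypothetical LLM call; substitute your provider's SDK."""
        raise NotImplementedError

    def generate_price(prompt: str, max_retries: int = 3) -> float:
        """Retry probabilistic generation until a deterministic check passes."""
        for _ in range(max_retries):
            raw = call_llm(prompt)
            try:
                response = json.loads(raw)
                price = float(response["price"])
            except (json.JSONDecodeError, KeyError, TypeError, ValueError):
                continue  # malformed output: reject and regenerate
            if price > 0:  # the hard assertion: binary accept/reject
                return price
        raise ValueError("no generation passed the acceptance check")

The generator stays as stochastic as ever; the only thing that's deterministic is whether we accept its output.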
Unfortunately, the use cases where AI is most wanted are often exactly the ones where the acceptance criteria can't be easily defined, because they're a matter of judgment. For example: "Does this patient have cancer?"
And in cases where the criteria can be clearly stipulated, AI often isn't really required in the first place.