    323 points steerlabs | 15 comments
    jqpabc123 ◴[] No.46153440[source]
    > We are trying to fix probability with more probability. That is a losing game.

    Thanks for pointing out the elephant in the room with LLMs.

    The basic design is non-deterministic. Trying to extract "facts" or "truth" or "accuracy" is an exercise in futility.

    replies(17): >>46155764 #>>46191721 #>>46191867 #>>46191871 #>>46191893 #>>46191910 #>>46191973 #>>46191987 #>>46192152 #>>46192471 #>>46192526 #>>46192557 #>>46192939 #>>46193456 #>>46194206 #>>46194503 #>>46194518 #
    1. steerlabs ◴[] No.46155764[source]
    Exactly. We treat them like databases, but they are hallucination machines.

    My thesis isn't that we can stop the hallucinating (non-determinism), but that we can bound it.

    If we wrap the generation in hard assertions (e.g., assert response.price > 0), we turn 'probability' into 'manageable software engineering.' The generation remains probabilistic, but the acceptance criteria become binary and deterministic.
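
    A minimal sketch of what that might look like in Python. The call_llm function, the retry budget, and the response schema here are all hypothetical and not tied to any particular library; the point is just that the accept/reject step is ordinary deterministic code:

        import json

        MAX_ATTEMPTS = 3

        def call_llm(prompt: str) -> str:
            """Hypothetical, probabilistic LLM call returning raw JSON text."""
            raise NotImplementedError

        def generate_price(prompt: str) -> dict:
            # Generation stays probabilistic; acceptance is binary.
            for _ in range(MAX_ATTEMPTS):
                raw = call_llm(prompt)
                try:
                    response = json.loads(raw)
                except json.JSONDecodeError:
                    continue  # malformed output: reject and retry
                if not isinstance(response, dict):
                    continue  # wrong shape: reject and retry
                # Hard check, the moral equivalent of `assert response.price > 0`.
                price = response.get("price")
                if isinstance(price, (int, float)) and price > 0:
                    return response
            raise ValueError("no candidate passed the acceptance checks")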

    replies(4): >>46163076 #>>46191658 #>>46191774 #>>46191967 #
    2. jqpabc123 ◴[] No.46163076[source]
    > but the acceptance criteria become binary and deterministic.

    Unfortunately, the use case for AI is often one where the acceptance criteria are not easily defined: it's a matter of judgment. For example, "Does this patient have cancer?".

    In cases where the criteria can be easily and clearly stipulated, AI often isn't really required.

    replies(2): >>46185626 #>>46191846 #
    3. steerlabs ◴[] No.46185626[source]
    You're 100% right. For a "judgment" task like "Does this patient have cancer?", the final acceptance decision has to come from a human expert. A purely deterministic verifier is impossible.

    My thesis is that even in those "fuzzy" workflows, the agent's process is full of small, deterministic sub-tasks that can and should be verified.

    For example, before the AI even attempts to analyze the X-ray for cancer, it must:

    1. Verify it has the correct patient file (PatientIDVerifier).
    2. Verify the image is a chest X-ray and not a brain MRI (ModalityVerifier).
    3. Verify the date of the scan is within the relevant timeframe (DateVerifier).

    These are "boring," deterministic checks. But a failure on any one of them makes the final "judgment" output completely useless.

    steer isn't designed to automate the final, high-stakes judgment. It's designed to automate the pre-flight checklist, ensuring the agent has the correct, factually grounded information before it even begins the complex reasoning task. It's about reducing the "unforced errors" so the human expert can focus only on the truly hard part.
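
    A sketch of that pre-flight checklist as plain Python. The three verifier names come from the list above, but the Scan shape, its field names, and the 90-day window are invented for illustration; none of this reflects how steer actually implements them:

        from dataclasses import dataclass
        from datetime import date, timedelta

        @dataclass
        class Scan:
            patient_id: str
            modality: str   # e.g. "chest_xray" or "brain_mri"
            taken_on: date

        def verify_patient_id(scan: Scan, expected_id: str) -> bool:
            # PatientIDVerifier: do we have the correct patient file?
            return scan.patient_id == expected_id

        def verify_modality(scan: Scan) -> bool:
            # ModalityVerifier: a chest X-ray, not a brain MRI?
            return scan.modality == "chest_xray"

        def verify_date(scan: Scan, max_age_days: int = 90) -> bool:
            # DateVerifier: is the scan within the relevant timeframe?
            return date.today() - scan.taken_on <= timedelta(days=max_age_days)

        def preflight(scan: Scan, expected_id: str) -> bool:
            # Each check is deterministic, and a failure on any one of them
            # makes the final "judgment" output useless, so fail fast.
            return (verify_patient_id(scan, expected_id)
                    and verify_modality(scan)
                    and verify_date(scan))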

    replies(1): >>46191761 #
    4. scotty79 ◴[] No.46191658[source]
    > We treat them like databases, but they are hallucination machines.

    Which is kind of crazy because we don't even treat people as databases. Or at least we shouldn't.

    Maybe it's one of those things that will disappear from culture one funeral at a time.

    replies(1): >>46191851 #
    5. malfist ◴[] No.46191761{3}[source]
    Why do any of those checks with AI, though? For all of them, you can get a less error-prone answer without AI.
    replies(1): >>46191922 #
    6. squidbeak ◴[] No.46191774[source]
    I don't agree that users see them as databases. Sure, there are those who expect LLMs to be infallible and punish the technology when it disappoints them, but it seems to me that the overwhelming majority quickly learn what AI's shortcomings are, and treat them instead like intelligent entities who will sometimes make mistakes.
    replies(2): >>46191785 #>>46191917 #
    7. philipallstar ◴[] No.46191785[source]
    > but it seems to me that the overwhelming majority

    The overwhelming majority of what?

    replies(1): >>46192444 #
    8. multjoy ◴[] No.46191846[source]
    AI doesn’t necessarily mean LLMs, which are the systems making things up.
    9. hrimfaxi ◴[] No.46191851[source]
    Humans demand more reliability from our creations than from each other.
    10. ◴[] No.46191917[source]
    11. jennyholzer ◴[] No.46191922{4}[source]
    Robo-eugenics is the best answer I can come up with
    12. ◴[] No.46191967[source]
    13. antonvs ◴[] No.46192444{3}[source]
    Of users. It's an implicit subject from the first sentence.
    replies(1): >>46194222 #
    14. philipallstar ◴[] No.46194222{4}[source]
    But how do they know that, if it's of all users?
    replies(1): >>46195062 #
    15. antonvs ◴[] No.46195062{5}[source]
    They didn't claim to know it; they said "it seems to me". Presumably they're extrapolating from their experience, or from their expectations of how an average user would behave.