The "confident idiot" problem: Why AI needs hard rules, not vibe checks

(steerlabs.substack.com)

323 points steerlabs | 2 comments | 04 Dec 25 20:48 UTC | HN request time: 0.001s | source

Show context

toddmorey ◴[08 Dec 25 13:29 UTC] No.46191994[source]▶

Confident idiot: I’m exploring using LLM for diagram creation.

I’ve found after about 3 prompts to edit an image with Gemini, it will respond randomly with an entirely new image. Another quirk is it will respond “here’s the image with those edits” with no edits made. It’s like a toaster that will catch on fire every eighth or ninth time.

I am not sure how to mitigate this behavior. I think maybe an LLM as a judge step with vision to evaluate the output before passing it on to the poor user.

replies(5): >>46193250 #>>46193673 #>>46194370 #>>46194578 #>>46195816 #

1. RationPhantoms ◴[08 Dec 25 15:22 UTC] No.46193250[source]▶

>>46191994 #

Whats your thoughts on the diagram as code movement? I'd prefer to have an LLM utilize those as it can atleast drive some determinism through it rather than deal with the slippery layer that is prompt control for visual LLMs.

replies(1): >>46197292 #

2. toddmorey ◴[08 Dec 25 20:35 UTC] No.46197292[source]▶

>>46193250 (TP) #

I think that's the right approach and what I've been experimenting with. Diagram as code and then style transfer from output diagram to desired look. That's where I've had the most success.

↑