The "confident idiot" problem: Why AI needs hard rules, not vibe checks

(steerlabs.substack.com)

323 points steerlabs | 1 comments | 04 Dec 25 20:48 UTC

toddmorey ◴[08 Dec 25 13:29 UTC] No.46191994[source]▶

>>46152838 (OP) #

Confident idiot: I’m exploring using an LLM for diagram creation.

I’ve found that after about 3 prompts to edit an image with Gemini, it will randomly respond with an entirely new image. Another quirk is that it will respond “here’s the image with those edits” with no edits made. It’s like a toaster that catches fire every eighth or ninth time.

I am not sure how to mitigate this behavior. I think maybe an LLM-as-a-judge step with vision could evaluate the output before passing it on to the poor user.
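Roughly what I have in mind, as a minimal sketch and not a tested pipeline: a cheap deterministic check catches the two failures above (nothing changed, or an entirely new image), and only then does the judge get a vote. Everything here is hypothetical: `edit_fn` and `judge_fn` stand in for the real model calls, images are flat lists of pixel values to keep it self-contained, and the thresholds are guesses that would need tuning.

```python
from typing import Callable, Optional, Sequence

# Hypothetical stand-in for a real image type: a flat sequence of pixel values.
Image = Sequence[int]


def changed_fraction(before: Image, after: Image) -> float:
    """Return the fraction of pixels that differ between two images."""
    if len(before) != len(after):
        return 1.0  # a different size is treated as a completely new image
    if not before:
        return 0.0
    return sum(a != b for a, b in zip(before, after)) / len(before)


def passes_hard_rules(before: Image, after: Image,
                      min_change: float = 0.001,
                      max_change: float = 0.5) -> bool:
    """Reject no-op edits and wholesale replacements (thresholds are assumed)."""
    return min_change <= changed_fraction(before, after) <= max_change


def edit_with_gate(before: Image, prompt: str,
                   edit_fn: Callable[[Image, str], Image],
                   judge_fn: Callable[[Image, Image, str], bool],
                   max_attempts: int = 3) -> Optional[Image]:
    """Retry the edit until it passes the hard rules and the judge, else give up."""
    for _ in range(max_attempts):
        candidate = edit_fn(before, prompt)
        # Run the deterministic check first so the judge is only called on plausible edits.
        if passes_hard_rules(before, candidate) and judge_fn(before, candidate, prompt):
            return candidate
    return None  # the caller should surface a failure instead of a bad image
```

Returning `None` after the last attempt is deliberate: the user gets an explicit failure rather than a confident "here's the image with those edits".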

replies(5): >>46193250 #>>46193673 #>>46194370 #>>46194578 #>>46195816 #

codingdave ◴[08 Dec 25 16:52 UTC] No.46194578[source]▶

Have you considered that perhaps such things simply are not within its capabilities?

replies(1): >>46197277 #

1. toddmorey ◴[08 Dec 25 20:34 UTC] No.46197277[source]▶

I mean, one of its flagship features is to make precise edits to images. And it's really good at it... until it randomly isn't.