←back to thread

323 points steerlabs | 1 comments | | HN request time: 0s | source
Show context
toddmorey ◴[] No.46191994[source]
Confident idiot: I’m exploring using LLM for diagram creation.

I’ve found after about 3 prompts to edit an image with Gemini, it will respond randomly with an entirely new image. Another quirk is it will respond “here’s the image with those edits” with no edits made. It’s like a toaster that will catch on fire every eighth or ninth time.

I am not sure how to mitigate this behavior. I think maybe an LLM as a judge step with vision to evaluate the output before passing it on to the poor user.

replies(5): >>46193250 #>>46193673 #>>46194370 #>>46194578 #>>46195816 #
codingdave ◴[] No.46194578[source]
Have you considered that perhaps such things simply are not within its capabilities?
replies(1): >>46197277 #
1. toddmorey ◴[] No.46197277[source]
I mean, one of its flagship features is to make precise edits to images. And it's really good at it... until it randomly isn't.