←back to thread

549 points thecr0w | 1 comments | | HN request time: 0s | source
Show context
daemonologist ◴[] No.46183891[source]
Interesting - these models are all trained to do pixel-level(ish) measurement now, for bounding boxes and such. I wonder if you could railroad it into being accurate with the right prompt.
replies(2): >>46184095 #>>46184300 #
Lerc ◴[] No.46184095[source]
What models are good at this? I have tried passing images to models and asking them for coordinates for specific features, then overlaid dots on those points and passed that image back to the model so it has a perception of how far out it was. It had a tendency to be consistently off by a fixed amount without getting closer.

I don't doubt that it is possible eventually, but I haven't had much luck.

Something that seemed to assist was drawing a multi coloured transparent chequerboard, if the AI knows the position of the grid colours it can pick out some relative information from the grid.

replies(2): >>46184257 #>>46188645 #
1. ryoshu ◴[] No.46188645[source]
I can't do that either without opening up an image editing tool. Give the model a tool and goal with "vision". Should work better.