289 points sandslash | 1 comment
1. yellow_postit No.44454366
A recent paper: “How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks” [1]

[1] https://arxiv.org/abs/2507.01955