(www.youtube.com)

289 points sandslash | 1 comments | 01 Jul 25 14:00 UTC | HN request time: 0.27s | source

1. yellow_postit ◴[03 Jul 25 12:35 UTC] No.44454366[source]▶

recent paper on “ How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks” [1]

Fei-Fei Li: Spatial intelligence is the next frontier in AI [video]