GPT-5.2

(openai.com)
1019 points | atgctg | 3 comments
breakingcups | No.46235173
Is it just me, or did it still get at least three component placements (the RAM and PCIe slots, plus it's DisplayPort, not HDMI) in the motherboard image[0] completely wrong? Why would they use that as a promotional image?

0: https://images.ctfassets.net/kftzwdyauwt9/6lyujQxhZDnOMruN3f...

1. jasonlotito | No.46236591
FTA: Both models make clear mistakes, but GPT‑5.2 shows better comprehension of the image.

You can find it right next to the image you are talking about.

2. tedsanders | No.46236847
To be fair to OP, I only added this to our blog after their comment, in response to the valid criticism that our text didn't make clear how bad GPT-5.2's labels are.

LLMs have always been very subhuman at vision, and GPT-5.2 continues in this tradition, but it's still a big step up over GPT-5.1.

One way to get a sense of how bad LLMs are at vision is to watch them play Pokemon, e.g.: https://www.lesswrong.com/posts/u6Lacc7wx4yYkBQ3r/insights-i...

They still very much struggle with basic vision tasks that adults, kids, and even animals can ace with little trouble.

3. da_grift_shift | No.46237091
'Commented after article was already edited in response to HN feedback' award