GPT-5.2

(openai.com)

https://platform.openai.com/docs/guides/latest-model

System card: https://cdn.openai.com/pdf/3a4153c8-c748-4b71-8e31-aecbde944...

Show context

breakingcups ◴[11 Dec 25 18:37 UTC] No.46235173[source]▶

Is it me, or did it still get at least three placements of components (RAM and PCIe slots, plus it's DisplayPort and not HDMI) in the motherboard image[0] completely wrong? Why would they use that as a promotional image?

0: https://images.ctfassets.net/kftzwdyauwt9/6lyujQxhZDnOMruN3f...

replies(10): >>46235244 #>>46235267 #>>46236405 #>>46236591 #>>46237241 #>>46239493 #>>46240735 #>>46241534 #>>46241550 #>>46241781 #

tedsanders ◴[11 Dec 25 18:44 UTC] No.46235267[source]▶

>>46235173 #

Yep, the point we wanted to make here is that GPT-5.2's vision is better, not perfect. Cherrypicking a perfect output would actually mislead readers, and that wasn't our intent.

replies(9): >>46235823 #>>46236007 #>>46236072 #>>46236155 #>>46236158 #>>46236250 #>>46236355 #>>46238538 #>>46241716 #

1. BoppreH ◴[11 Dec 25 19:28 UTC] No.46236007[source]▶

>>46235267 #

That would be a laudable goal, but I feel like it's contradicted by the text:

> Even on a low-quality image, GPT‑5.2 identifies the main regions and places boxes that roughly match the true locations of each component

I would not consider it to have "identified the main regions" or to have "roughly matched the true locations" when ~1/3 of the boxes have incorrect labels. The remark "even on a low-quality image" is not helping either.

Edit: credit where credit is due, the recently-added disclaimer is nice:

> Both models make clear mistakes, but GPT‑5.2 shows better comprehension of the image.

replies(4): >>46236196 #>>46236246 #>>46236990 #>>46242585 #

2. hnuser123456 ◴[11 Dec 25 19:45 UTC] No.46236196[source]▶

>>46236007 (TP) #

Yeah, what it's calling RAM slots is the CMOS battery. What it's calling the PCIE slot is the interior side of the DB-9 connector. RAM slots and PCIE slots are not even visible in the image.

replies(1): >>46238203 #

3. ◴[11 Dec 25 19:50 UTC] No.46236246[source]▶

>>46236007 (TP) #

4. furyofantares ◴[11 Dec 25 20:53 UTC] No.46236990[source]▶

>>46236007 (TP) #

They also changed "roughly match" to "sometimes match".

replies(1): >>46237477 #

5. MichaelZuo ◴[11 Dec 25 21:34 UTC] No.46237477[source]▶

>>46236990 #

Did they really change a meaningful word like that after publication without an edit note…?

replies(2): >>46237734 #>>46237877 #

6. piker ◴[11 Dec 25 21:54 UTC] No.46237734{3}[source]▶

>>46237477 #

Eh, I'm no shill but their marketing copy isn't exactly the New York Times. They're given some license to respond to critical feedback in a manner that makes the statements more accurate without the same expectations of being objective journalism of record.

replies(1): >>46241558 #

7. dwohnitmok ◴[11 Dec 25 22:04 UTC] No.46237877{3}[source]▶

>>46237477 #

This has definitely happened before with e.g. the o1 release. I will sometimes use the Wayback Machine to verify changes that have been made.

8. hexaga ◴[11 Dec 25 22:30 UTC] No.46238203[source]▶

>>46236196 #

It just overlaid a typical ATX pattern across the motherboard-like parts of the image, even if that's not really what the image is showing. I don't think it's worthwhile to consider this a 'local recognition failure', as if it just happened to mistake CMOS for RAM slots.

Imagine it as a markdown response:

# Why this is an ATX layout motherboard (Honest assessment, straight to the point, *NO* hallucinations)

1. *RAM* as you can clearly see, the RAM slots are to the right of the CPU, so it's obviously ATX

2. *PCIE* the clearly visible PCIE slots are right there at the bottom of the image, so this definitely cannot be anything except an ATX motherboard

3. ... etc more stuff that is supported only by force of preconception

It's just meta signaling gone off the rails. Something in their post-training pipeline is obviously vulnerable given how absolutely saturated with it their model outputs are.

Troubling that the behavior generalizes to image labeling, but not particularly surprising. This has been a visible problem at least since o1, and the lack of change tells me they do not have a real solution.

9. mkesper ◴[12 Dec 25 06:58 UTC] No.46241558{4}[source]▶

>>46237734 #

Yes, but they should clearly mark updates. That would be professional.

10. guerrilla ◴[12 Dec 25 10:02 UTC] No.46242585[source]▶

>>46236007 (TP) #

Leave it to OpenAI to be dishonest about being dishonest. It seems they're also editing this post without notice as well.

↑