GPT-5.2

(openai.com)

1019 points atgctg | 4 comments | 11 Dec 25 18:04 UTC

https://platform.openai.com/docs/guides/latest-model

System card: https://cdn.openai.com/pdf/3a4153c8-c748-4b71-8e31-aecbde944...

zone411 ◴[11 Dec 25 19:46 UTC] No.46236209[source]▶

>>46234788 (OP) #

I've benchmarked it on the Extended NYT Connections benchmark (https://github.com/lechmazur/nyt-connections/):

The high-reasoning version of GPT-5.2 improves on GPT-5.1: 69.9 → 77.9.

The medium-reasoning version also improves: 62.7 → 72.1.

The no-reasoning version also improves: 22.1 → 27.5.

Gemini 3 Pro and Grok 4.1 Fast Reasoning still score higher.
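For anyone who wants the size of each jump, here is a rough sketch that computes the absolute and relative gains from the three score pairs quoted above. The names `scores` and `gains` are made up for illustration and are not part of the benchmark repo; the numbers are copied from this comment only.

```python
# Scores quoted in the comment above: (GPT-5.1, GPT-5.2) per reasoning setting.
# The dictionary layout and function name are illustrative assumptions.
scores = {
    "high": (69.9, 77.9),
    "medium": (62.7, 72.1),
    "none": (22.1, 27.5),
}


def gains(before, after):
    """Return (absolute gain in points, relative gain in percent), rounded to 1 decimal."""
    absolute = after - before
    relative = 100.0 * absolute / before
    return round(absolute, 1), round(relative, 1)


results = {name: gains(old, new) for name, (old, new) in scores.items()}

for name, (absolute, relative) in results.items():
    print(f"{name:>6}: +{absolute} points ({relative}% relative)")
```

On these numbers the no-reasoning version has the smallest absolute gain but the largest relative one, since it starts from a much lower base.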

replies(4): >>46236325 #>>46236642 #>>46237650 #>>46241682 #

scrollop ◴[11 Dec 25 21:47 UTC] No.46237650[source]▶

Why no grok 4.1 reasoning?

replies(1): >>46239494 #

sanex ◴[12 Dec 25 00:43 UTC] No.46239494[source]▶

Do people other than Elon fans use grok? Honest question. I've never tried it.

replies(8): >>46240950 #>>46241184 #>>46241391 #>>46241742 #>>46241796 #>>46241902 #>>46242564 #>>46242875 #

1. mac-attack ◴[12 Dec 25 04:45 UTC] No.46240950{3}[source]▶

I can't understand why people would trust a CEO who regularly lies about product timelines, product features, his own personal life, etc. And that's before politicizing his entire kingdom by literally becoming a part of government and one of the larger donors to the current administration.

replies(3): >>46241645 #>>46241905 #>>46242316 #

2. lkjdsklf ◴[12 Dec 25 07:14 UTC] No.46241645[source]▶

>>46240950 (TP) #

If we stopped using products of every company that had a CEO that lied about their products, we’d all be sitting in caves staring at the dirt

3. fatata123 ◴[12 Dec 25 08:02 UTC] No.46241905[source]▶

>>46240950 (TP) #

Because not everyone makes their decisions through the prism of politics

4. delaminator ◴[12 Dec 25 09:17 UTC] No.46242316[source]▶

>>46240950 (TP) #

You’re not narrowing it down.