
GPT-5.2

(openai.com)
1019 points by atgctg | 4 comments
zone411 No.46236209
I've benchmarked it on the Extended NYT Connections benchmark (https://github.com/lechmazur/nyt-connections/):

The high-reasoning version of GPT-5.2 improves on GPT-5.1: 69.9 → 77.9.

The medium-reasoning version also improves: 62.7 → 72.1.

The no-reasoning version also improves: 22.1 → 27.5.

Gemini 3 Pro and Grok 4.1 Fast Reasoning still score higher.

replies(4): >>46236325 #>>46236642 #>>46237650 #>>46241682 #
scrollop No.46237650
Why no grok 4.1 reasoning?
replies(1): >>46239494 #
sanex No.46239494
Do people other than Elon fans use grok? Honest question. I've never tried it.
replies(8): >>46240950 #>>46241184 #>>46241391 #>>46241742 #>>46241796 #>>46241902 #>>46242564 #>>46242875 #
1. mac-attack No.46240950{3}
I can't understand why people would trust a CEO who regularly lies about product timelines, product features, his own personal life, etc. And that's before he politicized his entire kingdom by literally becoming a part of the government and one of the larger donors to the current administration.
replies(3): >>46241645 #>>46241905 #>>46242316 #
2. lkjdsklf No.46241645
If we stopped using the products of every company whose CEO lied about them, we'd all be sitting in caves staring at the dirt.
3. fatata123 No.46241905
Because not everyone makes their decisions through the prism of politics.
4. delaminator No.46242316
You’re not narrowing it down.