(openai.com)

1019 points atgctg | 1 comments | 11 Dec 25 18:04 UTC | HN request time: 0s | source

https://platform.openai.com/docs/guides/latest-model

System card: https://cdn.openai.com/pdf/3a4153c8-c748-4b71-8e31-aecbde944...

Show context

sigmar ◴[11 Dec 25 18:39 UTC] No.46235197[source]▶

Are there any specifics about how this was trained? Especially when 5.1 is only a month old. I'm a little skeptical of benchmarks these days and wish they put this up on llmarena

edit: noticed 5.2 is ranked in the webdev arena (#2 tied with gemini-3.0-pro), but not yet in text arena (last update 22hrs ago)

replies(2): >>46235312 #>>46235510 #

emp17344 ◴[11 Dec 25 18:57 UTC] No.46235510[source]▶

>>46235197 #

I’m extremely skeptical because of all those articles claiming OpenAI was freaking out about Gemini - now it turns out they just casually had a better model ready to go? I don’t buy it.

replies(4): >>46235534 #>>46236858 #>>46239622 #>>46240400 #

Workaccount2 ◴[11 Dec 25 20:41 UTC] No.46236858[source]▶

>>46235510 #

I (and others) have a strong suspicion that they can modulate models intelligence in almost real time by adjusting quantization and thinking time.

It seems if anyone wants, they can really gas a model up in the moment and back it off after the hype wave.

replies(2): >>46239626 #>>46239819 #

1. bamboozled ◴[12 Dec 25 00:59 UTC] No.46239626[source]▶

>>46236858 #

Yeah I've noticed with Claude, around the time of the Opus 4.5 release, at least for a few days, Sonnet 4.5 was just dumb, but it seems temporary. I feel that redirected resources to Opus.

↑

GPT-5.2