
GPT-5.2 (openai.com)
1019 points by atgctg | 2 comments
sigmar No.46235197
Are there any specifics about how this was trained? Especially when 5.1 is only a month old. I'm a little skeptical of benchmarks these days and wish they'd put this up on LMArena.

edit: noticed 5.2 is ranked in the WebDev Arena (#2, tied with gemini-3.0-pro), but not yet in the text arena (last update 22 hrs ago)

replies(2): >>46235312 >>46235510
emp17344 No.46235510
I’m extremely skeptical because of all those articles claiming OpenAI was freaking out about Gemini - now it turns out they just casually had a better model ready to go? I don’t buy it.
replies(4): >>46235534 >>46236858 >>46239622 >>46240400
Workaccount2 No.46236858
I (and others) have a strong suspicion that they can modulate a model's intelligence in near real time by adjusting quantization and thinking time.

It seems that, if they want, they can really gas a model up in the moment and dial it back once the hype wave passes.
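Purely as an illustration of the suspected dial (every name and value below is hypothetical, not any known OpenAI configuration), the idea amounts to a deployment-side profile switch:

    # Hypothetical sketch only: a serving profile that trades quality
    # for cost. Nothing here reflects any known OpenAI configuration.
    SERVING_PROFILES = {
        "launch_week":  {"weights": "fp8", "reasoning_effort": "high"},
        "steady_state": {"weights": "fp4", "reasoning_effort": "low"},
    }

    def pick_profile(hype_wave: bool) -> dict:
        """Return the serving knobs for the current traffic regime."""
        return SERVING_PROFILES["launch_week" if hype_wave else "steady_state"]

    print(pick_profile(hype_wave=True))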

replies(2): >>46239626 >>46239819
qeternity No.46239819
Quantization is not some magical dial you can just turn. In practice you basically have 3 choices: fp16, fp8 and fp4.
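To make the discreteness concrete, here's a minimal PyTorch sketch (the dtypes are illustrative; production stacks bake quantization into the checkpoint at load time rather than flipping it per request):

    # Minimal sketch: casting weights between the few supported formats.
    # Requires PyTorch >= 2.1 for the float8 dtype; fp4 has no native
    # torch dtype and needs a dedicated kernel/library.
    import torch

    w_fp16 = torch.randn(4096, 4096, dtype=torch.float16)  # master weights
    w_fp8 = w_fp16.to(torch.float8_e4m3fn)                  # one discrete step down

    print(w_fp16.element_size(), w_fp8.element_size())      # 2 bytes vs. 1 byte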

Also, more thinking time means more tokens, which cost more, especially at the API level, where you are paying per token and any change would be trivially observable.
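For instance, reasoning tokens are itemized in the usage block of an API response, so a quiet change in thinking budget would surface in the bill (a sketch using OpenAI's Python client; the model name is just the one under discussion):

    # Sketch of how an API caller would notice a thinking-time change.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    resp = client.chat.completions.create(
        model="gpt-5.2",
        messages=[{"role": "user", "content": "What is 2 + 2?"}],
    )
    # Reasoning tokens are billed and reported separately, so any
    # behind-the-scenes boost to thinking time is directly visible.
    print(resp.usage.completion_tokens_details.reasoning_tokens)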

There is basically no evidence that either of these is occurring in the way you suggest (boosting up and down).

replies(1): >>46240543
Workaccount2 No.46240543
API users probably wouldn't be affected since they are paying in full. Most people complaining are free users, followed by $20/mo users.