GPT-5.2

(openai.com)

1053 points atgctg | 1 comments | 11 Dec 25 18:04 UTC | HN request time: 0.267s | source

https://platform.openai.com/docs/guides/latest-model

System card: https://cdn.openai.com/pdf/3a4153c8-c748-4b71-8e31-aecbde944...

Show context

JanSt ◴[11 Dec 25 18:36 UTC] No.46235165[source]▶

The benchmarks are very impressive. Codex and Opus 4.5 are really good coders already and they keep getting better.

No wall yet and I think we might have crossed the threshold of models being as good or better than most engineers already.

GDPval will be an interesting benchmark and I'll happily use the new model to test spreadsheet (and other office work) capabilities. If they can going like this just a little bit further, much of the office workers will stop being useful.... I don't know yet how to feel about this.

Great for humanity probably but but for the individuals?

replies(3): >>46235246 #>>46235323 #>>46235593 #

ionwake ◴[11 Dec 25 18:46 UTC] No.46235323[source]▶

>>46235165 #

it was only about 2-3 weeks when several HNers told me "nah you better re-check your code", when I explained I have over 2 decades xp of coding, yet have not manually edited code (in memory) for the last 6 or so months, whilst performing daily 12 hour daily vibe code seshes

replies(2): >>46235610 #>>46238441 #

ipsum2 ◴[11 Dec 25 19:03 UTC] No.46235610[source]▶

>>46235323 #

It really depends on the complexity of code. I've found models (codex-5.1-max, opus 4.5) to be absolutely useless writing shaders or ML training code, but really good at basic web development.

replies(2): >>46235642 #>>46237574 #

1. sheeshe ◴[11 Dec 25 19:04 UTC] No.46235642[source]▶

>>46235610 #

Which is no surprise as the data for web development stuff exists in large amounts on the web that the models feed off.

↑