←back to thread

GPT-5.2

(openai.com)
1053 points atgctg | 1 comments | | HN request time: 0.267s | source
Show context
JanSt ◴[] No.46235165[source]
The benchmarks are very impressive. Codex and Opus 4.5 are really good coders already and they keep getting better.

No wall yet and I think we might have crossed the threshold of models being as good or better than most engineers already.

GDPval will be an interesting benchmark and I'll happily use the new model to test spreadsheet (and other office work) capabilities. If they can going like this just a little bit further, much of the office workers will stop being useful.... I don't know yet how to feel about this.

Great for humanity probably but but for the individuals?

replies(3): >>46235246 #>>46235323 #>>46235593 #
ionwake ◴[] No.46235323[source]
it was only about 2-3 weeks when several HNers told me "nah you better re-check your code", when I explained I have over 2 decades xp of coding, yet have not manually edited code (in memory) for the last 6 or so months, whilst performing daily 12 hour daily vibe code seshes
replies(2): >>46235610 #>>46238441 #
ipsum2 ◴[] No.46235610[source]
It really depends on the complexity of code. I've found models (codex-5.1-max, opus 4.5) to be absolutely useless writing shaders or ML training code, but really good at basic web development.
replies(2): >>46235642 #>>46237574 #
1. sheeshe ◴[] No.46235642[source]
Which is no surprise as the data for web development stuff exists in large amounts on the web that the models feed off.