(openai.com)

1019 points atgctg | 1 comments | 11 Dec 25 18:04 UTC | HN request time: 0.001s | source

https://platform.openai.com/docs/guides/latest-model

System card: https://cdn.openai.com/pdf/3a4153c8-c748-4b71-8e31-aecbde944...

Show context

minadotcom ◴[11 Dec 25 18:29 UTC] No.46235074[source]▶

They used to compare to competing models from Anthropic, Google DeepMind, DeepSeek, etc. Seems that now they only compare to their own models. Does this mean that the GPT-series is performing worse than its competitors (given the "code red" at OpenAI)?

replies(4): >>46235094 #>>46235110 #>>46235145 #>>46236816 #

Tiberium ◴[11 Dec 25 18:35 UTC] No.46235145[source]▶

>>46235074 #

They did compare it to other models: https://x.com/OpenAI/status/1999182104362668275

https://i.imgur.com/e0iB8KC.png

replies(3): >>46235919 #>>46238146 #>>46241683 #

whimsicalism ◴[11 Dec 25 22:27 UTC] No.46238146[source]▶

>>46235145 #

uh oh, where did SWE bench go :D

replies(1): >>46240109 #

1. whimsicalism ◴[12 Dec 25 02:11 UTC] No.46240109[source]▶

>>46238146 #

maybe they will release with gpt-5.2-codex

↑

GPT-5.2