slacker news
OpenAI o3 and o4-mini
(openai.com)
555 points | maheshrijal | 1 comments | 16 Apr 25 17:01 UTC
brap | 16 Apr 25 17:11 UTC | No. 43707838 | >>43707719 (OP)
Where's the comparison with Gemini 2.5 Pro?
replies(3): >>43707846 >>43707897 >>43708606
gallerdude | 16 Apr 25 17:16 UTC | No. 43707897 | >>43707838
For coding, I like the Aider polyglot benchmark, since it covers multiple programming languages.
Gemini 2.5 Pro got 72.9%
o3 high gets 81.3%, o4-mini high gets 68.9%
replies(4): >>43708090 >>43708632 >>43709557 >>43709763
jumpCastle | 16 Apr 25 20:06 UTC | No. 43709763 | >>43707897
It was a good benchmark until it entered the training set.