/top/
/new/
/best/
/ask/
/show/
/job/
^
slacker news
login
about
←back to thread
OpenAI o3 and o4-mini
(openai.com)
555 points
maheshrijal
| 1 comments |
16 Apr 25 17:01 UTC
|
HN request time: 0.209s
|
source
Show context
brap
◴[
16 Apr 25 17:11 UTC
]
No.
43707838
[source]
▶
>>43707719 (OP)
#
Where's the comparison with Gemini 2.5 Pro?
replies(3):
>>43707846
#
>>43707897
#
>>43708606
#
gallerdude
◴[
16 Apr 25 17:16 UTC
]
No.
43707897
[source]
▶
>>43707838
#
For coding, I like the Aider polyglot benchmark, since it covers multiple programming languages.
Gemini 2.5 Pro got 72.9%
o3 high gets 81.3%, o4-mini high gets 68.9%
replies(4):
>>43708090
#
>>43708632
#
>>43709557
#
>>43709763
#
1.
croemer
◴[
16 Apr 25 19:45 UTC
]
No.
43709557
[source]
▶
>>43707897
#
Isn't it easy to train on the specific Exercism exercises that this benchmark uses?
ID:
GO
↑