OpenAI o3 and o4-mini | slacker news

1. iamronaldo ◴[16 Apr 25 17:47 UTC] No.43708332[source]▶

Tyler cowen seems convinced https://marginalrevolution.com/marginalrevolution/2025/04/o3...

2. jonahx ◴[16 Apr 25 21:22 UTC] No.43710548[source]▶

It can't solve this puzzle: https://i.imgur.com/AJqbqHJ.png

    Thought for 3m 51s
    Short answer → you can’t.

The breathtaking thing is not the model itself, but that someone as smart as Cowen (and he's not the only one) is uttering "AGI" in the same sentence as any of these models. Now, I'm not a hater, and for many tasks they are amazing, but they are, as of now, not even close to AGI, by any reasonable definition.

replies(2): >>43710949 #>>43711155 #

3. AIPedant ◴[16 Apr 25 22:14 UTC] No.43710949[source]▶

>>43710548 #

  I think it is AGI, seriously.  Try asking it lots of questions, and then ask yourself: just how much smarter was I expecting AGI to be?

That's his whole argument!!!! This is so frustrating coming from a public intellectual. "You don't need rigorous reasoning to answer these questions, baybeee, just go with your vibes." Complete and total disregard for scientific thinking, in favor of confirmation bias and ideology.

4. neonbjb ◴[16 Apr 25 22:45 UTC] No.43711155[source]▶

>>43710548 #

I work for openai.

o4-mini gets much closer (but I'm pretty sure it fumbles at the last moment): https://chatgpt.com/share/680031fb-2bd0-8013-87ac-941fa91cea...

We're pretty bad at model naming and communicating capabilities (in our defense, it's hard!), but o4-mini is actually a _considerably_ better vision model than o3, despite the benchmarks. Similar to how o3-mini-high was a much better coding model than o1. I would recommend using o4-mini-high over o3 for any task involving vision.

replies(1): >>43711469 #

5. jonahx ◴[16 Apr 25 23:34 UTC] No.43711469{3}[source]▶

>>43711155 #

Thanks for the reply. I am not sure the vision is the failing point here, but logic. I routinely try to get these models to solve difficult puzzles or coding challenges (the kind that a good undergrad math major could probably solve, but that most would struggle with). They fail almost always. Even with help.

For example, JaneStreet monthly puzzles. Surprisingly, the new o3 was able to solve this months (previous models were not), which was an easier one. Believe me, I am not trying to minimize the overall achievement -- what it can do incredible -- but I don't believe the phrase AGI should even be mentioned until we are seeing solutions to problems that most professional mathematicians would struggle with, including solutions to unsolved problems.

That might not be enough even, but that should be the minimum bar for even having the conversation.

replies(1): >>43712866 #

6. og_kalu ◴[17 Apr 25 03:34 UTC] No.43712866{4}[source]▶

>>43711469 #

>what it can do incredible -- but I don't believe the phrase AGI should even be mentioned until we are seeing solutions to problems that most professional mathematicians would struggle with, including solutions to unsolved problems.

But Why ? Why should Artificial General Intelligence preclude things a good chunk of humans wouldn't be able to do ? Are those guys no longer General Intelligences ?

I'm not saying this definition is 'wrong' but you have to realize at this point, the individual words of that acronym no longer mean anything.

replies(1): >>43713364 #

7. jonahx ◴[17 Apr 25 05:23 UTC] No.43713364{5}[source]▶

>>43712866 #

Sure, there's no authority who stamps the official definition.

I'll make my case. To me, if you look at how the phrase is usually used -- "when humans have achieved AGI...", etc -- it evokes a science fiction turning point that implies superhuman performance in more or less every intellectual task. It's general, after all. I think of Hal or the movie Her. It's not "Artifical General Just-Like-Most-People-You-Know Intelligence". Though we are not there yet, either, if you consider the full spectrum of human abilities.

Few things would demonstrate general superhuman reasoning ability more definitively than machines producing new, useful, influential math results at a faster rate than people. With that achieved, you would expect it could start writing fiction and screenplays and comedy as well as people too (it's still very far imo), but maybe not, maybe those skills develop at different paces, and I still wouldn't want to call it AGI. But I think truly conquering mathematics would get me there.

replies(2): >>43714845 #>>43748035 #

8. nopinsight ◴[17 Apr 25 10:08 UTC] No.43714845{6}[source]▶

>>43713364 #

A standard term people use for what you describe is superintelligence, not AGI.

Current frontier models are better than average humans in many skills but worse in others. Ethan Mollick calls it “jagged frontier” which sounds about right.

9. spprashant ◴[17 Apr 25 14:13 UTC] No.43717166[source]▶

>>43708332 (TP) #

Tyler Cowen is someone I take seriously. I think he is one of the most rational thought leaders.

But I have to say, his views on LLMs seem a little premature. He definitely has a unique viewpoint of what "general intelligence" is, which might not apply broadly to most jobs. I think "interviews" them like they were a guest on his podcast and bases his judgement on how they compare to his other extremely smart guests.

10. HDThoreaun ◴[21 Apr 25 02:28 UTC] No.43748035{6}[source]▶

>>43713364 #

youre describing superintelligence. This is why these conversations always need to start with definitions