They even provide a description of each one in the UI before you select it, and it defaults to a model for you.
If you just want an answer for which one to use and can't be bothered to research them, just use o3-mini (or o4-mini) and call it a day.
But I agree that they probably need some kind of basic mode to make things easier for the average person: one that picks a model automatically and hides the choice from the user entirely.
It sounds like it means "have a bunch of models, one that's an expert in physics, one that's an expert in health, etc., and then pick the one that best fits the user's query".
It's not that. The "experts" are each another giant opaque blob of weights. The model is trained to select one of those blobs, but they don't have any form of human-understandable "expertise". It's an optimization that lets you avoid using ALL of the weights for every run through the model, which helps with performance.
https://huggingface.co/blog/moe#what-is-a-mixture-of-experts... is a decent explanation.
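To make the "opaque blob" point concrete, here's a toy sketch of MoE routing in numpy. The expert count, layer sizes, and top-1 routing are all illustrative assumptions, not any real model's architecture:

    # Toy mixture-of-experts routing sketch (illustrative numbers only).
    import numpy as np

    rng = np.random.default_rng(0)
    d_model, d_hidden, n_experts = 8, 16, 4

    # Each "expert" is just an independent feed-forward block (two
    # weight matrices) -- an opaque blob of weights, nothing more.
    experts = [
        (rng.standard_normal((d_model, d_hidden)),
         rng.standard_normal((d_hidden, d_model)))
        for _ in range(n_experts)
    ]

    # The learned router scores each token against every expert.
    router = rng.standard_normal((d_model, n_experts))

    def moe_layer(x):  # x: (n_tokens, d_model)
        scores = x @ router                # (n_tokens, n_experts)
        choice = scores.argmax(axis=-1)    # top-1 expert per token
        out = np.empty_like(x)
        for i, token in enumerate(x):
            w1, w2 = experts[choice[i]]    # only ONE expert runs here,
            out[i] = np.maximum(token @ w1, 0) @ w2  # the rest sit idle
        return out

    tokens = rng.standard_normal((5, d_model))
    print(moe_layer(tokens).shape)  # (5, 8)

The router's choice is learned end to end. Nothing constrains expert 2 to "know physics"; it ends up holding whatever partition of the weights the training objective found useful.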