361 points mseri | 14 comments
1. stavros No.46002252
> the best fully open 32B-scale thinking model

It's absolutely fantastic that they're releasing an actually OSS model, but isn't "the best fully open" a bit of a low bar? I'm not aware of any other fully open models.

replies(9): >>46002293 >>46002338 >>46002597 >>46002842 >>46002944 >>46003313 >>46004177 >>46006028 >>46006176
2. sanxiyn No.46002293
Yeah. There are other fully open models, like Hugging Face's SmolLM, but they are not common.
3. glemmaPaul No.46002338
Well, if open source is one of your USPs, then you'd better mention it, right? Open-source people tend to like it when their work is... open source.

And otherwise you're competing head-to-head with not-so-OpenAI, or, say, Llama.

replies(1): >>46002350
4. stavros No.46002350
My observation was more about "best" than about "fully open". It's like Apple saying "this is the best iPhone" about every new iPhone.
5. psychoslave No.46002597
You need to learn to walk before you can run.
6. shoffmeister No.46002842
Switzerland, through EPFL, ETH Zurich, and the Swiss National Supercomputing Centre, has released a complete pipeline with all training data - that is "fully open", to my understanding.

See https://www.swiss-ai.org/apertus for details.

https://ethz.ch/en/news-and-events/eth-news/news/2025/07/a-l... was the press release.

replies(1): >>46002918
7. YetAnotherNick No.46002918
All the data used by Apertus is just data processed or generated by American companies (NVIDIA, Apple, and Hugging Face, mostly). They didn't release any new data.

Olmo and HF not only processed the data to address language bias, they also published a lot of data-augmentation results, including European-language performance. European LLMs just claim language bias as the motivator.

8. maxloh No.46002944
AFAIK, when they say "fully open", they mean an open dataset and open training code. The Olmo models are the only mainstream models that satisfy this requirement, hence the qualifier.

> We go beyond just releasing model weights - we provide our training code, training data, our model weights, and our recipes.

https://docs.allenai.org/#truly-open
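
Concretely, "fully open" means you can pull down both the weights and the training corpus. A minimal sketch, assuming Ai2-style Hugging Face repo IDs (the exact names below are my guesses, so check their org page):

    # Load the released weights, then stream the released training data.
    # Repo IDs are assumptions about Ai2's Hugging Face naming.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from datasets import load_dataset

    MODEL_ID = "allenai/OLMo-2-1124-7B"  # assumed checkpoint name

    tok = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    out = model.generate(**tok("Fully open models let you", return_tensors="pt"),
                         max_new_tokens=20)
    print(tok.decode(out[0]))

    # Weights-only releases stop above; a fully open one also publishes the data.
    corpus = load_dataset("allenai/dolma", split="train", streaming=True)  # assumed dataset ID
    print(next(iter(corpus)))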

replies(1): >>46003698
10. stavros No.46003698
Yes, and that's why saying this is "the best" is a tautology. If it's the only one, it's obviously the best, and the worst, and everything.
11. fwip No.46004177
There are a lot of fully open models made by hobbyists, and some by researchers. If you've only heard of this one, it's likely because it's the closest to being competitive with closed models.
12. comp_raccoon No.46006028
Olmo author here… it would be nice to have some more competition! I don't like that we're so lonely either.

We're competitive with open-weights models in general, just a couple of points behind the best Qwen.

Fully open models are important for the research community; a lot of fundamental discoveries are made when you have access to the training data. We call out that we're the best fully open model because researchers would want to know that.
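
For example, with the data in hand you can check whether a benchmark question already appears in the training corpus (test-set contamination). A toy sketch, not our actual tooling; the dataset ID and "text" field are assumptions:

    # Toy contamination check over a streamed open corpus.
    # Dataset ID and document schema are assumptions, not from the thread.
    from datasets import load_dataset

    probe = "What is the capital of France?"
    corpus = load_dataset("allenai/dolma", split="train", streaming=True)  # assumed ID

    for i, doc in enumerate(corpus):
        if probe in doc["text"]:  # assumes each record has a "text" field
            print(f"possible contamination in document {i}")
            break
        if i >= 100_000:  # bound the scan for the sketch; a real check would index
            break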

replies(1): >>46006051
13. stavros No.46006051
Makes sense, thanks!
14. fnbr No.46006176
(I'm a researcher on Olmo.)

There are a bunch of other fully open models, including the Marin series out of Stanford (https://marin.community/), and NVIDIA regularly releases fully open models as well.