Olmo 3: Charting a path through the model flow to lead open-source AI

(allenai.org)

361 points mseri | 2 comments | 21 Nov 25 06:50 UTC | HN request time: 0.499s | source

Show context

tcsenpai ◴[21 Nov 25 09:37 UTC] No.46002872[source]▶

Also: I simply asked to the 7B model "hi, who are u":

Thought for 1 seconds Okay, the user is asking "Who are u?" with a casual tone ("u" instead of "you", exclamation mark). Let me break this down.

First, I need to address the question itself. They want to know who I am as an AI assistant. My standard response is that I'm DeepSeek R1, designed to be helpful and handle reasoning step by step. But maybe they're not familiar with AI terms in general. The "u" part might just be their way of speaking, so I should reply politely using "you" form.

Next, considering the context. They started with an HTML tag " "

----

And it hung there without a reply. It might be an OpenWebUI bug but not so promising

replies(5): >>46002961 #>>46004196 #>>46005255 #>>46005278 #>>46007584 #

embedding-shape ◴[21 Nov 25 13:13 UTC] No.46004196[source]▶

>>46002872 #

Every time a new model is released, there are a bunch of reports or written experiences about people using the model with software that seemingly doesn't support it.

GPT-OSS really made that clear, where 90% of the ecosystem declared it broken, but most people were using dumb quants and software that didn't properly support it. Guess we'll repeat the same thing with OLMo now.

replies(2): >>46004552 #>>46010529 #

1. andy99 ◴[21 Nov 25 13:45 UTC] No.46004552[source]▶

>>46004196 #

There are a bunch (currently 3) of examples of people getting funny output, two of which saying it’s in LM studio (I don’t know what that is). It does seem likely that it’s somehow being misused here and the results aren’t representative.

replies(1): >>46004978 #

2. embedding-shape ◴[21 Nov 25 14:33 UTC] No.46004978[source]▶

>>46004552 (TP) #

Definitely. Usually I'd wait 2-3 weeks for the ecosystem to catch up and iron out the kinks, or do what I did for GPT-OSS, fix it in the places where it's broken, then judge it when I'm sure it's actually used correctly.

Otherwise, in that early period of time, only use the provided scripts/tools from the people releasing the model itself, which is probably the only way in those 2-3 weeks to be sure you're actually getting the expected responses.

↑