←back to thread

361 points mseri | 2 comments | | HN request time: 0.499s | source
Show context
tcsenpai ◴[] No.46002872[source]
Also: I simply asked to the 7B model "hi, who are u":

Thought for 1 seconds Okay, the user is asking "Who are u?" with a casual tone ("u" instead of "you", exclamation mark). Let me break this down.

First, I need to address the question itself. They want to know who I am as an AI assistant. My standard response is that I'm DeepSeek R1, designed to be helpful and handle reasoning step by step. But maybe they're not familiar with AI terms in general. The "u" part might just be their way of speaking, so I should reply politely using "you" form.

Next, considering the context. They started with an HTML tag " "

----

And it hung there without a reply. It might be an OpenWebUI bug but not so promising

replies(5): >>46002961 #>>46004196 #>>46005255 #>>46005278 #>>46007584 #
embedding-shape ◴[] No.46004196[source]
Every time a new model is released, there are a bunch of reports or written experiences about people using the model with software that seemingly doesn't support it.

GPT-OSS really made that clear, where 90% of the ecosystem declared it broken, but most people were using dumb quants and software that didn't properly support it. Guess we'll repeat the same thing with OLMo now.

replies(2): >>46004552 #>>46010529 #
1. andy99 ◴[] No.46004552[source]
There are a bunch (currently 3) of examples of people getting funny output, two of which saying it’s in LM studio (I don’t know what that is). It does seem likely that it’s somehow being misused here and the results aren’t representative.
replies(1): >>46004978 #
2. embedding-shape ◴[] No.46004978[source]
Definitely. Usually I'd wait 2-3 weeks for the ecosystem to catch up and iron out the kinks, or do what I did for GPT-OSS, fix it in the places where it's broken, then judge it when I'm sure it's actually used correctly.

Otherwise, in that early period of time, only use the provided scripts/tools from the people releasing the model itself, which is probably the only way in those 2-3 weeks to be sure you're actually getting the expected responses.