Qwen2.5-VL-32B: Smarter and Lighter

(qwenlm.github.io)

544 points tosh | 3 comments | 24 Mar 25 18:35 UTC | HN request time: 0s | source

Show context

simonw ◴[24 Mar 25 18:52 UTC] No.43464227[source]▶

Big day for open source Chinese model releases - DeepSeek-v3-0324 came out today too, an updated version of DeepSeek v3 now under an MIT license (previously it was a custom DeepSeek license). https://simonwillison.net/2025/Mar/24/deepseek/

replies(5): >>43464375 #>>43464498 #>>43464686 #>>43465383 #>>43467111 #

echelon ◴[24 Mar 25 19:20 UTC] No.43464498[source]▶

>>43464227 #

Pretty soon I won't be using any American models. It'll be a 100% Chinese open source stack.

The foundation model companies are screwed. Only shovel makers (Nvidia, infra companies) and product companies are going to win.

replies(7): >>43464607 #>>43464651 #>>43464792 #>>43466340 #>>43466493 #>>43469085 #>>43469922 #

refulgentis ◴[24 Mar 25 19:54 UTC] No.43464792[source]▶

>>43464498 #

I've been waiting since November for 1, just 1*, model other than Claude than can reliably do agentic tool call loops. As long as the Chinese open models are chasing reasoning and benchmark maxxing vs. mid-2024 US private models, I'm very comfortable with somewhat ignoring these models.

(this isn't idle prognostication hinging on my personal hobby horse. I got skin in the game, I'm virtually certain I have the only AI client that is able to reliably do tool calls with open models in an agentic setting. llama.cpp got a massive contribution to make this happen and the big boys who bother, like ollama, are still using a dated json-schema-forcing method that doesn't comport with recent local model releases that can do tool calls. IMHO we're comfortably past a point where products using these models can afford to focus on conversational chatbots, thats cute but a commodity to give away per standard 2010s SV thinking)

* OpenAI's can but are a little less...grounded?...situated? i.e. it can't handle "read this file and edit it to do $X". Same-ish for Gemini, though, sometimes I feel like the only person in the world who actually waits for the experimental models to go GA, as per letter of the law, I shouldn't deploy them until then

replies(3): >>43464831 #>>43472567 #>>43473947 #

cess11 ◴[25 Mar 25 17:47 UTC] No.43473947[source]▶

>>43464792 #

You mean like https://manusai.ai/ is supposed to function?

replies(1): >>43475271 #

1. refulgentis ◴[25 Mar 25 20:01 UTC] No.43475271[source]▶

>>43473947 #

Yes, exactly, and no trivially: Manus is Sonnet with tools

replies(1): >>43479830 #

2. cess11 ◴[26 Mar 25 07:53 UTC] No.43479830[source]▶

>>43475271 (TP) #

Right. Apparently they also claim it's more than that:

https://xcancel.com/peakji/status/1898997311646437487

replies(1): >>43501400 #

3. refulgentis ◴[28 Mar 25 04:01 UTC] No.43501400[source]▶

>>43479830 #

No, they don't, that's just a bunch of other stuff (ex. Something something we don't differ from academic papers on agents (???))

↑