Qwen2.5-VL-32B: Smarter and Lighter

(qwenlm.github.io)

544 points tosh | 1 comments | 24 Mar 25 18:35 UTC | HN request time: 0.202s | source

Show context

simonw ◴[24 Mar 25 18:52 UTC] No.43464227[source]▶

Big day for open source Chinese model releases - DeepSeek-v3-0324 came out today too, an updated version of DeepSeek v3 now under an MIT license (previously it was a custom DeepSeek license). https://simonwillison.net/2025/Mar/24/deepseek/

replies(5): >>43464375 #>>43464498 #>>43464686 #>>43465383 #>>43467111 #

echelon ◴[24 Mar 25 19:20 UTC] No.43464498[source]▶

>>43464227 #

Pretty soon I won't be using any American models. It'll be a 100% Chinese open source stack.

The foundation model companies are screwed. Only shovel makers (Nvidia, infra companies) and product companies are going to win.

replies(7): >>43464607 #>>43464651 #>>43464792 #>>43466340 #>>43466493 #>>43469085 #>>43469922 #

jsheard ◴[24 Mar 25 19:32 UTC] No.43464607[source]▶

>>43464498 #

I still don't get where the money for new open source models is going to come from once setting investor dollars on fire is no longer a viable business model. Does anyone seriously expect companies to keep buying and running thousands of ungodly expensive GPUs, plus whatever they spend on human workers to do labelling/tuning, and then giving away the spoils for free, forever?

replies(12): >>43464649 #>>43464673 #>>43464679 #>>43464701 #>>43464720 #>>43464725 #>>43465054 #>>43465195 #>>43465674 #>>43467099 #>>43470575 #>>43471233 #

finnjohnsen2 ◴[24 Mar 25 19:37 UTC] No.43464649[source]▶

>>43464607 #

ads again. somehow. its like a law of nature.

replies(1): >>43464678 #

api ◴[24 Mar 25 19:40 UTC] No.43464678[source]▶

>>43464649 #

If nationalist propaganda counts as ads, that might already be supporting Chinese models. Ask them about Tiananmen Square.

Any kind of media with zero or near zero copying/distribution costs becomes a deflationary race to the bottom. Someone will eventually release something that's free, and at that point nothing can compete with free unless it's some kind of very specialized offering. Then you run into a the problem the OP described: how do you fund free? Answer: ads. Now the customer is the advertiser, not the user/consumer, which is why most media converges on trash.

replies(1): >>43464740 #

Imustaskforhelp ◴[24 Mar 25 19:48 UTC] No.43464740[source]▶

>>43464678 #

These ads can also have ads blockers though.

Perplexity released the deepseek r1 1331? ( I am not sure I forgot) It basically removes chinese censorships / yes you can ask it about the tiananmen square.

I think the next iteration of these ai model ads would be sneaky which might be hard to remove

Though it's funny you comment about chinese censorship yet american censorship is fine lol

replies(2): >>43464820 #>>43466389 #

Zambyte ◴[24 Mar 25 23:12 UTC] No.43466389[source]▶

>>43464740 #

There are lots of "alliterated" versions of models too, which is where people will essentially remove the models ability to reject responding to a prompt. The huihui r1 14b alliterated had some trouble telling me about tiananmen square, basically dodging the question by telling me about itself, but after some coaxing I was able to get the info out of it.

I say this because I think that the Perplexity model is tuned on additional information, whereas the alliterated models only include information trained into the underlying model, which is interesting to see.

replies(1): >>43471651 #

bigfudge ◴[25 Mar 25 14:16 UTC] No.43471651[source]▶

>>43466389 #

Abliterated? Alliterated LLMs might be fun though…

replies(1): >>43480359 #

1. Zambyte ◴[26 Mar 25 09:36 UTC] No.43480359[source]▶

>>43471651 #

Oops, yeah I don't know how that got autocorrected three times without my noticing. Abliterated.

↑