
    544 points tosh | 17 comments
    simonw ◴[] No.43464227[source]
    Big day for open source Chinese model releases - DeepSeek-v3-0324 came out today too, an updated version of DeepSeek v3 now under an MIT license (previously it was a custom DeepSeek license). https://simonwillison.net/2025/Mar/24/deepseek/
    replies(5): >>43464375 #>>43464498 #>>43464686 #>>43465383 #>>43467111 #
    chaosprint ◴[] No.43464375[source]
    it seems that this free version "may use your prompts and completions to train new models"

    https://openrouter.ai/deepseek/deepseek-chat-v3-0324:free

    do you think this needs attention?

    replies(7): >>43464399 #>>43464480 #>>43464512 #>>43464616 #>>43464961 #>>43468548 #>>43470210 #
    1. huijzer ◴[] No.43464512[source]
    Since we are on HN here, I can highly recommend open-webui with some OpenAI-compatible provider. I've been running it with Deep Infra for more than a year now and am very happy. New models are usually available within one or two days after release. I also have some friends who use the service almost daily.
    replies(7): >>43464718 #>>43465081 #>>43466430 #>>43466464 #>>43466949 #>>43469369 #>>43473139 #
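    A minimal sketch of the setup described above, assuming the official Docker image. `OPENAI_API_BASE_URL` and `OPENAI_API_KEY` are open-webui's documented environment variables; the Deep Infra endpoint URL below is an assumption, so substitute whatever base URL your provider gives you:

    ```shell
    # Sketch: run open-webui pointed at an OpenAI-compatible provider.
    # The base URL shown is assumed -- swap in your provider's endpoint.
    docker run -d -p 3000:8080 \
      -e OPENAI_API_BASE_URL=https://api.deepinfra.com/v1/openai \
      -e OPENAI_API_KEY=your-key-here \
      -v open-webui:/app/backend/data \
      --name open-webui \
      ghcr.io/open-webui/open-webui:main
    ```

    After it starts, the UI is on http://localhost:3000 and models offered by the provider show up in the model picker.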
    2. unquietwiki ◴[] No.43464718[source]
    I'm using open-webui at home with a couple of different models. gemma2-9b fits in VRAM on an NV 3060 card and performs nicely.
    replies(2): >>43465594 #>>43469350 #
    3. l72 ◴[] No.43465081[source]
    I too run open-webui locally and use deepinfra.com as my backend. It has been working very well, and I am quite happy with deepinfra's pricing and privacy policy.

    I have set up the same thing at work for my colleagues, and they find it better than openai for their tasks.

    replies(1): >>43468529 #
    4. zakki ◴[] No.43465594[source]
    What is the memory of your NV3060? 8GB?
    replies(1): >>43466243 #
    5. ngvjmfgb ◴[] No.43466243{3}[source]
    12GB (edit: that is what mine is)
    6. ◴[] No.43466430[source]
    7. totetsu ◴[] No.43466464[source]
    And it’s quite easy to set up a Cloudflare tunnel to make your open-webui instance accessible online to just you, too.
    replies(1): >>43466484 #
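    A sketch of the quickest version of this, assuming open-webui is listening on local port 8080 (the port is an assumption; adjust to your install):

    ```shell
    # Sketch: expose a local open-webui via a Cloudflare quick tunnel.
    # Prints a randomly generated trycloudflare.com URL when it connects.
    cloudflared tunnel --url http://localhost:8080

    # A quick tunnel URL is public to anyone who finds it; for access
    # restricted to "just you", the usual approach is a named tunnel
    # plus a Cloudflare Access policy (e.g. limited to your email).
    ```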
    8. simonw ◴[] No.43466484[source]
    ... or a Tailscale network. I've been leaving open-webui running on my laptop on my desk and then going out into the world and accessing it from my phone via Tailscale, works great.
    replies(2): >>43466956 #>>43467521 #
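    A rough sketch of the Tailscale version, again assuming open-webui on local port 8080 (the port and hostname are assumptions):

    ```shell
    # Sketch: reach a laptop-hosted open-webui from a phone on the same tailnet.
    # On the laptop:
    tailscale up            # join your tailnet
    tailscale status        # note the laptop's tailnet name / 100.x.y.z IP

    # On the phone, with the Tailscale app connected to the same tailnet,
    # browse to http://<laptop-tailnet-name>:8080
    ```

    Nothing is exposed to the public internet; only devices in your tailnet can reach the service.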
    9. wkat4242 ◴[] No.43466949[source]
    Yeah OpenWebUI is great with local models too. I love it. You can even do a combo, send the same prompt to local and cloud and even various providers and compare the results.
    10. wkat4242 ◴[] No.43466956{3}[source]
    Yeah, this sounds like the more secure option; you don't want to be one web-service flaw away from exposure.
    11. totetsu ◴[] No.43467521{3}[source]
    I would use Tailscale, but I specifically want to use open-webui from a place where I can’t install a Tailscale client.
    replies(1): >>43468733 #
    12. jychang ◴[] No.43468529[source]
    Yeah, open-webui is the best frontend for API queries. Everything seems to work well.

    I've tried LibreChat before, but it's terrible at generating titles for chats, leaving them as "New Chat". It also lacks a working Code Interpreter.

    13. fragmede ◴[] No.43468733{4}[source]
    where's that?
    14. mdp2021 ◴[] No.43469350[source]
    > performs nicely

    Do you have a rough indication of tokens/s?

    15. eurekin ◴[] No.43469369[source]
    I've tried using it, but its browser tab seems to peg one core at 100% after a while. Has anyone else experienced this?
    16. indigodaddy ◴[] No.43473139[source]
    Can open-webui update code on your local computer à la Cursor etc.?
    replies(1): >>43473514 #
    17. cess11 ◴[] No.43473514[source]
    It has a module system, so maybe it can, but it seems more people use Aider or Continue for that. There's a bit of stitching things together regardless of whether you send your project to some SaaS or run local models, but if you can manage a Linux system it'll be easy.

    Personally I heavily dislike the experience though, so I might not be the best one to answer.