
544 points tosh | 1 comments
simonw ◴[] No.43464227[source]
Big day for open source Chinese model releases - DeepSeek-v3-0324 came out today too, an updated version of DeepSeek v3 now under an MIT license (previously it was a custom DeepSeek license). https://simonwillison.net/2025/Mar/24/deepseek/
replies(5): >>43464375 #>>43464498 #>>43464686 #>>43465383 #>>43467111 #
chaosprint ◴[] No.43464375[source]
it seems that this free version "may use your prompts and completions to train new models"

https://openrouter.ai/deepseek/deepseek-chat-v3-0324:free

do you think this needs attention?

replies(7): >>43464399 #>>43464480 #>>43464512 #>>43464616 #>>43464961 #>>43468548 #>>43470210 #
huijzer ◴[] No.43464512[source]
Since we are on HN here, I can highly recommend open-webui with some OpenAI-compatible provider. I've been running it with Deep Infra for more than a year now and am very happy. New models are usually available within one or two days after release. I also have some friends who use the service almost daily.
replies(7): >>43464718 #>>43465081 #>>43466430 #>>43466464 #>>43466949 #>>43469369 #>>43473139 #
unquietwiki ◴[] No.43464718[source]
I'm using open-webui at home with a couple of different models. gemma2-9b fits in VRAM on an NV 3060 card + performs nicely.
replies(2): >>43465594 #>>43469350 #
1. mdp2021 ◴[] No.43469350[source]
> performs nicely

Do you have a rough indication of tokens/s?
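
For anyone who wants to measure this themselves, here is a minimal sketch of the arithmetic: tokens generated divided by wall-clock seconds. The function names and the `stream` argument are hypothetical and not part of open-webui or any provider API; `stream` is assumed to be any iterable that yields one token per item.

```python
import time
from typing import Iterable


def tokens_per_second(token_count: int, elapsed_seconds: float) -> float:
    """Return throughput as generated tokens divided by elapsed seconds."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return token_count / elapsed_seconds


def measure_stream(stream: Iterable[str]) -> float:
    """Consume a token stream and return the observed tokens/s.

    `stream` is a hypothetical iterable yielding one token per item;
    how you obtain it depends on your own setup.
    """
    start = time.perf_counter()
    count = sum(1 for _ in stream)  # count tokens while draining the stream
    elapsed = time.perf_counter() - start
    return tokens_per_second(count, elapsed)
```

For example, 256 tokens generated in 8 seconds works out to `tokens_per_second(256, 8.0)`, i.e. 32 tokens/s. Note that timing the whole response mixes prompt processing with generation, so it is only a rough figure.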