←back to thread

507 points martinald | 1 comments | | HN request time: 0.236s | source
Show context
sc68cal ◴[] No.45053212[source]
This whole article is built off using DeepSeek R1, which is a huge premise that I don't think is correct. DeepSeek is much more efficient and I don't think it's a valid way to estimate what OpenAI and Anthropic's costs are.

https://www.wheresyoured.at/deep-impact/

Basically, DeepSeek is _very_ efficient at inference, and that was the whole reason why it shook the industry when it was released.

replies(7): >>45053283 #>>45053303 #>>45053401 #>>45053455 #>>45053507 #>>45053923 #>>45054034 #
CjHuber ◴[] No.45053401[source]
The reason it shook the market at least was because of the claim that its training cost was 5 million.
replies(2): >>45053502 #>>45055728 #
1. vitaflo ◴[] No.45055728[source]
Also the fact that it cost 10% of what other models cost. Pretty much still does.