←back to thread

S1: A $6 R1 competitor?

(timkellogg.me)
851 points tkellogg | 1 comments | | HN request time: 0.205s | source
Show context
mig1 ◴[] No.42962415[source]
This argument that the data centers and all the GPUs will be useful even in the context of Deepseek doesn't add up... basically they showed that it's diminishing returns after a certain amount. And so far it didn't make OpenAI or Anthropic go faster, did it?
replies(1): >>42962473 #
1. rayboy1995 ◴[] No.42962473[source]
What is the source for the diminishing returns? I would like to read about it as I have only seen papers referring to the scaling law still applying.