←back to thread

426 points benchmarkist | 3 comments | | HN request time: 0.621s | source
1. gorkempacaci ◴[] No.42181139[source]
nvidia hates this one little trick
replies(1): >>42181177 #
2. zurfer ◴[] No.42181177[source]
I laughed and upvoted, but if anything I bet they put their best people on it to replicate this offering.

What I take away from this is: we are just getting started. I remember in 2023 begging OpenAI to give us more than 7 tokens/second on GPT-4.

replies(1): >>42191224 #
3. ryao ◴[] No.42191224[source]
Nvidia’s target is performance across concurrent users and they are likely already outperforming Cerebras there as far as costs are concerned. They have no reason to try to beat the single user performance of this.