Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference
(cerebras.ai)
426 points
benchmarkist
| 1 comment |
19 Nov 24 00:15 UTC
1.
adhambadr
◴[
19 Nov 24 09:34 UTC
]
No.
42181543
>>42178761 (OP)
Is it just me, or is Groq, the most important contender on speed, missing from the comparison? Not sure why it matters to put Azure in there; no one uses it for speed.