/top/
/new/
/best/
/ask/
/show/
/job/
^
slacker news
login
about
←back to thread
Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference
(cerebras.ai)
426 points
benchmarkist
| 1 comments |
19 Nov 24 00:15 UTC
|
HN request time: 0.211s
|
source
1.
jadbox
◴[
19 Nov 24 02:04 UTC
]
No.
42179444
[source]
▶
>>42178761 (OP)
#
Not open beta until Q1 2025
ID:
GO
↑