/top/
/new/
/best/
/ask/
/show/
/job/
^
slacker news
login
about
←back to thread
Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference
(cerebras.ai)
426 points
benchmarkist
| 2 comments |
19 Nov 24 00:15 UTC
|
HN request time: 0.416s
|
source
1.
gdiamos
◴[
19 Nov 24 03:25 UTC
]
No.
42179843
[source]
▶
>>42178761 (OP)
#
I'm so curious to see some multi-agent systems running with inference this fast.
replies(1):
>>42179877
#
ID:
GO
2.
ipsum2
◴[
19 Nov 24 03:31 UTC
]
No.
42179877
[source]
▶
>>42179843 (TP)
#
There's no good open source agent models at the moment unfortunately.
↑