The benchmark is matrix multiplication with the shapes `(6, 1500, 256) x (6, 256, 1500)`, which just aren't that big in the AI world. I think the gap would be larger with much larger matrices.
E.g. Llama 3.1 8B, which is one of the smaller models, has matrix multiplications like `(batch, 14336, 4096) x (batch, 4096, 14336)`.
I just don't think this benchmark is realistic enough.
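For what it's worth, here's a rough sketch of how one could time both regimes and compare achieved throughput. I'm using PyTorch purely for illustration (the original benchmark may use something else entirely), and the `batch=1` on the large case is my own choice to keep memory reasonable:

```python
# Illustrative sketch, not the original benchmark: compare a small batched
# matmul against an LLM-sized one and report achieved throughput.
import time
import torch

def bench(a_shape, b_shape, n_iters=10):
    device = "cuda" if torch.cuda.is_available() else "cpu"
    a = torch.randn(*a_shape, device=device)
    b = torch.randn(*b_shape, device=device)
    torch.matmul(a, b)  # warm-up to exclude one-time setup costs
    if device == "cuda":
        torch.cuda.synchronize()
    t0 = time.perf_counter()
    for _ in range(n_iters):
        torch.matmul(a, b)
    if device == "cuda":
        torch.cuda.synchronize()
    elapsed = (time.perf_counter() - t0) / n_iters
    # FLOPs for a batched (B, M, K) x (B, K, N) matmul: 2 * B * M * K * N
    flops = 2 * a_shape[0] * a_shape[-2] * a_shape[-1] * b_shape[-1]
    print(f"{a_shape} x {b_shape}: {elapsed * 1e3:.2f} ms, "
          f"{flops / elapsed / 1e12:.2f} TFLOP/s")

bench((6, 1500, 256), (6, 256, 1500))      # the benchmark's shapes
bench((1, 14336, 4096), (1, 4096, 14336))  # closer to LLM-sized GEMMs
```

The small case does about 1.2 GFLOP per call while the large one does about 1.7 TFLOP, so the large case is where differences in kernel quality and hardware utilization would actually show up.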