←back to thread

486 points dbreunig | 2 comments | | HN request time: 0.515s | source
1. _davide_ ◴[] No.41866184[source]
The RTX 4080 should be capable of ~40 TFLOPS, yet they only report 2,160 billion operations per second. Shouldn't this be enough to reconsider the benchmark? They probably made some serious error in measuring FLOPS. Regarding the fact that CPU beats NPU is possible but they should benchmark many matrix multiplications without any application synchronization in order to have a decent comparison.
replies(1): >>41866435 #
2. Grimblewald ◴[] No.41866435[source]
That isnt the half of it. A quick skim of the documentation shows that the cpu inference wasnt done in a comparable way either.