AI PCs Aren't Good at AI: The CPU Beats the NPU

(github.com)

488 points dbreunig | 1 comments | 16 Oct 24 19:44 UTC

eightysixfour ◴[16 Oct 24 20:32 UTC] No.41863546[source]▶

I thought the purpose of these things was not to be fast, but to be able to run small models with very little power usage? I have a newer AMD laptop with an NPU, and my power usage doesn't change using the video effects that supposedly run on it, but goes up when using the nvidia studio effects.

It seems like the NPUs are for very optimized models that do small tasks, like eye contact, background blur, autocorrect models, transcription, and OCR. In particular, on Windows, I assumed they were running the full screen OCR (and maybe embeddings for search) for the Recall feature.

replies(7): >>41863632 #>>41863779 #>>41863821 #>>41863886 #>>41864628 #>>41864828 #>>41869772 #

boomskats ◴[16 Oct 24 20:56 UTC] No.41863779[source]▶

>>41863546 #

That's especially true because yours is a Xilinx FPGA. The one that they just attached to the latest gen mobile ryzens is 5x more capable too.

AMD are doing some fantastic work at the moment, they just don't seem to be shouting about it. This one is particularly interesting https://lore.kernel.org/lkml/DM6PR12MB3993D5ECA50B27682AEBE1...

edit: not an FPGA. TIL. :'(

replies(5): >>41863852 #>>41863876 #>>41864048 #>>41864435 #>>41865733 #

pclmulqdq ◴[16 Oct 24 21:32 UTC] No.41864048[source]▶

>>41863779 #

It's not an FPGA. It's a VLIW DSP that Xilinx built to go into an FPGA-SoC to help run ML models.

replies(1): >>41864242 #

almostgotcaught ◴[16 Oct 24 21:56 UTC] No.41864242[source]▶

>>41864048 #

this is the correct answer. one of the compilers for this DSP is https://github.com/Xilinx/llvm-aie.
