(github.com)

488 points dbreunig | 1 comments | 16 Oct 24 19:44 UTC | HN request time: 0.389s | source

Show context

protastus ◴[16 Oct 24 21:10 UTC] No.41863883[source]▶

Deploying a model on an NPU requires significant profile based optimization. Picking up a model that works fine on the CPU but hasn't been optimized for an NPU usually leads to disappointing results.

replies(2): >>41864613 #>>41864649 #

1. catgary ◴[16 Oct 24 22:43 UTC] No.41864613[source]▶

>>41863883 #

Yeah whenever I’ve spoken to people who work on stuff like IREE or OpenXLA they gave me the impression that understanding how to use those compilers/runtimes is an entire job.

↑

AI PCs Aren't Good at AI: The CPU Beats the NPU