←back to thread

486 points dbreunig | 1 comments | | HN request time: 0s | source
Show context
protastus ◴[] No.41863883[source]
Deploying a model on an NPU requires significant profile based optimization. Picking up a model that works fine on the CPU but hasn't been optimized for an NPU usually leads to disappointing results.
replies(2): >>41864613 #>>41864649 #
1. catgary ◴[] No.41864613[source]
Yeah whenever I’ve spoken to people who work on stuff like IREE or OpenXLA they gave me the impression that understanding how to use those compilers/runtimes is an entire job.