A quick search shows that support for this Ryzen AI NPU isn't integrated into upstream inference frameworks yet, so right now it's just dead silicon area you're paying for :-/
replies(3):
But regardless, 16 TOPS is no good for LLMs. There is a Ryzen AI demo that shows Llama 7B running on these at ~8 tokens/sec, though. A sub-par experience for a sub-par LLM.
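For context, a rough back-of-envelope on where a number like 8 tok/s comes from (my own assumed figures, not anything AMD published): single-stream decoding has to read every weight once per generated token, so it's usually capped by memory bandwidth rather than by TOPS. A minimal sketch, assuming an int8-quantized 7B model and typical laptop DDR5 bandwidth:

    # Back-of-envelope, all numbers are assumptions for illustration.
    params = 7e9            # Llama 7B parameter count
    bytes_per_param = 1     # assume int8 quantization
    weight_bytes = params * bytes_per_param

    mem_bw = 60e9           # assumed laptop DDR5 bandwidth, bytes/sec

    # Decode ceiling: every weight read once per token.
    tokens_per_sec_bw = mem_bw / weight_bytes
    print(f"bandwidth-bound ceiling: ~{tokens_per_sec_bw:.1f} tok/s")   # ~8.6

    # Compute ceiling: ~2 ops (multiply + add) per parameter per token.
    ops_per_token = 2 * params
    npu_ops = 16e12         # the advertised 16 TOPS
    tokens_per_sec_compute = npu_ops / ops_per_token
    print(f"compute-bound ceiling: ~{tokens_per_sec_compute:.0f} tok/s")  # ~1143

Under these assumptions the ~8 tok/s lines up with the memory-bandwidth ceiling, not the 16 TOPS one; the compute figure bites more during prompt prefill, which is compute-bound. Either way the end result is the sub-par experience described above.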