←back to thread

548 points nsagent | 2 comments | | HN request time: 0.507s | source
Show context
benreesman ◴[] No.44566742[source]
I wonder how much this is a result of Strix Halo. I had a fairly standard stipend for a work computer that I didn't end up using for a while so I recently cashed it in on the EVO-X2 and fuck me sideways: that thing is easily competitive with the mid-range znver5 EPYC machines I run substitors on. It mops the floor with any mere-mortal EC2 or GCE instance, like maybe some r1337.xxxxlarge.metal.metal or something has an edge, but the z1d.metal and the c6.2xlarge or whatever type stuff (fast cores, good NIC, table stakes), blows them away. And those things are 3-10K a month with heavy provisioned IOPS. This thing has real NVME and it cost 1800.

I haven't done much local inference on it, but various YouTubers are starting to call the DGX Spark overkill / overpriced next to Strix Halo. The catch of course is ROCm isn't there yet (they're seeming serious now though, matter of time).

Flawless CUDA on Apple gear would make it really tempting in a way that isn't true with Strix so cheap and good.

replies(6): >>44566825 #>>44566885 #>>44566921 #>>44567049 #>>44569265 #>>44570399 #
1. jitl ◴[] No.44566825[source]
It’s pretty explicitly targeting cloud cluster training in the PR description.
replies(1): >>44567289 #
2. ivape ◴[] No.44567289[source]
If we believe that there’s not enough hardware to meet demand, then one could argue this helps Apple meet demand, even if it’s just by a few percentage points.