
1045 points mfiguiere | 6 comments
1. codedokode ◴[] No.39351226[source]
As I understand it, Vulkan allows running custom code on the GPU, including code to multiply matrices. Can one simply use Vulkan and ignore CUDA, PyTorch, and ROCm?
replies(5): >>39352478 #>>39353773 #>>39354188 #>>39354540 #>>39355589 #
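For context, the "custom code" in question boils down to a kernel that computes one output element per GPU thread. A minimal sketch of that core operation, in plain Python rather than an actual Vulkan/GLSL compute shader (the nested loops stand in for the GPU's parallel invocation grid; all names are illustrative):

```python
# Hedged sketch: the computation a GPU matmul kernel performs.
# In a Vulkan compute shader, each invocation would compute a single
# C[i][j]; here the loops over i and j play that role.

def matmul(a, b):
    """Multiply matrix a (m x k) by b (k x n), returning an m x n result."""
    m, k, n = len(a), len(b), len(b[0])
    return [[sum(a[i][p] * b[p][j] for p in range(k)) for j in range(n)]
            for i in range(m)]

a = [[1, 2], [3, 4]]
b = [[5, 6], [7, 8]]
print(matmul(a, b))  # → [[19, 22], [43, 50]]
```

A real Vulkan implementation would upload `a` and `b` as storage buffers, dispatch a compute shader over the output grid, and read back `C`; the arithmetic per element is the same.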
2. Const-me ◴[] No.39352478[source]
I did that a few times with Direct3D 11 compute shaders. Here’s an open-source example: https://github.com/Const-me/Cgml

Pretty sure Vulkan would work equally well; at the very least, there’s the open-source DXVK project, which implements D3D11 on top of Vulkan.

3. eddiewithzato ◴[] No.39353773[source]
Of course, but then you are just recreating CUDA. And that won’t scale well across an industry, since each company would have its own language. AMD could just do what you are describing and then sell it as a standard.

I mean, they literally did that, but then dropped it, so yeah.

4. sorenjan ◴[] No.39354188[source]
ncnn uses Vulkan for GPU acceleration; I've seen it used in a few projects to get AMD hardware support.

https://github.com/Tencent/ncnn

5. 0xDEADFED5 ◴[] No.39354540[source]
There's a pretty cool Vulkan LLM engine here, for example:

https://github.com/mlc-ai/mlc-llm

6. PeterisP ◴[] No.39355589[source]
You probably can, but why would you? The main (only?) reason to ignore the CUDA-based stack is to save money by using hardware other than Nvidia's. So the amount of engineering labor and cost you should be willing to accept is directly tied to how much hardware you intend to buy or rent, and to what discount, if any, the alternative hardware offers compared to Nvidia.

So if you want to ignore CUDA+PyTorch and reimplement everything you need on top of Vulkan... well, that becomes worth discussing only if you expect to spend a lot on hardware, and if you really believe the hardware savings can recoup many engineer-years of cost; otherwise it's more effective to just go with the flow.
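The break-even reasoning above is simple arithmetic. A hedged back-of-envelope sketch, with all numbers made up purely for illustration:

```python
# Illustrative only: at what hardware spend does a discount on non-Nvidia
# hardware pay for the engineering cost of porting off the CUDA stack?

def break_even_spend(engineering_cost, discount_fraction):
    """Hardware budget at which the discount recoups the porting effort."""
    return engineering_cost / discount_fraction

# Hypothetical: 3 engineer-years at $300k each, 20% cheaper alternative hardware.
spend = break_even_spend(3 * 300_000, 0.20)
print(spend)  # → 4500000.0
```

Below that spend, the engineering effort costs more than the discount saves; well above it, reimplementing on Vulkan starts to look rational.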