(www.phoronix.com)

1045 points mfiguiere | 5 comments | 12 Feb 24 14:00 UTC

1. sam_goody ◴[12 Feb 24 14:34 UTC] No.39345188[source]▶

I don't really follow this, but isn't it a bad sign for ROCm that, for example, ZLUDA + Blender 4's CUDA back-end delivers better performance than the native Radeon HIP back-end?

replies(4): >>39345214 #>>39345404 #>>39345464 #>>39345623 #

2. fariszr ◴[12 Feb 24 14:37 UTC] No.39345214[source]▶

>>39345188 (TP) #

It really shows how neglected their software stack is, or at least how neglected this implementation is.

3. whizzter ◴[12 Feb 24 14:54 UTC] No.39345404[source]▶

>>39345188 (TP) #

Could be that the CUDA backend has seen far more specialized optimization, whereas the seemingly fairly fresh HIP backend hasn't had as many developers looking at it. In the end, a few more control instructions on the CPU side to go through the ZLUDA wrapper will be insignificant compared to all the time spent inside better-optimized GPU kernels.

4. KeplerBoy ◴[12 Feb 24 14:59 UTC] No.39345464[source]▶

>>39345188 (TP) #

Surely this can be attributed to Blender's HIP code just being suboptimal because nobody really cares about it. By extension, nobody cares about it because performance is suboptimal.

It's AMD's job to break that circle.

5. mdre ◴[12 Feb 24 15:11 UTC] No.39345623[source]▶

>>39345188 (TP) #

I'd say it's even worse, since for rendering OptiX is like 30% faster than CUDA. But that requires the tensor cores. At this point AMD is waaay behind hardware-wise.


AMD funded a drop-in CUDA implementation built on ROCm: It's now open-source