(yosefk.com)

138 points shipp02 | 1 comment | 10 Jun 24 06:05 UTC

1. mkoubaa [11 Jun 24 19:21 UTC] No.40650353

This type of parallelism is sort of like a FLOPS metric: it describes what the hardware can do, not how much of it you actually use. Maximizing the fraction of wall time the GPU spends doing computation is just as important, if not more so. CUDA and Vulkan have synchronization and pipelining tools for this, but they are scary at first glance.
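To put rough numbers on the wall-time point, here is a back-of-the-envelope sketch in Python (a timing model, not CUDA or Vulkan code). The helper names `serial_wall_time`, `pipelined_wall_time`, and `utilization` are hypothetical, the per-chunk times are made up, and it assumes host-to-device copies and kernels can overlap perfectly in a two-stage pipeline.

```python
def serial_wall_time(n_chunks, copy_ms, compute_ms):
    """Each chunk is copied, then computed, with no overlap."""
    return n_chunks * (copy_ms + compute_ms)


def pipelined_wall_time(n_chunks, copy_ms, compute_ms):
    """Copy of chunk i+1 overlaps with compute of chunk i (idealized).

    The first copy and the last compute cannot be hidden; in between,
    the slower of the two stages sets the pace.
    """
    if n_chunks == 0:
        return 0.0
    return copy_ms + (n_chunks - 1) * max(copy_ms, compute_ms) + compute_ms


def utilization(n_chunks, compute_ms, wall_ms):
    """Fraction of wall time the GPU spends actually computing."""
    return n_chunks * compute_ms / wall_ms


if __name__ == "__main__":
    n, copy_ms, compute_ms = 8, 2.0, 3.0  # made-up illustrative numbers
    for name, wall in [
        ("serial", serial_wall_time(n, copy_ms, compute_ms)),
        ("pipelined", pipelined_wall_time(n, copy_ms, compute_ms)),
    ]:
        util = utilization(n, compute_ms, wall)
        print(f"{name}: wall={wall:.1f} ms, GPU busy {util:.0%}")
```

The total compute work is identical in both cases; only the idle time changes. In CUDA, this kind of overlap is usually arranged with streams and asynchronous copies, which is where the scary-looking synchronization comes in.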

SIMD < SIMT < SMT: Parallelism in Nvidia GPUs (2011)