←back to thread

Faster Argmin on Floats

(algorithmiker.github.io)
19 points return_to_monke | 1 comments | | HN request time: 0.204s | source
1. why_only_15 ◴[] No.45310355[source]
This trick is very useful on Nvidia GPUs for calculating mins and maxes in some cases, e.g. atomic mins (better u32 support than f32) or warp-wide mins with `redux.sync` (only supports u32, not f32).