How fast if you write a for loop and keep track of the index and value of the smallest (possibly treating them as ints)?
replies(1):
I wonder could that be made faster by using AVX instructions; they allow to find the minimum value among several u32 values, but not immediately its index.