←back to thread

623 points magicalhippo | 1 comments | | HN request time: 0s | source
Show context
gnatman ◴[] No.42625457[source]
>> The IBM Roadrunner was the first supercomputer to reach one petaflop (1 quadrillion floating point operations per second, or FLOPS) on May 25, 2008.

$100M, 2.35MW, 6000 ft^2

>>Designed for AI researchers, data scientists, and students, Project Digits packs Nvidia’s new GB10 Grace Blackwell Superchip, which delivers up to a petaflop of computing performance for prototyping, fine-tuning, and running AI models.

$3000, 1kW, 0.5 ft^2

replies(1): >>42625811 #
DannyBee ◴[] No.42625811[source]
Digits is petaflops of FP4, roadrunner is petaflops of FP32. So at least a factor of 8 difference, but in practice much more. (IE I strongly doubt digits can do 1/8th petaflop of FP32)

Beyond that, the factors seem reasonable for 2 decades?

replies(2): >>42625873 #>>42627111 #
dotancohen ◴[] No.42625873[source]
Why even use a floating point if you have only 4 bits? Models with INT8 features are not unheard of.
replies(1): >>42626945 #
cjbgkagh ◴[] No.42626945[source]
1 sign and 3 exponent bits. AFAIK at the small number of bits it’s basically a teeny tiny look up table so you can precompute the table to be whatever math you want. Having an exponent instead of mantissa just means that the values that can be expressed are not linearly separated.
replies(1): >>42637416 #
1. dotancohen ◴[] No.42637416[source]
> Having an exponent instead of mantissa just means that the values that can be expressed are not linearly separated.

Being expressed as 1 10 100 1000 is not really different than being expressed as 1 2 3 4. There's still only four bits, i.e. 16 different possible values, no matter how we decide to express that in human-friendly terms.