
352 points | ferriswil | 1 comment
remexre ◴[] No.41889747[source]
Isn't this just taking advantage of "log(x) + log(y) = log(xy)"? The IEEE754 floating-point representation stores floats as sign, mantissa, and exponent -- ignore the first two (you quantized anyway, right?), and the exponent is just an integer storing log2() of the float.
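A minimal sketch of that bit-level observation (illustrative only, not the paper's algorithm; the helper names are mine, and it assumes positive, normal float32 inputs):

```python
# The raw bit pattern of a positive normal float32 x = (1+a) * 2^e is
# (e + 127) * 2^23 + a * 2^23, i.e. roughly 2^23 * (log2(x) + 127).
# So adding two bit patterns and subtracting the bias once approximates
# multiplication.
import struct

def float_to_bits(x: float) -> int:
    return struct.unpack('<I', struct.pack('<f', x))[0]

def bits_to_float(b: int) -> float:
    return struct.unpack('<f', struct.pack('<I', b & 0xFFFFFFFF))[0]

BIAS = 127 << 23  # exponent bias 127, shifted past the 23 mantissa bits

def approx_mul(x: float, y: float) -> float:
    # Valid only for positive, normal floats; the error comes from the
    # mantissa fields adding instead of multiplying (see the reply below).
    return bits_to_float(float_to_bits(x) + float_to_bits(y) - BIAS)

print(approx_mul(3.0, 5.0))  # prints 14.0 instead of 15.0 (~7% off)
```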
replies(2): >>41889800 #>>41890236 #
mota7 ◴[] No.41890236[source]
Not quite: It's taking advantage of (1+a)(1+b) = 1 + a + b + ab. And where a and b are both small-ish, ab is really small and can just be ignored.

So it turns (1+a)(1+b) into 1+a+b, which is definitely not the same! But it turns out machine guessing apparently doesn't care much about the difference.
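For a feel of how much the dropped ab term costs, a quick illustrative check (toy numbers, not from the paper):

```python
# Compare (1+a)(1+b) with 1+a+b for mantissa fractions a, b in [0, 1).
for a, b in [(0.1, 0.2), (0.5, 0.25), (0.9, 0.9)]:
    exact = (1 + a) * (1 + b)
    approx = 1 + a + b
    print(f"a={a}, b={b}: exact={exact:.4f} approx={approx:.4f} "
          f"rel_err={(exact - approx) / exact:.1%}")
# Prints relative errors of about 1.5%, 6.7%, and 22.4%.
# The worst case is a = b -> 1, where the error approaches 25%;
# averaged over random mantissas it is much smaller.
```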

replies(3): >>41890382 #>>41890513 #>>41892121 #
tommiegannert ◴[] No.41890513[source]
Plus the 2^-l(m) correction term.
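As I read the paper, that correction replaces the dropped a*b with a constant 2^-l(m) picked per mantissa bit-width m. A hedged sketch (my paraphrase; the piecewise l(m) below is from memory and the function names are mine):

```python
def l(m: int) -> int:
    # Offset exponent chosen from the mantissa bit-width m
    # (my recollection of the paper's definition).
    if m <= 3:
        return m
    if m == 4:
        return 3
    return 4

def lmul_mantissa(a: float, b: float, m: int = 23) -> float:
    # (1+a)(1+b) ~ 1 + a + b + 2^-l(m): the constant roughly absorbs
    # the dropped a*b term on average instead of ignoring it outright.
    return 1.0 + a + b + 2.0 ** -l(m)

print(lmul_mantissa(0.5, 0.25))  # 1.8125 vs exact 1.875 (plain 1+a+b gives 1.75)
```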

Feels like multiplication shouldn't be needed for convergence, just monotonicity? I wonder how well it would perform if the model was actually trained the same way.