
235 points by tosh | 1 comment
xanderlewis:
> Stripped of anything else, neural networks are compositions of differentiable primitives

I’m a sucker for statements like this. It almost feels philosophical, and makes the whole subject so much more comprehensible in only a single sentence.

I think François Chollet says something similar in his book on deep learning: one shouldn’t fall into the trap of anthropomorphising and mystifying models based on the ‘neural’ name; deep learning is simply the application of sequences of operations that are nonlinear (and hence capable of representing arbitrarily complex functions) but nonetheless differentiable and so efficiently optimisable.
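
To make that concrete, here is a minimal sketch (a toy one-hidden-layer regression network in NumPy; the shapes, tanh activation, and learning rate are purely illustrative): the forward pass is nothing but a composition of differentiable primitives, and the chain rule through those primitives supplies every gradient that optimisation needs.

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(64, 3))                   # toy inputs
    y = (X @ np.array([1.0, -2.0, 0.5]))[:, None]  # toy regression target

    W1, b1 = rng.normal(size=(3, 8)) * 0.1, np.zeros(8)
    W2, b2 = rng.normal(size=(8, 1)) * 0.1, np.zeros(1)

    for step in range(200):
        # forward: affine -> tanh (nonlinear, differentiable) -> affine
        h = np.tanh(X @ W1 + b1)
        pred = h @ W2 + b2
        loss = np.mean((pred - y) ** 2)

        # backward: chain rule applied through each primitive in turn
        d_pred = 2 * (pred - y) / len(X)
        dW2, db2 = h.T @ d_pred, d_pred.sum(0)
        d_h = d_pred @ W2.T
        d_pre = d_h * (1 - h ** 2)                 # tanh'(z) = 1 - tanh(z)^2
        dW1, db1 = X.T @ d_pre, d_pre.sum(0)

        # gradient descent on every parameter
        for p, g in [(W1, dW1), (b1, db1), (W2, dW2), (b2, db2)]:
            p -= 0.1 * g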

jxy:
> > Stripped of anything else, neural networks are compositions of differentiable primitives

> I’m a sucker for statements like this. It almost feels philosophical, and makes the whole subject so much more comprehensible in only a single sentence.

And I hate inaccurate statements like this. It pretends to be mathematically rigorous, but really just propagates erroneous information, and makes the whole article so much more amateurish in only a single sentence.

The simple ReLU is continuous but not differentiable at 0, and its derivative (where it exists) has a jump discontinuity at 0.
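
Easy to see numerically (illustration only; autograd libraries typically just pick a subgradient, commonly 0, at that single point):

    # ReLU's one-sided derivatives at 0 disagree, so no derivative exists there
    relu = lambda x: max(x, 0.0)

    eps = 1e-6
    left  = (relu(0.0) - relu(-eps)) / eps   # -> 0.0
    right = (relu(eps) - relu(0.0)) / eps    # -> 1.0
    print(left, right)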

kmmlng:
Eh, it really doesn't matter much in practice: autograd frameworks simply assign a value (conventionally 0) to the gradient at that one point. Additionally, there are many other activation functions without this issue.
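
For example, softplus is a common smooth stand-in for ReLU whose derivative (the sigmoid) exists at every point. A small sketch, assuming moderate inputs (the naive log1p/exp form below isn't numerically careful for large |x|):

    import numpy as np

    def softplus(x):
        # log(1 + exp(x)): smooth everywhere, approaches ReLU for large |x|
        return np.log1p(np.exp(x))

    def softplus_grad(x):
        # derivative is the sigmoid, defined at every x (including 0)
        return 1.0 / (1.0 + np.exp(-x))

    xs = np.array([-2.0, 0.0, 2.0])
    print(softplus(xs), softplus_grad(xs))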