←back to thread

235 points tosh | 3 comments | | HN request time: 0.484s | source
1. p1esk ◴[] No.40215053[source]
And then you learn about binary or ternary networks where gradients don’t really exist anywhere, and you start to wonder about the importance of this differentiability.
replies(2): >>40215343 #>>40216262 #
2. whimsicalism ◴[] No.40215343[source]
binary networks don't really work well unless you do a relaxation first
3. ubj ◴[] No.40216262[source]
...And then you start learning about generalizations of the notion of "gradient" to scenarios where the classical gradient doesn't exist :)