←back to thread

426 points benchmarkist | 1 comments | | HN request time: 0.21s | source
Show context
owenpalmer ◴[] No.42180575[source]
The fact that such a boost is possible with new hardware, I wonder what the ceiling is for improving performance for training via hardware as well.
replies(2): >>42180618 #>>42180710 #
bufferoverflow ◴[] No.42180618[source]
The ultimate solution would be to convert an LLM to a pure ASIC.

My guess is that would 10X the performance. But then it's a very very expensive solution.

replies(2): >>42180716 #>>42187711 #
1. tiagod ◴[] No.42187711[source]
There's some interesting research on using stacked flat lenses to build analog, physical neural network inference that operate directly on light (each lens is a hidden layer). If we managed to make this work for non-trivial cases, it could be absurdly fast.