←back to thread

Basic Facts about GPUs

(damek.github.io)
338 points ibobev | 1 comments | | HN request time: 0.203s | source
1. bjornsing ◴[] No.44373852[source]
So how are we doing with whole program optimization on the compiler level? Feels kind of backwards that people are optimizing these LLM architectures, one at a time.