(damek.github.io)

338 points ibobev | 1 comments | 24 Jun 25 12:15 UTC | HN request time: 0.203s | source

1. bjornsing ◴[25 Jun 25 05:20 UTC] No.44373852[source]▶

So how are we doing with whole program optimization on the compiler level? Feels kind of backwards that people are optimizing these LLM architectures, one at a time.

↑

Basic Facts about GPUs