←back to thread

837 points turrini | 1 comments | | HN request time: 0.209s | source
1. Scene_Cast2 ◴[] No.43973301[source]
Where lack of performance costs money, optimization is quite invested in. See PyTorch (Inductor CUDA graphs), Triton, FlashAttention, Jax, etc.