The bottleneck for LLM inference is fast, large memory, not compute power.
Whoever is recommending investing in better chip (ALU) design hasn't done even a basic analysis of the problem.
Tokens per second = memory bandwidth divided by model size, because each generated token requires streaming every weight from memory.
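A quick back-of-envelope sketch of that formula. The bandwidth and model-size numbers below are illustrative assumptions (roughly H100-class HBM and a 70B-parameter model in fp16), not measurements:

```python
def tokens_per_second(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Upper bound on autoregressive decode speed for batch size 1:
    every generated token must read all model weights from memory once,
    so throughput is capped at bandwidth / model size."""
    return bandwidth_gb_s / model_size_gb

# Assumed example numbers: ~3350 GB/s memory bandwidth,
# 70B parameters * 2 bytes (fp16) = 140 GB of weights.
print(tokens_per_second(3350, 140))  # ~23.9 tokens/s ceiling
```

Note this ceiling is independent of how fast the ALUs are: doubling compute changes nothing, while doubling memory bandwidth doubles it.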