LLM Visualization

(bbycroft.net)
638 points gmays | 1 comments
southp ◴[] No.45136237[source]
It's fascinating, even though my knowledge of LLMs is so limited that I don't really understand what's happening. I'm curious how the examples are plotted and how closely they resemble real models, though. If one day we could reliably decompose an LLM into modules like this using an algorithm, would that mean we could turn LLMs into chips rather than data centers?
replies(5): >>45136340 #>>45136985 #>>45136988 #>>45137239 #>>45166151 #
1. yapyap ◴[] No.45136985[source]
As I understand it, data centers are mostly for two things: scaling, so that many people can use an LLM service at the same time, and training, so that computing a new LLM's weights doesn't take months or years because of GPU constraints.

It's already possible to run an LLM on chips, depending of course on the LLM and the chip.
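
To put rough numbers on "depending on the LLM and the chip", here is a back-of-the-envelope sketch. It only counts memory for the weights (parameters times bytes per parameter) and ignores activations, the KV cache, and runtime overhead, so treat it as a lower bound. The 7-billion-parameter model and the 8 GiB device are made-up illustrative figures, not anything from the linked visualization.

```python
def weights_gib(n_params: float, bits_per_param: int) -> float:
    """Approximate memory needed for the weights alone, in GiB."""
    return n_params * bits_per_param / 8 / 2**30


def fits(n_params: float, bits_per_param: int, device_gib: float) -> bool:
    """True if the weights alone fit in the device's memory (lower bound only)."""
    return weights_gib(n_params, bits_per_param) <= device_gib


if __name__ == "__main__":
    n_params = 7e9      # hypothetical model size
    device_gib = 8.0    # hypothetical device memory
    for bits in (16, 8, 4):
        size = weights_gib(n_params, bits)
        verdict = "fits" if fits(n_params, bits, device_gib) else "does not fit"
        print(f"{bits}-bit weights: {size:.1f} GiB, {verdict} in {device_gib:.0f} GiB")
```

Under these assumptions the same model is too big for the device at 16 bits per weight but fits once quantized to 8 or 4 bits, which is roughly why the answer depends on both the LLM and the chip.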