(bbycroft.net)

638 points gmays | 1 comments | 04 Sep 25 18:06 UTC | HN request time: 0.223s | source

Show context

southp ◴[05 Sep 25 08:20 UTC] No.45136237[source]▶

It's fascinating, even though my knowledge to LLM is so limited that I don't really understand what's happening. I'm curious how the examples are plotted and how much resemblance they are to the real models, though. If one day we could reliably plot a LLM into modules like this using an algorithm, does that mean we would be able to turn LLMs into chips, rather than data centers?

replies(5): >>45136340 #>>45136985 #>>45136988 #>>45137239 #>>45166151 #

1. yapyap ◴[05 Sep 25 10:23 UTC] No.45136985[source]▶

>>45136237 #

in my understanding the data centers are mostly for scaling so that many people can use an LLM service at a time and training so that training a new LLM’s weights won’t take months to years because of GPU constraints.

Its already possible to run an LLM off chips, of course depending on the LLM and the chip.

↑

LLM Visualization