←back to thread

899 points georgehill | 4 comments | | HN request time: 0.626s | source
1. s1k3s ◴[] No.36219584[source]
I'm out of the loop on this entire thing so call me an idiot if I get it wrong. Isn't this whole movement based on a model leak from Meta? Aren't licenses involved that prevent it from going commercial?
replies(3): >>36219616 #>>36220025 #>>36221012 #
2. dimfeld ◴[] No.36219616[source]
Only the weights themselves. There have been other models since then built on the same Llama architecture, but trained from scratch so they're safe for commercial user. The GGML code and related projects (llama.cpp and so on) also support some other model types now such as Mosaic's MPT series.
3. detrites ◴[] No.36220025[source]
GGML is essentially a library of lego pieces that can be put together to work with many LLM or other types of ML models.

Meta's leaked model is one for which GGML has been applied to for fast, local inference.

4. ac29 ◴[] No.36221012[source]
It wasn't a leak, LLaMa was released publicly under an open-ish license (the code is GPL, the model weights require registration and prohibit commercial use).