(ggml.ai)

899 points georgehill | 1 comments | 06 Jun 23 16:50 UTC

samwillis ◴[06 Jun 23 17:28 UTC] No.36216196[source]▶

ggml and llama.cpp are such a good platform for local LLMs; having some financial backing to support development is brilliant. We should be concentrating as much as possible on local inference (and training) based on private data.

I want a local ChatGPT fine-tuned on my personal data running on my own device, not in the cloud. Ideally open source too; llama.cpp is looking like the best bet to achieve that!

replies(6): >>36216377 #>>36216465 #>>36216508 #>>36217604 #>>36217847 #>>36221973 #

behnamoh ◴[06 Jun 23 17:47 UTC] No.36216508[source]▶

>>36216196 #

I wonder if ClosedAI and other companies use the findings of the open source community in their products. For example, do they use QLoRA to reduce the costs of training and inference? Do they quantize their models to serve non-subscribing consumers?

replies(2): >>36216688 #>>36217149 #

1. jmoss20 ◴[06 Jun 23 18:32 UTC] No.36217149[source]▶

>>36216508 #

Quantization is hardly a "finding of the open source community". (IIRC the first TPU was int8! Though the tradition is much older than that.)
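
For anyone unfamiliar with the term, here is a minimal sketch of what int8 quantization means in its simplest form: symmetric, per-tensor scaling of floats to integers in [-127, 127]. This is only an illustration. The function names are made up for this example, and it is not the scheme used by ggml, llama.cpp, QLoRA, or the TPU, all of which use their own formats (typically block-wise or hardware-specific).

```python
def quantize_int8(values):
    """Symmetric per-tensor quantization (illustrative, hypothetical helper).

    Maps each float to an integer in [-127, 127] using one shared scale.
    """
    max_abs = max(abs(v) for v in values)
    # Guard against an all-zero input so we never divide by zero.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def dequantize_int8(quantized, scale):
    """Recover approximate floats from the integers and the shared scale."""
    return [q * scale for q in quantized]


weights = [0.5, -1.27, 0.0, 1.0]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)
```

Each restored value differs from the original by at most half the scale, which is the basic trade being made: 8 bits per weight instead of 16 or 32, in exchange for a bounded rounding error.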

GGML – AI at the Edge