(ggml.ai)

899 points georgehill | 3 comments | 06 Jun 23 16:50 UTC | HN request time: 0.407s | source

Show context

samwillis ◴[06 Jun 23 17:28 UTC] No.36216196[source]▶

ggml and llama.cpp are such a good platform for local LLMs, having some financial backing to support development is brilliant. We should be concentrating as much as possible to do local inference (and training) based on privet data.

I want a local ChatGPT fine tuned on my personal data running on my own device, not in the cloud. Ideally open source too, llama.cpp is looking like the best bet to achieve that!

replies(6): >>36216377 #>>36216465 #>>36216508 #>>36217604 #>>36217847 #>>36221973 #

brucethemoose2 ◴[06 Jun 23 17:39 UTC] No.36216377[source]▶

>>36216196 #

If MeZO gets implemented, we are basically there: https://github.com/princeton-nlp/MeZO

replies(1): >>36216988 #

moffkalast ◴[06 Jun 23 18:20 UTC] No.36216988[source]▶

>>36216377 #

Basically there, with what kind of VRAM and processing requirements? I doubt anyone running on a CPU can fine tune in a time frame that doesn't give them an obsolete model when they're done.

replies(1): >>36217136 #

nl ◴[06 Jun 23 18:31 UTC] No.36217136[source]▶

>>36216988 #

According to the paper it fine tunes at the speed of inference (!!)

This would make fine tuning a qantized 13B model achievable in ~0.3 seconds per training example on a CPU.

replies(6): >>36217261 #>>36217324 #>>36217354 #>>36217827 #>>36218026 #>>36218841 #

1. isoprophlex ◴[06 Jun 23 19:22 UTC] No.36217827[source]▶

>>36217136 #

If you go through the drudgery of integrating with all the existing channels (mail, Teams, discord, slack, traditional social media, texts, ...), such rapid finetuning speeds could enable an always up to date personality construct, modeled on you.

Which is my personal holy grail towards making myself unnecessary; it'd be amazing to be doing some light gardening while the bot handles my coworkers ;)

replies(2): >>36217987 #>>36221420 #

2. ◴[06 Jun 23 19:34 UTC] No.36217987[source]▶

>>36217827 (TP) #

3. vgb2k18 ◴[07 Jun 23 01:05 UTC] No.36221420[source]▶

>>36217827 (TP) #

> while the bot handles my coworkers

Or it handles their bots ;)

↑

GGML – AI at the Edge