How does LLaMA's performance compare to ChatGPT? Is it as good as GPT-3, or even GPT-4?
It is about as good as GPT-3 at most sizes. An instruct layer needs to be put on top for it to compete with GPT-3.5 (which powers ChatGPT). That can be done with a comparatively small amount of compute (a couple hundred bucks' worth for the small models; I'd assume low thousands for 65B).
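For a rough idea of what "putting an instruct layer on top" looks like in practice, here is a minimal sketch of instruction fine-tuning a base LLaMA checkpoint with LoRA adapters (in the spirit of Alpaca-style tuning). The model name, dataset, and hyperparameters are illustrative assumptions, not a specific recipe from this thread; the point is that only a small set of adapter weights is trained, which is what keeps the compute bill low.

```python
# Sketch: instruction fine-tuning ("instruct layer") on a base LLaMA model with LoRA.
# Checkpoint, dataset, and hyperparameters below are assumptions for illustration.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

base_model = "huggyllama/llama-7b"  # assumed 7B base checkpoint
tokenizer = AutoTokenizer.from_pretrained(base_model)
tokenizer.pad_token = tokenizer.eos_token

model = AutoModelForCausalLM.from_pretrained(base_model)

# Attach low-rank adapters so only a small fraction of weights are trained.
lora = LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05,
                  target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM")
model = get_peft_model(model, lora)

# Any instruction/response dataset works; tatsu-lab/alpaca is one public example.
data = load_dataset("tatsu-lab/alpaca", split="train")

def format_example(ex):
    # Wrap each record in a simple instruction/response prompt template.
    text = f"### Instruction:\n{ex['instruction']}\n\n### Response:\n{ex['output']}"
    return tokenizer(text, truncation=True, max_length=512)

tokenized = data.map(format_example, remove_columns=data.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="llama-instruct",
                           per_device_train_batch_size=4,
                           num_train_epochs=3,
                           learning_rate=2e-4,
                           fp16=True),
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

With adapter-style tuning like this, a 7B model fits on a single consumer/prosumer GPU for a few hours, which is roughly where the "couple hundred bucks" estimate comes from; a full 65B run needs multi-GPU hardware, hence the higher figure.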