Llama.cpp 30B runs with only 6GB of RAM now

(github.com)

sillysaurusx ◴[31 Mar 23 21:16 UTC] No.35393782[source]▶

>>35393284 (OP) #

On the legal front, I’ve been working with counsel to draft a counterclaim to Meta’s DMCA against llama-dl. (GPT-4 is surprisingly capable, but I’m talking to a few attorneys: https://twitter.com/theshawwn/status/1641841064800600070?s=6...)

An anonymous HN user named L pledged $200k for llama-dl’s legal defense: https://twitter.com/theshawwn/status/1641804013791215619?s=6...

This may not seem like much vs Meta, but it’s enough to get the issue into the court system where it can be settled. The tweet chain has the details.

The takeaway for you is that you’ll soon be able to use LLaMA without worrying that Facebook will knock you offline for it. (I wouldn’t push your luck by trying to use it for commercial purposes though.)

Past discussion: https://news.ycombinator.com/item?id=35288415

I’d also like to take this opportunity to thank all of the researchers at MetaAI for their tremendous work. It’s because of them that we have access to such a wonderful model in the first place. They have no say over the legal side of things. One day we’ll all come together again, and this will just be a small speedbump in the rear view mirror.

EDIT: Please do me a favor and skip ahead to this comment: https://news.ycombinator.com/item?id=35393615

It's from jart, the author of the PR the submission points to. I really had no idea that this was a de facto Show HN, and it's terribly rude to post my comment in that context. I only meant to reassure everyone that they can freely hack on llama, not make a huge splash and detract from their moment on HN. (I feel awful about that; it's wonderful to be featured on HN, and no one should have to share their spotlight when it's a Show HN. Apologies.)

replies(7): >>35393813 #>>35393848 #>>35394028 #>>35394029 #>>35394084 #>>35394156 #>>35394431 #

1. cubefox ◴[31 Mar 23 21:39 UTC] No.35394028[source]▶

>>35393782 #

Even if using LLaMA turns out to be legal, I very much doubt it is ethical. The model got leaked while it was only intended for research purposes. Meta engineered and paid for the training of this model. It's theirs.

replies(5): >>35394052 #>>35394067 #>>35394111 #>>35394143 #>>35394388 #

2. Uupis ◴[31 Mar 23 21:41 UTC] No.35394052[source]▶

>>35394028 (TP) #

I feel like almost everything about these models gets really ethically grey, at worst, very quickly.

3. willcipriano ◴[31 Mar 23 21:43 UTC] No.35394067[source]▶

>>35394028 (TP) #

What did they train it on?

replies(1): >>35394204 #

4. faeriechangling ◴[31 Mar 23 21:47 UTC] No.35394111[source]▶

>>35394028 (TP) #

Did Meta ask permission from every user whose data they trained their model on? Did all those users consent to Meta building an AI with their data? And when I say consent, I mean a meeting of the minds, not something buried on page 89 of a EULA.

Turnabout is fair play. I don't feel the least bit sorry for Meta.

replies(3): >>35394149 #>>35394190 #>>35394302 #

5. dodslaser ◴[31 Mar 23 21:51 UTC] No.35394143[source]▶

>>35394028 (TP) #

Meta as a company has shown pretty blatantly that they don't really care about ethics, nor the law for that matter.

6. terafo ◴[31 Mar 23 21:52 UTC] No.35394149[source]▶

>>35394111 #

LLaMa was trained on data of Meta users, though.

replies(1): >>35399778 #

7. cubefox ◴[31 Mar 23 21:55 UTC] No.35394190[source]▶

>>35394111 #

But it doesn't copy any text one-to-one. The largest one was trained on 1.4 trillion tokens, if I recall correctly, but the model size is just 65 billion parameters. (I believe they use 16 bits per token and per parameter.) It seems to be more like a human who has read large parts of the internet, but doesn't remember anything word by word. Learning from reading stuff was never considered a copyright violation.
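
As a rough back-of-the-envelope check of that size argument, here is a minimal Python sketch. It takes the token count, the parameter count, and the 16-bit width at face value from the comment above (they are recalled from memory there and not verified here), and the function name `compression_summary` is made up for illustration.

```python
# Figures quoted in the comment above; treated as assumptions, not verified facts.
TRAINING_TOKENS = 1.4e12   # 1.4 trillion training tokens
PARAMETERS = 65e9          # 65 billion model parameters
BYTES_PER_VALUE = 2        # assumed 16 bits per token and per parameter


def compression_summary(tokens: float, params: float, bytes_per_value: int) -> dict:
    """Return rough storage sizes in bytes and the tokens-per-parameter ratio."""
    return {
        "training_bytes": tokens * bytes_per_value,
        "model_bytes": params * bytes_per_value,
        "tokens_per_parameter": tokens / params,
    }


summary = compression_summary(TRAINING_TOKENS, PARAMETERS, BYTES_PER_VALUE)
print(f"training text: ~{summary['training_bytes'] / 1e12:.1f} TB")
print(f"model weights: ~{summary['model_bytes'] / 1e9:.0f} GB")
print(f"tokens per parameter: ~{summary['tokens_per_parameter']:.1f}")
```

Under those assumptions the weights come to roughly 130 GB against roughly 2.8 TB of training tokens, or about 21.5 tokens per parameter. That is only an average; it says nothing about whether any specific passage was memorized.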

replies(2): >>35394552 #>>35395849 #

8. cubefox ◴[31 Mar 23 21:56 UTC] No.35394204[source]▶

>>35394067 #

On partly copyrighted text. Same as you and me.

9. shepardrtc ◴[31 Mar 23 22:04 UTC] No.35394302[source]▶

>>35394111 #

They don't ask permission when they're stealing users' data, so why should users ask permission before taking theirs?

https://www.usatoday.com/story/tech/2022/09/22/facebook-meta...

10. seydor ◴[31 Mar 23 22:11 UTC] No.35394388[source]▶

>>35394028 (TP) #

It's an index of the web and our own comments, hardly something they can claim ownership of, and especially not something to resell.

But OTOH, by preventing commercial use, they have sparked the creation of an open source ecosystem where people are building on top of it because it's fun, not because they want to build a moat to fill it with sweet VC $$$money.

It's great to see that ecosystem being built around it, and soon someone will train a fully open source model to replace LLaMA.

11. Avicebron ◴[31 Mar 23 22:24 UTC] No.35394552{3}[source]▶

>>35394190 #

> It seems to be more like a human who has read large parts of the internet, but doesn't remember anything word by word. Learning from reading stuff was never considered a copyright violation.

This is one of the most common talking points I see brought up, especially when defending things like AI "learning" from the style of artists and then being able to replicate that style. On the surface we can say, oh, it's similar to a human learning from an art style and replicating it. But that implies that the program is functioning like a human mind. As far as I know the jury is still out on that, and I doubt we know exactly how a human mind actually "learns" (I'm not a neuroscientist).

Let's say, for the sake of experiment, I ask you to cut out every word of Pride and Prejudice and keep them all sorted. Then, when asked to write a story in the style of Jane Austen, you pull from that pile of snipped-out words and arrange them in the pattern that most resembles her writing. Did you transform it? Sure, maybe. If a human did that, I bet they could even copyright it. But a machine took those words and phrases and applied an algorithm to generate output. Even with stochastic elements, the direct backwards traceability, albeit through a 65B convolution of it, means that the essence of the copyrighted material has been directly translated.

From what I can see, we can't prove the human mind is strictly deterministic, but an AI very well might be in many senses. So the transference of non-deterministic material (the original) through a deterministic transform has to trace back to the non-deterministic model (the human mind, and therefore the original copyright holder).

12. ◴[01 Apr 23 00:53 UTC] No.35395849{3}[source]▶

>>35394190 #

13. terafo ◴[01 Apr 23 12:35 UTC] No.35399778{3}[source]▶

>>35394149 #

I was sleepy; I meant to say that it WASN'T trained on data of Meta users.
