←back to thread

390 points pyman | 1 comments | | HN request time: 0.462s | source
Show context
pyman ◴[] No.44488332[source]
Anthropic's cofounder, Ben Mann, downloaded million copies of books from Library Genesis in 2021, fully aware that the material was pirated.

Stealing is stealing. Let's stop with the double standards.

replies(8): >>44488391 #>>44488540 #>>44488816 #>>44490720 #>>44491032 #>>44491583 #>>44492035 #>>44493242 #
originalvichy ◴[] No.44488540[source]
At least most pirates just consume for personal use. Profiting from piracy is a whole other level beyond just pirating a book.
replies(4): >>44488621 #>>44488853 #>>44489003 #>>44490718 #
KoolKat23 ◴[] No.44489003[source]
This isn't really profiting from piracy. They don't make money off the raw input data. It's no different to consuming for personal use.

They make money off the model weights, which is fair use (as confirmed by recent case law).

replies(1): >>44489216 #
j_w ◴[] No.44489216[source]
This is absurd. Remove all of the content from the training data that was pirated and what is the quality of the end product now?
replies(2): >>44489279 #>>44489283 #
pyman ◴[] No.44489279[source]
With Claude, people are paying Anthropic to access answers that are generated from pirated books, without the authors permission, credit, or compensation.
replies(1): >>44489304 #
KoolKat23 ◴[] No.44489304[source]
There is no copyright on knowledge.

If it outputs parts of the book verbatim then that's a different story.

replies(2): >>44489612 #>>44492025 #
1. SirMaster ◴[] No.44492025[source]
>If it outputs parts of the book verbatim then that's a different story.

But it does...