←back to thread

394 points pyman | 5 comments | | HN request time: 0.876s | source
Show context
pyman ◴[] No.44488332[source]
Anthropic's cofounder, Ben Mann, downloaded million copies of books from Library Genesis in 2021, fully aware that the material was pirated.

Stealing is stealing. Let's stop with the double standards.

replies(8): >>44488391 #>>44488540 #>>44488816 #>>44490720 #>>44491032 #>>44491583 #>>44492035 #>>44493242 #
originalvichy ◴[] No.44488540[source]
At least most pirates just consume for personal use. Profiting from piracy is a whole other level beyond just pirating a book.
replies(4): >>44488621 #>>44488853 #>>44489003 #>>44490718 #
KoolKat23 ◴[] No.44489003[source]
This isn't really profiting from piracy. They don't make money off the raw input data. It's no different to consuming for personal use.

They make money off the model weights, which is fair use (as confirmed by recent case law).

replies(1): >>44489216 #
j_w ◴[] No.44489216[source]
This is absurd. Remove all of the content from the training data that was pirated and what is the quality of the end product now?
replies(2): >>44489279 #>>44489283 #
1. pyman ◴[] No.44489279[source]
With Claude, people are paying Anthropic to access answers that are generated from pirated books, without the authors permission, credit, or compensation.
replies(1): >>44489304 #
2. KoolKat23 ◴[] No.44489304[source]
There is no copyright on knowledge.

If it outputs parts of the book verbatim then that's a different story.

replies(2): >>44489612 #>>44492025 #
3. pyman ◴[] No.44489612[source]
Let's don't change the focus of the debate.

Pirating 7 million books, remixing their content, and using that to power Claude.ai is like counterfeiting 7 million branded products and selling them on your personal website. The original creators don't get credit or payment, and someone’s profiting off their work.

All this happens while authors, many of them teachers, are left scratching their heads with four kids to feed

replies(1): >>44489775 #
4. KoolKat23 ◴[] No.44489775{3}[source]
That may be the case, but you'd have to have laws changed.
5. SirMaster ◴[] No.44492025[source]
>If it outputs parts of the book verbatim then that's a different story.

But it does...