Anthropic agrees to pay $1.5B to settle lawsuit with book authors

(www.nytimes.com)

989 points acomjean | 2 comments | 05 Sep 25 19:52 UTC | HN request time: 0.528s | source

Also https://www.washingtonpost.com/technology/2025/09/05/anthrop..., https://www.reuters.com/sustainability/boards-policy-regulat...

Show context

aeon_ai ◴[05 Sep 25 20:46 UTC] No.45143392[source]▶

>>45142885 (OP) #

To be very clear on this point - this is not related to model training.

It’s important in the fair use assessment to understand that the training itself is fair use, but the pirating of the books is the issue at hand here, and is what Anthropic “whoopsied” into in acquiring the training data.

Buying used copies of books, scanning them, and training on it is fine.

Rainbows End was prescient in many ways.

replies(36): >>45143460 #>>45143461 #>>45143507 #>>45143513 #>>45143567 #>>45143731 #>>45143840 #>>45143861 #>>45144037 #>>45144244 #>>45144321 #>>45144837 #>>45144843 #>>45144845 #>>45144903 #>>45144951 #>>45145884 #>>45145907 #>>45146038 #>>45146135 #>>45146167 #>>45146218 #>>45146268 #>>45146425 #>>45146773 #>>45146935 #>>45147139 #>>45147257 #>>45147558 #>>45147682 #>>45148227 #>>45150324 #>>45150567 #>>45151562 #>>45151934 #>>45153210 #

mdp2021 ◴[05 Sep 25 21:48 UTC] No.45144037[source]▶

>>45143392 #

> Buying used copies of books

It remains deranged.

Everyone has more than a right to freely have read everything is stored in a library.

(Edit: in fact initially I wrote 'is supposed to' in place of 'has more than a right to' - meaning that "knowledge is there, we made it available: you are supposed to access it, with the fullest encouragement").

replies(3): >>45144141 #>>45145658 #>>45145964 #

vkou ◴[06 Sep 25 01:16 UTC] No.45145658[source]▶

>>45144037 #

> Everyone has more than a right to freely have read everything is stored in a library.

Every human has the right to read those books.

And now, this is obvious, but it seems to be frequently missed - an LLM is not a human, and does not have such rights.

replies(2): >>45145778 #>>45147703 #

nl ◴[06 Sep 25 01:36 UTC] No.45145778[source]▶

>>45145658 #

By US law, cccording to Author's Guild vs Google[1] on the Google book scanning project, scanning books for indexes is fair use.

Additionally:

> Every human has the right to read those books.

Since when?

I strongly disagree - knowledge should be free.

I don't think the author's arrangement of the words should be free to reproduce (ie, I think some degree of copyright protection is ethical) but if I want to use a tool to help me understand the knowledge in a book then I should be able to.

[1] https://en.wikipedia.org/wiki/Authors_Guild,_Inc._v._Google,....

replies(6): >>45145933 #>>45146371 #>>45147476 #>>45150582 #>>45153091 #>>45153137 #

LunaSea ◴[06 Sep 25 08:01 UTC] No.45147476[source]▶

>>45145778 #

> knowledge should be free

As soo as OpenAI open sources their model's source code I'll agree.

replies(2): >>45147505 #>>45147720 #

mdp2021 ◴[06 Sep 25 09:00 UTC] No.45147720[source]▶

>>45147476 #

That is an elision for "public knowledge". Of course there are nuances. In the case of books, there is little doubt: printed for sale is literally named "published".

(The "for sale" side does not limit the purpose to sales only, before somebody wants to attack that.)

replies(1): >>45148295 #

1. LunaSea ◴[06 Sep 25 11:12 UTC] No.45148295[source]▶

>>45147720 #

Books are private objects sold to buyers. By definition, its not public knowledge.

replies(1): >>45151368 #

2. mdp2021 ◴[06 Sep 25 17:47 UTC] No.45151368[source]▶

>>45148295 (TP) #

Again and again: the "book", the item, is a private object, access to the text is freely available - to those member of societies that have decided that knowledge be freely available and have thus established libraries. (They have collected the books - their own - so that we can freely access the texts.)

↑