←back to thread

290 points nobody9999 | 1 comments | | HN request time: 0.277s | source
Show context
jawns ◴[] No.45187038[source]
I'm an author, and I've confirmed that 3 of my books are in the 500K dataset.

Thus, I stand to receive about $9,000 as a result of this settlement.

I think that's fair, considering that two of those books received advances under $20K and never earned out. Also, while I'm sure that Anthropic has benefited from training its models on this dataset, that doesn't necessarily mean that those models are a lasting asset.

replies(22): >>45187319 #>>45187366 #>>45187519 #>>45187839 #>>45188602 #>>45189683 #>>45189684 #>>45190184 #>>45190223 #>>45190237 #>>45190555 #>>45190731 #>>45191633 #>>45192016 #>>45192191 #>>45192348 #>>45192404 #>>45192630 #>>45193043 #>>45195516 #>>45201246 #>>45218895 #
visarga ◴[] No.45187519[source]
How is it fair? Do you expect 9,000 from Google, Meta, OpenAI, and everyone else? Were your books imitated by AI?

Infringement was supposed to imply substantial similarity. Now it is supposed to mean statistical similarity?

replies(4): >>45187577 #>>45187677 #>>45187811 #>>45187853 #
gruez ◴[] No.45187577[source]
>Were your books imitated by AI?

Given that books can be imitated by humans with no compensation, this isn't as strong as an argument as you think. Moreover AFAIK the training itself has been ruled legal, so Anthropic could have theoretically bought the book for $20 (or whatever) and be in the clear, which would obviously bring less revenue than the $9k settlement.

replies(2): >>45187621 #>>45188044 #
visarga ◴[] No.45187621[source]
Copyright should be about copying rights, not statistical similarities. Similarity vs causal link - a different standard all together.
replies(3): >>45187751 #>>45187806 #>>45187851 #
dotnet00 ◴[] No.45187851[source]
Those statistical similarities originate from a copyright violation, there's your causal link. Basically the same as selling a game made using pirated Photoshop.
replies(2): >>45188135 #>>45189695 #
reissbaker ◴[] No.45188135[source]
Selling a game whose assets were made with a pirated copy of Photoshop does not extend Adobe's copyright to cover your game itself. They can sue you for using the pirated copy of Photoshop, but they can't extend copyright vampirically in that manner — at least, not in the United States.

(They can still sue for damages, but they can't claim copyright over your game itself.)

replies(3): >>45188574 #>>45189126 #>>45190966 #
dotnet00 ◴[] No.45189126[source]
Are the authors claiming copyright over the LLM? My understanding is they were suing Anthropic for using the authors' data in their training product. The court ruled that using the books for training would be fair use, but that piracy is not fair use.

Thus, isn't the settlement essentially Anthropic admitting that they don't really have an effective defense against the piracy claim?

replies(1): >>45202377 #
1. reissbaker ◴[] No.45202377[source]
Oh I don't disagree that the authors may have a compelling case — I was just responding to the statistical similarities vs copying argument. Anthropic may have violated the authors rights, but technically that doesn't extend copyright via a "causal link."

The authors can still sue for damages though (and did, and had a strong enough case Anthropic is trying to settle for over a billion dollars).