←back to thread

399 points pyman | 2 comments | | HN request time: 1.074s | source
Show context
dehrmann ◴[] No.44491718[source]
The important parts:

> Alsup ruled that Anthropic's use of copyrighted books to train its AI models was "exceedingly transformative" and qualified as fair use

> "All Anthropic did was replace the print copies it had purchased for its central library with more convenient space-saving and searchable digital copies for its central library — without adding new copies, creating new works, or redistributing existing copies"

It was always somewhat obvious that pirating a library would be copyright infringement. The interesting findings here are that scanning and digitizing a library for internal use is OK, and using it to train models is fair use.

replies(6): >>44491820 #>>44491944 #>>44492844 #>>44494100 #>>44494132 #>>44494944 #
6gvONxR4sf7o ◴[] No.44491944[source]
You skipped quotes about the other important side:

> But Alsup drew a firm line when it came to piracy.

> "Anthropic had no entitlement to use pirated copies for its central library," Alsup wrote. "Creating a permanent, general-purpose library was not itself a fair use excusing Anthropic's piracy."

That is, he ruled that

- buying, physically cutting up, physically digitizing books, and using them for training is fair use

- pirating the books for their digital library is not fair use.

replies(6): >>44492103 #>>44492512 #>>44492665 #>>44493580 #>>44493641 #>>44495079 #
jasonlotito ◴[] No.44492665[source]
From my understanding:

> pirating the books for their digital library is not fair use.

"Pirating" is a fuzzy word and has no real meaning. Specifically, I think this is the cruz:

> without adding new copies, creating new works, or redistributing existing copies

Essentially: downloading is fine, sharing/uploading up is not. Which makes sense. The assertion here is that Anthropic (from this line) did not distribute the files they downloaded.

replies(3): >>44492681 #>>44493044 #>>44493056 #
AlotOfReading ◴[] No.44493044[source]
The legal context here is that "format shifting" has not previously been held to be sufficient for fair use on its own, and downloading for personal use has also been considered infringing. Just look at the numerous media industry lawsuits against individuals that only mention downloading, not sharing for examples.

It's a bit surprising that you can suddenly download copyrighted materials for personal use and and it's kosher as long as you don't share them with others.

replies(1): >>44494514 #
jasonlotito ◴[] No.44494514[source]
> the numerous media industry lawsuits against individuals that only mention downloading,

I never saw any of these. All the cases I saw were related to people using torrents or other P2P software (which aren't just downloading). These might exist, but I haven't seen them.

> It's a bit surprising that you can suddenly download copyrighted materials for personal use and it's kosher as long as you don't share them with others.

Every click on a link is a risk of downloading copyrighted material you don't have the rights to.

Searching the internet, it appears that it's a civil infraction, but it's also confused with the notion that "piracy" is illegal, a term that's used for many different purposes. I see "It is illegal to download any music or movies that are copyrighted." under legal advice, which I know as a statement is not true.

Hence my confusion.

I should note: I'm not arguing from the perspective of whether it's morally or ethically right. Only that even in the context of this thread, things are phrased that aren't clear.

replies(1): >>44494675 #
AlotOfReading ◴[] No.44494675[source]
I just checked first individual suit I could find, which was BMG v. Gonzalez. She used P2P, but the case was specifically about her downloading, not redistributing.
replies(1): >>44495546 #
1. travoc ◴[] No.44495546[source]
Most P2P tools work in a way where you cannot download without simultaneously uploading.
replies(1): >>44495750 #
2. AlotOfReading ◴[] No.44495750[source]
Which is beside the point if the plaintiffs don't claim it as an issue. Take the anthropic opinion in the article, where the judge explicitly calls out that there's an unresolved question of whether the model outputs might be infringing that can't be ruled on because the plaintiffs only talk about the inputs.

Gonzalez is a ruling about downloading even though there was also distribution.