←back to thread

398 points pyman | 2 comments | | HN request time: 0.415s | source
Show context
bgwalter ◴[] No.44490836[source]
Here is how individuals are treated for massive copyright infringement:

https://investors.autodesk.com/news-releases/news-release-de...

replies(8): >>44490942 #>>44491257 #>>44491526 #>>44491536 #>>44491907 #>>44493281 #>>44493918 #>>44493925 #
farceSpherule ◴[] No.44491907[source]
Peterson was copying and selling pirated software.

Come up with a better comparison.

replies(1): >>44491926 #
organsnyder ◴[] No.44491926[source]
Anthropic is selling a service that incorporates these pirated works.
replies(1): >>44492293 #
adolph ◴[] No.44492293[source]
That a service incorporating the authors' works exists is not at issue. The plaintiffs' claims are, as summarized by Alsup:

  First, Authors argue that using works to train Claude’s underlying LLMs 
  was like using works to train any person to read and write, so Authors 
  should be able to exclude Anthropic from this use (Opp. 16). 

  Second, to that last point, Authors further argue that the training was 
  intended to memorize their works’ creative elements — not just their 
  works’ non-protectable ones (Opp. 17).

  Third, Authors next argue that computers nonetheless should not be 
  allowed to do what people do. 
https://media.npr.org/assets/artslife/arts/2025/order.pdf
replies(4): >>44492411 #>>44492758 #>>44492890 #>>44493381 #
codedokode ◴[] No.44492411[source]
Computers cannot learn and are not subjects to laws. What happens, is a human takes a copyrighted work, makes an unauthorized digital copy, and loads it into a computer without authorization from copyright owner.
replies(2): >>44493047 #>>44493124 #
KoolKat23 ◴[] No.44493047[source]
And they are not selling this or distributing this.

The model is very different.

replies(1): >>44494018 #
cmiles74 ◴[] No.44494018[source]
I have to disagree, without all the copyrighted input data there would be no output data for these companies to sell. This output data is the product and they are distributing it for dollars.
replies(1): >>44494072 #
KoolKat23 ◴[] No.44494072[source]
Copyright is concerned with the the actual physical copy. The model isn't this. The end user would have to carefully prompt the models algorithm to output a copyright infringing piece.

This argument is more along the lines of: blaming Microsoft Word for someone typing characters into the word processors algorithm, and outputting a copy of an existing book. (Yes, it is a lot easier, but the rationale is the same). In my mind the end user prompting the model would be the one potentially infringing.

replies(1): >>44494226 #
1. cmiles74 ◴[] No.44494226[source]
FWIW, I don’t think there is a prompt that would reliably produce, verbatim, a copyrighted work.

I do think that a big part of the reason Anthropic downloaded millions of books from pirate torrents was because they needed that input data in order to generate the output, their product.

I don’t know what that is, but, IMHO, not sharing those dollars with the creators of the content is clearly wrong.

replies(1): >>44495306 #
2. ◴[] No.44495306[source]