Most active commenters
  • shkkmo(3)
  • dfxm12(3)
  • flyingcircus3(3)

←back to thread

451 points croes | 38 comments | | HN request time: 0.001s | source | bottom
1. prvc ◴[] No.43962193[source]
The released draft report seems merely to be a litany of copyright holder complaints repeated verbatim, with little depth of reasoning to support the conclusions it makes.
replies(4): >>43962324 #>>43962424 #>>43962648 #>>43962893 #
2. raverbashing ◴[] No.43962324[source]
I don't have much spare sympathy here honestly
3. bgwalter ◴[] No.43962424[source]
The required reasoning is not very deep though: If an AI reads 100 scientific papers and churns out a new one, it is plagiarism.

If a savant has perfect recall, remembers text perfectly and rearranges that text to create a marginally new text, he'd be sued for breach of copyright.

Only large corporations get away with it.

replies(9): >>43962554 #>>43962560 #>>43962638 #>>43962665 #>>43962744 #>>43962820 #>>43963108 #>>43963228 #>>43963944 #
4. scraptor ◴[] No.43962554[source]
Plagiarism is not an issue of copyright law, it's an entirely separate system of rules maintained by academia. The US Copyright Office has no business having opinions about it. If a AI^W human reads 100 papers and then churns out a new one this is usually called research.
replies(5): >>43962756 #>>43962757 #>>43963247 #>>43963863 #>>43966801 #
5. satanfirst ◴[] No.43962560[source]
That's not logical. If the savant has perfect recall and makes minor edits they are like a digital copy and aren't really like a human, neural network or by extension any other ML model that isn't over-fitted.
6. tantalor ◴[] No.43962638[source]
If AI really could "churn out a new scientific paper" we would all be ecstatically rejoicing in the dawning of an age of AGI. We are nowhere near that.
replies(1): >>43962711 #
7. nadermx ◴[] No.43962648[source]
Not only does it read like a litany[0]. It seems like the copyright holders are not happy with how the meta case is working through court and are trying to sidestep fair use entirely.

https://www.copyright.gov/ai/Copyright-and-Artificial-Intell...

replies(1): >>43963315 #
8. glial ◴[] No.43962665[source]
It reminds me of the old joke.

"To steal ideas from one person is plagiarism; to steal from many is research."

replies(1): >>43974206 #
9. viraptor ◴[] No.43962711{3}[source]
We're relatively close already https://openreview.net/pdf?id=12T3Nt22av And we don't need anything even close to AGI to achieve that.
10. shkkmo ◴[] No.43962744[source]
> If a savant has perfect recall, remembers text perfectly and rearranges that text to create a marginally new text, he'd be sued for breach of copyright.

Any suits would be based on the degree the marginally new copy was fair use. You wouldn't be able to sue the savant for reading and remembering the text.

Using AI to creat marginally new copies of copyrighted work is ALREADY a violation. We don't need a dramatic expansion of copyright law that says that just giving the savant the book to real is a copyright violation.

Plagarism and copyright are two entirely different things. Plagarism is about citations and intellectual integrity. Copyright is a about protecting economic interests, has nothing to to with intellectual integrity, and isn't resolved by citing the original work. In fact most of the contexts where you would be accused of plagarism, would be places like reporting, criticism, education or research goals make fair use arguments much easier.

11. ta1243 ◴[] No.43962756{3}[source]
Only when those papers are referenced
12. dfxm12 ◴[] No.43962757{3}[source]
Please argue in good faith. A new research paper is obviously materially different from "rearranging that text to create a marginally new text".
replies(2): >>43962849 #>>43962855 #
13. Maxatar ◴[] No.43962820[source]
Plagiarism isn't illegal, has nothing to do with the law.
replies(1): >>43962877 #
14. shkkmo ◴[] No.43962849{4}[source]
The comment is responding to this line:

> If an AI reads 100 scientific papers and churns out a new one, it is plagiarism.

That is a specific claim that is being directly addressed and pretty clearly qualifies as "good faith".

15. int_19h ◴[] No.43962855{4}[source]
"Rearranging text" is not what modern LLMs do though, unless you specifically ask them to.
replies(1): >>43963842 #
16. shkkmo ◴[] No.43962877{3}[source]
Plagarism is often illegal. If you use plagarism to obtain a financial or other benefit, that can be fraud.
replies(1): >>43963390 #
17. JKCalhoun ◴[] No.43963108[source]
My understanding — LLMs are nothing at all like a "savant with perfect recall".

More like a speed-reader who retains a schema-level grasp of what they’ve read.

18. mr_toad ◴[] No.43963228[source]
> If a savant has perfect recall

AI don’t have perfect recall.

19. palmotea ◴[] No.43963247{3}[source]
> Plagiarism is not an issue of copyright law, it's an entirely separate system of rules maintained by academia. The US Copyright Office has no business having opinions about it. If a AI^W human reads 100 papers and then churns out a new one this is usually called research.

If you draw a Venn Diagram of plagiarism and copyright violations, there's a big intersection. For example: if I take your paper, scratch off your name, make some minor tweaks, and submit it; I'm guilty of both plagiarism and copyright violation.

20. mr_toad ◴[] No.43963315[source]
Copywriter holders have always hated fair use, and often like to pretend it doesn’t exist.

The average copywrite holder would like you to think that the law only allows use of their works in ways that they specifically permit, i.e. that which is not explicitly permitted is forbidden.

But the law is largely the reverse; it only denies use of copyright works in certain ways. That which is not specifically forbidden is permitted.

replies(1): >>43964188 #
21. jobigoud ◴[] No.43963390{4}[source]
That further drives the point that the issue is not what the AI is doing but what people using it are doing.
22. dfxm12 ◴[] No.43963842{5}[source]
I didn't make this claim. Feel free to bring a cogent argument to a commenter who did.
replies(1): >>43965264 #
23. biophysboy ◴[] No.43963863{3}[source]
Having actually done research and published scientific papers, the key limitation is experimentation. Review papers are useful, and AI is useful, but creating new knowledge is more useful. I haven't had much luck using LLMs to extrapolate well beyond their knowledge domain.
replies(1): >>43966819 #
24. wizee ◴[] No.43963944[source]
Is reading and memorizing a copyrighted text a breach of copyright? I.e. is creating a copy of the text in your mind a breach of copyright or fair fair use? Is it a breach of copyright if a digital “mind” similarly memorizes copyrighted text? Or is it only a breach of copyright to output and publish that memorized text?

What about loosely memorizing the gist of a copyrighted text. Is that a breach or fair use? What if a machine does something similar?

This falls under a rather murky area of the law that is not well defined.

replies(1): >>43964865 #
25. flyingcircus3 ◴[] No.43964010[source]
The very fact that you can bring this tired retort to any argument regardless of context reveals it for what it is: an off ramp to any conversation you have no better argument against.
replies(2): >>43964818 #>>43964889 #
26. ls612 ◴[] No.43964188{3}[source]
That used to be how it worked. Then the DMCA 1201 provisions arrived and so now anything not expressly permitted by the enumerated exceptions is forbidden. Even talking about how it works is punishable as a felony (upheld by SCOTUS in like 2000 or 2001, they basically said the Copyright clause is in the constitution so the government can censor information on how to defeat DRM).
replies(1): >>43964771 #
27. nadermx ◴[] No.43964771{4}[source]
Breaking DRM, is in fact, Fair Use: https://www.ca5.uscourts.gov/opinions/pub/08/08-10521-CV0.wp...
28. dylan604 ◴[] No.43964818{3}[source]
It also assumes that the orange man has an original thought and not something that he's been convinced of by all of the direct underlings or even 3rd party NGOs that advise/lobby those underlings.
29. aeonik ◴[] No.43964865{3}[source]
"Filthy eidetics. Their freeloading had become too much for our society to bear. Something had to be done. We found the mutation in their hippocampus and released a new CRISPR-mRNA-based gene suppression system.

Those who were immune were put under the scalpel."

30. 93po ◴[] No.43964889{3}[source]
People see actions and make assumptions on intentions behind those actions. They also make assumptions on who actually called for those actions, or the percent to which people contributed to those decisions.

If you don't have a tape recorder showing Trump saying "Fire Shira, I don't like what she did and she needs to get out" then you are making assumptions both for his reasons and his involvement. No one has that tape. Which means any claims that this is what happening is entirely speculation. We've seen a decade of people claiming these assumptions as fact, and it's really tiresome.

replies(1): >>43965090 #
31. flyingcircus3 ◴[] No.43965090{4}[source]
Now apply this reasoning to Trump standing in Air Force One and saying that he would bring someone back of the Supreme Court said to. It's on video.
replies(1): >>43966315 #
32. gruez ◴[] No.43965264{6}[source]
>I didn't make this claim

???

Did you not literally comment the following?

>A new research paper is obviously materially different from "rearranging that text to create a marginally new text".

What did you mean by that, if that's not your claim?

replies(1): >>43965763 #
33. dfxm12 ◴[] No.43965763{7}[source]
I made that comment, but the bit in quotes is not my claim. I was quoting a grandparent post. If you read from the top, the quotation marks and general flow of the thread should make this clear.
34. 93po ◴[] No.43966315{5}[source]
I spent 10+ minutes trying to find anything Trump has said on camera about the copyright office, and went through the only video I could find of Trump on air force one in the past week to see any references to this, and saw none.
replies(1): >>43966830 #
35. anigbrowl ◴[] No.43966801{3}[source]
You were supposed to keep reading past the first sentence, instead of trying to refute the first thing you saw that you found disagreeable. By doing so, you missed the point that plagiarism is substantively different from copyright infringement.
36. scraptor ◴[] No.43966819{4}[source]
I certainly don't see much value in AI generated papers myself, I just object to the claim that the mere act of reading a large number of existing papers before writing yours is inherently plagiarism.
37. flyingcircus3 ◴[] No.43966830{6}[source]
https://www.washingtonpost.com/video/politics/trump-if-supre...

It's not related to copyright. It is an example of your hypothetical standard required to attribute something to Trump. My point is that even when he is on camera saying something, that does not prevent the post facto rationalizations. Even if he was on tape firing this person, people would rationalize this away too.

38. slipnslider ◴[] No.43974206{3}[source]
Einstein once said "the key to genius is to hide your sources well"

And honestly there is truth to it. Some people (at work, in rea life, wherever) might come off very inteligent but the moment they say "oh I just read that relevant fact on reddit/twitter/news site 5 minutes ago" you realize they are just like you and repeating relevant information that was consumed recently.