←back to thread

452 points croes | 2 comments | | HN request time: 0.001s | source
Show context
mattxxx ◴[] No.43962976[source]
Well, firing someone for this is super weird. It seems like an attempt to censor an interpretation of the law that:

1. Criticizes a highly useful technology 2. Matches a potentially-outdated, strict interpretation of copyright law

My opinion: I think using copyrighted data to train models for sure seems classically illegal. Despite that, Humans can read a book, get inspiration, and write a new book and not be litigated against. When I look at the litany of derivative fantasy novels, it's obvious they're not all fully independent works.

Since AI is and will continue to be so useful and transformative, I think we just need to acknowledge that our laws did not accomodate this use-case, then we should change them.

replies(19): >>43963017 #>>43963125 #>>43963168 #>>43963214 #>>43963243 #>>43963311 #>>43963423 #>>43963517 #>>43963612 #>>43963721 #>>43963943 #>>43964079 #>>43964280 #>>43964365 #>>43964448 #>>43964562 #>>43965792 #>>43965920 #>>43976732 #
palmotea[dead post] ◴[] No.43963168[source]
[flagged]
Suppafly ◴[] No.43967443[source]
>The fatal flaw in your reasoning: machines aren't humans.

I don't see how that affects the argument. The machines are being used by humans. Your argument then boils down to the idea that you can do something manually but it becomes illegal if you use a tool to do it efficiently.

replies(1): >>43967533 #
const_cast ◴[] No.43967533{3}[source]
It's not about the tool, how you use it, or even how it works. It's about the end result.

I can go through and manually compress "Revenge of the Sith" and then post it online. Or, I can use a compression program like handbrake. Regardless, it is copyright infringement.

Can AI reproduce almost* the same things that exist in it's training data? Sometimes, so sometimes it's copyright infringement. Doesn't help that it's explicitly for-profit and seeks to obsolesce and siphon value from it's training material.

replies(1): >>43967637 #
Suppafly ◴[] No.43967637{4}[source]
>Sometimes, so sometimes it's copyright infringement.

So in those cases, the original authors might have a case. Generally you don't see these LLM doing that though.

>Doesn't help that it's explicitly for-profit and seeks to obsolesce and siphon value from it's training material.

Doesn't hurt either. That's a reason to be butthurt, but that's not a legal argument.

replies(1): >>43967723 #
1. const_cast ◴[] No.43967723{5}[source]
> That's a reason to be butthurt, but that's not a legal argument.

It is a legal argument, fair use specifically takes into account the intention. Just using it for commercial ventures makes the water hotter.

replies(1): >>43976273 #
2. Suppafly ◴[] No.43976273[source]
>It is a legal argument

Not a very good one then.