
989 points acomjean | 1 comments
petralithic ◴[] No.45143482[source]
This is sad for open source AI. Piracy for the purpose of model training should also be fair use, because otherwise only the big companies who can afford to pay off publishers like Anthropic will be able to do so. There is no way to buy billions of books just for model training; it simply can't happen.
replies(9): >>45143523 #>>45143780 #>>45143876 #>>45144861 #>>45145004 #>>45145076 #>>45146993 #>>45147328 #>>45148584 #
1. Aurornis ◴[] No.45145076[source]
This is a settlement. It does not set a precedent nor even admit to wrongdoing.

> otherwise only the big companies who can afford to pay off publishers like Anthropic will be able to do so

Only well-funded companies can afford to hire a lot of expensive engineers and train AI models on hundreds of thousands of expensive GPUs, too.

Something tells me many of the grassroots LLM training people are less concerned about the legality of their source training set than the big companies anyway.