989 points acomjean | 2 comments
petralithic ◴[] No.45143482[source]
This is sad for open source AI. Piracy for the purpose of model training should also be fair use, because otherwise only big companies like Anthropic, which can afford to pay off publishers, will be able to train models. There is no way to buy billions of books just for model training; it simply can't happen.
replies(9): >>45143523 #>>45143780 #>>45143876 #>>45144861 #>>45145004 #>>45145076 #>>45146993 #>>45147328 #>>45148584 #
1. josh-sematic ◴[] No.45148584[source]
Setting aside whether or not I think it should be fair use, these days you're only going to be training a new foundation model if you have billions of dollars to spend on the endeavor anyway. Nobody is training Llama 5 in their garage.
replies(1): >>45155128 #
2. petralithic ◴[] No.45155128[source]
Millions, not billions, as DeepSeek and others have shown.