←back to thread

989 points acomjean | 1 comments | | HN request time: 0s | source
Show context
aeon_ai ◴[] No.45143392[source]
To be very clear on this point - this is not related to model training.

It’s important in the fair use assessment to understand that the training itself is fair use, but the pirating of the books is the issue at hand here, and is what Anthropic “whoopsied” into in acquiring the training data.

Buying used copies of books, scanning them, and training on it is fine.

Rainbows End was prescient in many ways.

replies(36): >>45143460 #>>45143461 #>>45143507 #>>45143513 #>>45143567 #>>45143731 #>>45143840 #>>45143861 #>>45144037 #>>45144244 #>>45144321 #>>45144837 #>>45144843 #>>45144845 #>>45144903 #>>45144951 #>>45145884 #>>45145907 #>>45146038 #>>45146135 #>>45146167 #>>45146218 #>>45146268 #>>45146425 #>>45146773 #>>45146935 #>>45147139 #>>45147257 #>>45147558 #>>45147682 #>>45148227 #>>45150324 #>>45150567 #>>45151562 #>>45151934 #>>45153210 #
ants_everywhere ◴[] No.45143731[source]
I wonder what Aaron Swartz would think if he lived to see the era of libgen.
replies(2): >>45143762 #>>45144481 #
r14c ◴[] No.45144481[source]
Didn't he get in trouble for contributing to sci-hub before he died?
replies(1): >>45145012 #
dekhn ◴[] No.45145012[source]
He got into trouble for breaking into an unsecured network closet at MIT and using MIT credentials to download a bunch of copyrighted content.

The whole incident is written up in detail, https://swartz-report.mit.edu/ by Hal Abelson (who wrote SICP among other things). It is a well-researched document.

replies(1): >>45145229 #
ants_everywhere ◴[] No.45145229[source]
I think the parent may be getting at why he was downloading the content. I don't know the answer to this. Maybe someone here does. What was he intending to do with the articles?

The report speculates to his motivations on page 31, but it seems to be unknown with any certainty.

replies(2): >>45145708 #>>45147141 #
1. dekhn ◴[] No.45145708{3}[source]
Swartz, like many of us, see pay-for-access journals as an affront. I believe he wanted to "liberate" the content of these articles so that more people could read them.

Information may want to be free, but sometimes it takes a revolutionary to liberate it.