←back to thread

257 points ColinWright | 1 comments | | HN request time: 0.21s | source
Show context
bakql ◴[] No.45775259[source]
>These were scrapers, and they were most likely trying to non-consensually collect content for training LLMs.

"Non-consensually", as if you had to ask for permission to perform a GET request to an open HTTP server.

Yes, I know about weev. That was a travesty.

replies(15): >>45775283 #>>45775392 #>>45775754 #>>45775912 #>>45775998 #>>45776008 #>>45776055 #>>45776210 #>>45776222 #>>45776270 #>>45776765 #>>45776932 #>>45777727 #>>45777934 #>>45778166 #
sdenton4 ◴[] No.45776055[source]
The problem is that serving content costs money. Llm scraping is essentially ddos'ing content meant for human consumption. Ddos'ing sucks.
replies(2): >>45776078 #>>45778039 #
dylan604 ◴[] No.45778039[source]
running the scraping bots cost money too.
replies(2): >>45778147 #>>45780634 #
1. meepmorp ◴[] No.45780634[source]
> Won’t somebody please think of the parasites?