←back to thread

192 points beedeebeedee | 5 comments | | HN request time: 1.167s | source
1. kazinator ◴[] No.41908045[source]
OTOH: ByteDance intern responsible for spamming your web server with crawlers that ignore robots.txt given permanent position with a raise, now in management.
replies(1): >>41910007 #
2. 123yawaworht456 ◴[] No.41910007[source]
honoring robots.txt is an informal courtesy, not international law.
replies(1): >>41910655 #
3. davemp ◴[] No.41910655[source]
Not breaking the law is just about the lowest bar you can set for an organization.
replies(2): >>41911135 #>>41914451 #
4. not_a_bot_4sho ◴[] No.41911135{3}[source]
We can go lower
5. hnfong ◴[] No.41914451{3}[source]
FYI, we're still not sure whether the scraped AI training datasets involve copyright infringement.