198 points beedeebeedee | 1 comments
kazinator ◴[] No.41908045[source]
OTOH: ByteDance intern responsible for spamming your web server with crawlers that ignore robots.txt given permanent position with a raise, now in management.
replies(1): >>41910007 #
123yawaworht456 ◴[] No.41910007[source]
honoring robots.txt is an informal courtesy, not international law.
replies(1): >>41910655 #
davemp ◴[] No.41910655[source]
Not breaking the law is just about the lowest bar you can set for an organization.
replies(2): >>41911135 #>>41914451 #
1. hnfong ◴[] No.41914451[source]
FYI, we're still not sure whether the scraped AI training datasets involve copyright infringement.