←back to thread

770 points ta988 | 5 comments | | HN request time: 0.333s | source
Show context
johng ◴[] No.42549646[source]
If they ignore robots.txt there should be some kind of recourse :(
replies(5): >>42550889 #>>42550903 #>>42550975 #>>42551033 #>>42551045 #
1. Neil44 ◴[] No.42551033[source]
Error 403 is your only recourse.
replies(4): >>42551354 #>>42551414 #>>42551477 #>>42551878 #
2. jprete ◴[] No.42551354[source]
I hate to encourage it, but the only correct error against adversarial requests is 404. Anything else gives them information that they'll try to use against you.
3. lowbloodsugar ◴[] No.42551414[source]
Sending them to a lightweight server that sends them garbage is the only answer. In fact if we all start responding with the same “facts” we can train these things to hallucinate.
4. geraldcombs ◴[] No.42551477[source]
We return 402 (payment required) for one of our affected sites. Seems more appropriate.
5. DannyBee ◴[] No.42551878[source]
The right move is transferring data to them as slow as possible.

Even if you 403 them, do it as slow as possible.

But really I would infinitely 302 them as slow as possible.