AI companies cause most of traffic on forums

(pod.geraspora.de)

770 points ta988 | 1 comments | 30 Dec 24 14:37 UTC | HN request time: 0.277s | source

Show context

markerz ◴[30 Dec 24 17:07 UTC] No.42551173[source]▶

One of my websites was absolutely destroyed by Meta's AI bot: Meta-ExternalAgent https://developers.facebook.com/docs/sharing/webmasters/web-...

It seems a bit naive for some reason and doesn't do performance back-off the way I would expect from Google Bot. It just kept repeatedly requesting more and more until my server crashed, then it would back off for a minute and then request more again.

My solution was to add a Cloudflare rule to block requests from their User-Agent. I also added more nofollow rules to links and a robots.txt but those are just suggestions and some bots seem to ignore them.

Cloudflare also has a feature to block known AI bots and even suspected AI bots: https://blog.cloudflare.com/declaring-your-aindependence-blo... As much as I dislike Cloudflare centralization, this was a super convenient feature.

replies(14): >>42551260 #>>42551410 #>>42551412 #>>42551513 #>>42551649 #>>42551742 #>>42552017 #>>42552046 #>>42552437 #>>42552763 #>>42555123 #>>42562686 #>>42565119 #>>42572754 #

MetaWhirledPeas ◴[30 Dec 24 17:58 UTC] No.42551742[source]▶

>>42551173 #

> Cloudflare also has a feature to block known AI bots and even suspected AI bots

In addition to other crushing internet risks, add wrongly blacklisted as a bot to the list.

replies(4): >>42551773 #>>42552921 #>>42562510 #>>42564887 #

kmeisthax ◴[31 Dec 24 23:00 UTC] No.42562510[source]▶

>>42551742 #

This is already a thing for basically all of the second[0] and third worlds. A non-trivial amount of Cloudflare's security value is plausible algorithmic discrimination and collective punishment as a service.

[0] Previously Soviet-aligned countries; i.e. Russia and eastern Europe.

replies(5): >>42562599 #>>42563762 #>>42564357 #>>42566973 #>>42567500 #

ls612 ◴[31 Dec 24 23:18 UTC] No.42562599[source]▶

>>42562510 #

People hate collective punishment because it works so well.

replies(5): >>42562792 #>>42563310 #>>42563642 #>>42563761 #>>42563805 #

eckesicle ◴[31 Dec 24 23:57 UTC] No.42562792[source]▶

>>42562599 #

Anecdatally, by default, we now block all Chinese and Russian IPs across our servers.

After doing so, all of our logs, like ssh auth etc, are almost completely free and empty of malicious traffic. It’s actually shocking how well a blanket ban worked for us.

replies(5): >>42562837 #>>42563023 #>>42567554 #>>42569757 #>>42574189 #

1. citrin_ru ◴[01 Jan 25 22:12 UTC] No.42569757[source]▶

>>42562792 #

Being slightly annoyed by noise in SSH logs I’ve blocked APNIC IPs and now see a comparable number of brute force attempts from ARIN IPs (mostly US ones). Geo blocks are totally ineffective against TAs which use a global network of proxies.

↑