←back to thread

550 points polskibus | 1 comments | | HN request time: 0s | source
Show context
jordan801 ◴[] No.19116099[source]
Anyone who has written a few scrappers knows how brutally ineffective this is. Yelp tried to pull the same thing and it took me about 3 minutes to rectify my "for fun" scraper. It's also really not that difficult to write a smart scraper that you say, "Look for these things in this post. However you find them, replicate it for the others". Which is ultimately what I made my Yelp scraper do.

If there's a pattern, I will find it, and I will exploit it. <3

replies(8): >>19116147 #>>19116340 #>>19116656 #>>19116724 #>>19117143 #>>19117402 #>>19117423 #>>19121248 #
singron ◴[] No.19117402[source]
It actually seems pretty effective in this case. This uBlock Origin issue has been open for half a year and none of the default lists block the ads: https://github.com/uBlockOrigin/uAssets/issues/3367
replies(1): >>19118064 #
robryan ◴[] No.19118064[source]
Extensions like ublock having public block lists makes it even easier for Facebook to write something that breaks the filter.
replies(1): >>19118386 #
1. IshKebab ◴[] No.19118386[source]
Yeah and Facebook has far more resources than uBlock to fight the cat-and-mouse. So it is effective.