
454 points positiveblue | 7 comments
TIPSIO ◴[] No.45066555[source]
Everyone loves the dream of a free-for-all, open web.

But the reality is how can someone small protect their blog or content from AI training bots? E.g., are they supposed to blindly trust that a crawler honestly identifies itself as an agent rather than a training bot, and that it is super duper respecting robots.txt? Get real...
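
To make the trust problem concrete, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt content, the bot name `ExampleTrainingBot`, and the `blog.example` URL are all made up for illustration. The thing to notice is that the check runs inside the crawler, against whatever user-agent string the crawler chooses to report.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for a small blog: block one (made-up) training
# crawler by name and allow everyone else.
ROBOTS_TXT = """\
User-agent: ExampleTrainingBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(user_agent: str, url: str) -> bool:
    """Return what a well-behaved crawler would conclude from robots.txt."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://blog.example/post"  # hypothetical URL
    # An honest training bot is told to stay out...
    print(is_allowed("ExampleTrainingBot", url))
    # ...but the same bot reporting a different name is waved through.
    print(is_allowed("Mozilla/5.0", url))
```

Nothing on the server side is involved here: robots.txt only works if the crawler both runs this check and tells the truth about who it is.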

Or fine, what if they do respect robots.txt, but then buy the data, which may or may not have been shielded through liability layers, as "licensed data"?

Unless you're Reddit, X, Google, or Meta with scary, unlimited-budget legal teams, you have no power.

Great video: https://www.youtube.com/shorts/M0QyOp7zqcY

1. andy99 ◴[] No.45066979[source]
> But the reality is how can someone small protect their blog or content from AI training bots?

A paywall.

In reality, what some want is to get all the benefits of having their content on the open internet while still controlling who gets to access it. That is the root cause here.

replies(2): >>45067033 #>>45067053 #
2. littlecranky67 ◴[] No.45067033[source]
This. We need to get rid of the ad-supported free internet economy. If you want your content to be free, you release it and have no issues with AI. If you want to make money off your content, add a paywall.

We need micropayments going forward; Lightning (with Bitcoin as the backend) could be the solution.

replies(1): >>45067113 #
3. notatoad ◴[] No.45067053[source]
Which is really all that Cloudflare is building here that people are mad about. It’s a way to give bots access to paywalled content.
replies(1): >>45067204 #
4. rustc ◴[] No.45067113[source]
> If you want your content to be free, you release it and have no issues with AI. If you want to make money off your content, add a paywall.

What about licenses like CC-BY-NC (Creative Commons - Non Commercial)?

replies(1): >>45068234 #
5. positiveblue ◴[] No.45067204[source]
Where everyone needs a Cloudflare account to be able to pay*
replies(1): >>45067278 #
6. notatoad ◴[] No.45067278{3}[source]
“Everyone” in this context being bot operators who want to access websites that have decided to use Cloudflare to block unauthenticated bot traffic.

Which is not everyone.

7. notpushkin ◴[] No.45068234{3}[source]
What about them? As we can see, scrapers don’t care about copyright at all, so public licenses don’t really matter to them either.