←back to thread

439 points diggan | 1 comments | | HN request time: 0s | source
Show context
AlecSchueler ◴[] No.45062904[source]
Am I the only one that assumed everything was already being used for training?
replies(9): >>45062929 #>>45063168 #>>45063951 #>>45064966 #>>45065323 #>>45065428 #>>45065912 #>>45066950 #>>45070135 #
Aurornis ◴[] No.45065912[source]
I don't understand this mindset. Why would you assume anything? It took me a couple minutes at most to check when I first started using Claude.

I check when I start using any new service. The cynical assumption that everything's being shared leads to shrugging it off and making no attempt to look for settings.

It only takes a moment to go into settings -> privacy and look.

replies(7): >>45065932 #>>45065968 #>>45066053 #>>45066125 #>>45068206 #>>45068998 #>>45070223 #
lbrito ◴[] No.45066125[source]
>Why would you assume anything?

Because they already used data without permission on a much larger scale, so it's a perfectly logical assumption that they would continue doing so with their users?

replies(1): >>45067797 #
simonw ◴[] No.45067797[source]
I don't think that logically makes sense.

Training on everything you can publicly scrape from the internet is a very different thing from training on data that your users submit directly to your service.

replies(2): >>45069962 #>>45070009 #
1. rpgbr ◴[] No.45069962[source]
>Training on everything you can publicly scrape from the internet is a very different thing from training on data that your users submit directly to your service.

Yes. It's way easier and cheaper when the data comes to you instead of having to scrape everything elsewhere.