Most active commenters

ljosifov(21)
(20)
SoftTalker(11)
Aurornis(9)
diggan(8)
soraminazuki(7)
I_am_tiberius(7)
demarq(6)
const_cast(6)
benterix(6)

Popular/hot comments

>>45062852 #
>>45064773 #
>>45063736 #
>>45062982 #
>>45064530 #
>>45062871 #
>>45063262 #
>>45062930 #
>>45062941 #
>>45064455 #
>>45064920 #
>>45062975 #
>>45063064 #
>>45062782 #
>>45063116 #
>>45062905 #
>>45063738 #
>>45063343 #
>>45064143 #
>>45063306 #

Updates to Consumer Terms and Privacy Policy

(www.anthropic.com)

1. aurareturn ◴[29 Aug 25 11:45 UTC] No.45062782[source]▶

Just opened Claude app on Mac and saw a popup asking me if it's ok to train on my chats. It's on by default. Unchecked it.

I think Claude saw that OpenAI was reaping too much benefit from this so they decided to do it too.

replies(5): >>45062800 #>>45062824 #>>45062865 #>>45063224 #>>45065138 #

2. demarq ◴[29 Aug 25 11:47 UTC] No.45062800[source]▶

>>45062782 #

Also your chats will now be stored for 5 years.

replies(2): >>45062821 #>>45062948 #

3. aurareturn ◴[29 Aug 25 11:52 UTC] No.45062821{3}[source]▶

>>45062800 #

I used to not care about this stuff but with the way this administration is going about things, I suddenly care very much about it.

replies(2): >>45062871 #>>45062903 #

4. ing33k ◴[29 Aug 25 11:52 UTC] No.45062822[source]▶

>>45062683 (OP) #

https://www.anthropic.com/news/updates-to-our-consumer-terms

5. staticman2 ◴[29 Aug 25 11:53 UTC] No.45062824[source]▶

>>45062782 #

Given how competitive Claude has been with ChatGPT models without training on users I'm curious how useful OpenAI could have found it.

6. BoorishBears ◴[29 Aug 25 11:53 UTC] No.45062825[source]▶

>>45062683 (OP) #

Someone's hallucination riddled Perplexity search should not be the source for this: https://www.anthropic.com/news/updates-to-our-consumer-terms

https://news.ycombinator.com/item?id=45053806

replies(1): >>45063115 #

7. troad ◴[29 Aug 25 11:56 UTC] No.45062852[source]▶

>>45062683 (OP) #

You can opt out, but the fact that it's opt-in by default and made to look like a simple T/C update prompt leaves a sour taste in my mouth. The five year retention period seems... excessive. I wonder if they've buried anything else objectionable in the new terms.

It was the kick in the pants I needed to cancel my subscription.

replies(22): >>45062875 #>>45062894 #>>45062895 #>>45062930 #>>45062936 #>>45062949 #>>45062975 #>>45063015 #>>45063070 #>>45063116 #>>45063150 #>>45063171 #>>45063186 #>>45063387 #>>45063615 #>>45064792 #>>45064955 #>>45064986 #>>45064996 #>>45066593 #>>45070194 #>>45074231 #

8. ◴[29 Aug 25 11:56 UTC] No.45062854[source]▶

>>45062683 (OP) #

9. echelon ◴[29 Aug 25 11:57 UTC] No.45062865[source]▶

>>45062782 #

We should be able to train on foundation model outputs.

These bastard companies pirated the world's data, then they train on our personal data. But they have the gall to say we can't save their model's inputs and outputs and distill their models.

replies(2): >>45062880 #>>45063053 #

10. bayindirh ◴[29 Aug 25 11:58 UTC] No.45062871{4}[source]▶

>>45062821 #

Trusting companies more than the government always feels strange. It's something I can't grasp.

replies(7): >>45062913 #>>45062918 #>>45062925 #>>45062928 #>>45062971 #>>45063033 #>>45063100 #

11. ◴[29 Aug 25 11:58 UTC] No.45062875[source]▶

>>45062852 #

12. Lionga ◴[29 Aug 25 11:58 UTC] No.45062878[source]▶

>>45062683 (OP) #

Feeling cute, might put all your private conversation into my public LLM later

13. jacooper ◴[29 Aug 25 11:59 UTC] No.45062880{3}[source]▶

>>45062865 #

You can, they might not like it but there's no legal basis saying you can't.

replies(1): >>45062919 #

14. perihelions ◴[29 Aug 25 12:00 UTC] No.45062894[source]▶

>>45062852 #

What are you replacing it with?

replies(2): >>45062939 #>>45063029 #

15. kordlessagain ◴[29 Aug 25 12:00 UTC] No.45062895[source]▶

>>45062852 #

> It was the kick in the pants I needed to cancel my subscription.

As if barely two 9s of uptime wasn't enough.

16. demarq ◴[29 Aug 25 12:01 UTC] No.45062903{4}[source]▶

>>45062821 #

It’s more that five years worth of peoples most personal conversations is an absolute treasure trove and makes their systems much more inviting for hackers and yes governments.

The part that irks me is that this includes people who are literally paying for the service.

17. I_am_tiberius ◴[29 Aug 25 12:01 UTC] No.45062905[source]▶

>>45062683 (OP) #

In my opinion, training models on user data without their real consent (real consent = e.g. the user must sign a contract or so, so he's definitely aware), should be considered a serious criminal offense.

replies(5): >>45062989 #>>45063008 #>>45063221 #>>45063771 #>>45064402 #

18. lewdwig ◴[29 Aug 25 12:01 UTC] No.45062906[source]▶

>>45062683 (OP) #

TBH I’m surprised it’s taken them this long to change their mind on this, because I find it incredibly frustrating to know that current gen agentic coding systems are incapable of actually learning anything from their interactions with me - especially when they make the same stupid mistakes over and over.

replies(3): >>45063010 #>>45063492 #>>45063725 #

19. demarq ◴[29 Aug 25 12:01 UTC] No.45062913{5}[source]▶

>>45062871 #

One has next to no consequences or oversight

20. aleph_minus_one ◴[29 Aug 25 12:02 UTC] No.45062918{5}[source]▶

>>45062871 #

Why not distrust both?! :-)

replies(1): >>45062968 #

21. datadrivenangel ◴[29 Aug 25 12:02 UTC] No.45062919{4}[source]▶

>>45062880 #

violating terms and conditions can be sufficient to be at least charged with computer abuse and fraud.

replies(1): >>45063048 #

22. elzbardico ◴[29 Aug 25 12:02 UTC] No.45062925{5}[source]▶

>>45062871 #

Trusting any of them is a luxury afforded in a short period of history in rich countries.

That's why the usual ethos in places like HN of treating any doubt about government actions as lowbrow paranoid conspiracy theory stuff, is so exasperating, for those of us who came from either the former soviet bloc or third world nations.

replies(1): >>45063405 #

23. AlecSchueler ◴[29 Aug 25 12:03 UTC] No.45062928{5}[source]▶

>>45062871 #

How many companies can disappear me to El Salvador?

replies(3): >>45063004 #>>45063089 #>>45063825 #

24. JohnnyMarcone ◴[29 Aug 25 12:03 UTC] No.45062930[source]▶

>>45062852 #

I got a pop-up when I opened the app explaining the change and an option to opt out. That seems very transparent to me.

replies(7): >>45062973 #>>45063111 #>>45063442 #>>45063450 #>>45063748 #>>45064206 #>>45064407 #

25. I_am_tiberius ◴[29 Aug 25 12:03 UTC] No.45062936[source]▶

>>45062852 #

"five year retention". If it's in a model once, it's there forever.

replies(3): >>45063032 #>>45064024 #>>45064875 #

26. ivape ◴[29 Aug 25 12:03 UTC] No.45062939{3}[source]▶

>>45062894 #

I like think using OpenRouter is better, but there’s absolutely no guarantee from any of the individual providers with respect to privacy and no logging.

27. psychoslave ◴[29 Aug 25 12:03 UTC] No.45062941[source]▶

>>45062683 (OP) #

What a surprise, a big corp collected large amount of personal data under some promises, and now reveals actually they will exploit it in completely unrelated manner.

replies(7): >>45062982 #>>45063078 #>>45063239 #>>45064031 #>>45064041 #>>45064193 #>>45064287 #

28. ◴[29 Aug 25 12:04 UTC] No.45062945[source]▶

>>45062683 (OP) #

29. pcwelder ◴[29 Aug 25 12:04 UTC] No.45062946[source]▶

>>45062683 (OP) #

Navigate to `https://claude.ai/settings/data-privacy-controls` and disable it before Sept 28. Isn't applicable to team plan.

replies(1): >>45062995 #

30. OtherShrezzing ◴[29 Aug 25 12:04 UTC] No.45062948{3}[source]▶

>>45062800 #

And there's no way to opt-out of the training, without agreeing to the 5 year retention. Anthropic has slipped so far and fast from its objective of being the ethical AI company.

replies(1): >>45063194 #

31. demarq ◴[29 Aug 25 12:05 UTC] No.45062949[source]▶

>>45062852 #

Are you sure the opt out isn’t only training? The retention does not seem affected by the toggle.

replies(2): >>45063038 #>>45063233 #

32. padolsey ◴[29 Aug 25 12:05 UTC] No.45062958[source]▶

>>45062683 (OP) #

Ugh Anthropic please don't become the villain.

replies(2): >>45063795 #>>45064028 #

33. ◴[29 Aug 25 12:06 UTC] No.45062968{6}[source]▶

>>45062918 #

34. visarga ◴[29 Aug 25 12:06 UTC] No.45062970[source]▶

>>45062683 (OP) #

I think there is amazing signal inside the chat logs. Every idea or decision taken can be analyzed in hindsight 20 messages later, or days later. Eventually a feedback signal or outcome lands back in the chat logs. That is real world idea validation. Considering the hundreds of millions of users and their diverse tasks that collect across time - this is probably the most efficient way to improve AI. I coined it the human-AI experience flywheel.

To make it respect user privacy I would use this data for training preference models, and those preference models used to finetune the base model. So the base model never sees particular user data, instead it learns to spot good and bad approaches from feedback experience. It might be also an answer to "who would write new things online if AI can just replicate it?" - the experience of human-AI work can be recycled directly through the AI model. Maybe it will speed up progress, amplifying both exploration of problems and exploitation of good ideas.

Considering OpenAI has 700M users, and worldwide there are probably over 1B users, they generate probably over 1 trillion tokens per day. Those collect in 2 places - in chat logs, for new models, and in human brains. We ingest a trillion AI tokens a day, changing how we think and work.

replies(2): >>45063145 #>>45063462 #

35. slipperydippery ◴[29 Aug 25 12:06 UTC] No.45062971{5}[source]▶

>>45062871 #

I don’t get drawing a distinction. If a company has it, there’s at least one government out there that either also already has it (some telecom companies just give them data portals, for example) or can any time they choose.

Corporate surveillance is government surveillance. Always has been.

36. cube00 ◴[29 Aug 25 12:07 UTC] No.45062973{3}[source]▶

>>45062930 #

> That seems very transparent to me.

Grabbing users during start up with the less privacy focused option preselected isn't being "very transparent"

They could have forced the user to make a choice or defaulted to not training on their content but they instead they just can't help themselves.

37. episteme ◴[29 Aug 25 12:07 UTC] No.45062975[source]▶

>>45062852 #

What will you use instead? I’m finding Claude the best experience since ChatGPT 5 is so slow and not any better answers than 4.

replies(5): >>45063056 #>>45063355 #>>45064689 #>>45065093 #>>45066512 #

38. sigmoid10 ◴[29 Aug 25 12:07 UTC] No.45062982[source]▶

>>45062941 #

The are valued at $170 Billion. Not quite the same as, but in same order of magnitude as OpenAI - while having only a single digit percent fraction of active users. They probably need to prepare for the eventual user data sellout, as it is becoming increasingly more obvious that none of the big players has a real and persistent tech leadership anymore. But millions and millions of users sharing their deepest thoughts and personal problems is gonna be worth infinitely more than all the average bot bullshit written on social media. That's also why Zuck is so incredibly desperate to get into the game. It's not about owning AI. It's about owning the world's thoughts and attention.

replies(8): >>45063158 #>>45063262 #>>45063487 #>>45063546 #>>45063592 #>>45063648 #>>45064254 #>>45064540 #

39. dhfbshfbu4u3 ◴[29 Aug 25 12:07 UTC] No.45062983[source]▶

>>45062683 (OP) #

All models are trained on data obtained without consent. Now the “good guys of AI” want users to add their chats to the stack and the users balk? Hilarious.

40. jsheard ◴[29 Aug 25 12:08 UTC] No.45062989[source]▶

>>45062905 #

Why single out user data specifically? Most of the data Anthropic and co train on was just scooped up from wherever with zero consent, not even the courtesy of a buried TOS clause, and their users were always implicitly fine with that. Forgive me for not having much sympathy when the users end up reaping what they've sown.

replies(3): >>45063012 #>>45063051 #>>45063335 #

41. cube00 ◴[29 Aug 25 12:09 UTC] No.45062995[source]▶

>>45062946 #

Make no mistake this will be coming to the team plan. The only difference is your account owner will decide for the whole team.

replies(1): >>45063074 #

42. yesbut ◴[29 Aug 25 12:09 UTC] No.45062998[source]▶

>>45062683 (OP) #

never provide these services with personal data / identifiers.

43. giraffe_lady ◴[29 Aug 25 12:10 UTC] No.45063004{6}[source]▶

>>45062928 #

Well relatedly I think several of the tech billionaires considered this question and decided the answer was "not enough."

44. Rygian ◴[29 Aug 25 12:10 UTC] No.45063008[source]▶

>>45062905 #

It already is. See Art. 5. 1.(b) here: https://gdpr-info.eu/art-5-gdpr/

replies(2): >>45063025 #>>45063040 #

45. nicce ◴[29 Aug 25 12:10 UTC] No.45063010[source]▶

>>45062906 #

Or get more value from the users with the same subscription price. I doubt they are giving any discounts.

replies(1): >>45063458 #

46. I_am_tiberius ◴[29 Aug 25 12:11 UTC] No.45063012{3}[source]▶

>>45062989 #

100 % true.

47. smallerfish ◴[29 Aug 25 12:11 UTC] No.45063015[source]▶

>>45062852 #

Settings > Privacy > Privacy Settings

replies(1): >>45063052 #

48. tanh ◴[29 Aug 25 12:11 UTC] No.45063016[source]▶

>>45062683 (OP) #

It's not just chats, is it? It says "coding sessions" too.

49. andrewstuart ◴[29 Aug 25 12:11 UTC] No.45063022[source]▶

>>45062683 (OP) #

Whatever your secret projects and plans are, they’re now available to train up your competitors on what you’re doing!

50. nicce ◴[29 Aug 25 12:12 UTC] No.45063025{3}[source]▶

>>45063008 #

Is ”Accept” in cookie box good enough contract?

replies(1): >>45063043 #

51. troad ◴[29 Aug 25 12:12 UTC] No.45063029{3}[source]▶

>>45062894 #

Two weeks left in the sub to figure it out, but I'm not yet sure. I was never all in on all the tooling, I mostly used it as smart search (e.g. ImageMagick incantations) and for trivial scripting that I couldn't be bothered writing myself, so I might just stick to whatever comes with Kagi, see if that doesn't cover me.

replies(2): >>45063084 #>>45063244 #

52. Hnrobert42 ◴[29 Aug 25 12:12 UTC] No.45063032{3}[source]▶

>>45062936 #

Is that true? Do models get rebuilt from scratch each time or do they get iterated on?

replies(1): >>45063058 #

53. twoquestions ◴[29 Aug 25 12:12 UTC] No.45063033{5}[source]▶

>>45062871 #

I 90% agree with you, though Apple did stand up to the FBI some years ago. The US gov't at least is much more restricted on what data it can collect and act on due to the 4th Amendment among other laws, and as another commenter said Apple can't blackbag me to El Salvador.

Apple/FBI story in question: https://apnews.com/general-news-c8469b05ac1b4092b7690d36f340...

replies(1): >>45063060 #

54. jasona123 ◴[29 Aug 25 12:12 UTC] No.45063038{3}[source]▶

>>45062949 #

From the PR update: https://www.anthropic.com/news/updates-to-our-consumer-terms

“If you do not choose to provide your data for model training, you’ll continue with our existing 30-day data retention period.“

From the support page: https://privacy.anthropic.com/en/articles/10023548-how-long-...

“If you choose not to allow us to use your chats and coding sessions to improve Claude, your chats will be retained in our back-end storage systems for up to 30 days.”

55. I_am_tiberius ◴[29 Aug 25 12:13 UTC] No.45063040{3}[source]▶

>>45063008 #

I believe that only concerns European users. Moreover, I believe a simple press of an OK button is fine with GDPR. This data (type and volume) however, is way more serious and can't be agreed on by just pressing a button.

56. I_am_tiberius ◴[29 Aug 25 12:13 UTC] No.45063043{4}[source]▶

>>45063025 #

57. echelon ◴[29 Aug 25 12:13 UTC] No.45063048{5}[source]▶

>>45062919 #

That's disgusting.

We need a Galoob vs. Nintendo [1], Sony vs. Universal [2], or whatever that TiVo case was (I don't think it was TiVo vs. EchoStar). A case that establishes anyone can scrape and distill models.

[1] https://en.wikipedia.org/wiki/Lewis_Galoob_Toys,_Inc._v._Nin....

[2] https://en.wikipedia.org/wiki/Sony_Corp._of_America_v._Unive....

58. perihelions ◴[29 Aug 25 12:14 UTC] No.45063051{3}[source]▶

>>45062989 #

Training on private user interactions is a privacy violation; training on public, published texts is (some argue) an intellectual property violation. They're very different kinds of moral rights.

replies(2): >>45063481 #>>45064161 #

59. kossTKR ◴[29 Aug 25 12:14 UTC] No.45063052{3}[source]▶

>>45063015 #

i don't see any setting related to this? just:

Export data

Shared chats

Location metadata

Review and update terms and conditions

I'm in the EU, maybe that's helping me?

replies(1): >>45063126 #

60. elzbardico ◴[29 Aug 25 12:14 UTC] No.45063053{3}[source]▶

>>45062865 #

I am pretty sure they try to do it all the time between themselves. Most of the real sauce in AI coding comes from reinforcement learning, usually done by armies of third world outsourced developers tediously doing all kinds of tasks with instructions to detail their reasoning behind each chance. Things like: "to run this python test in a docker container with the python image we need to install the python package xyz, but then, as it has some native code, we also need to install build-essential..."

While those developers are not well paid (usually around 30/40 USD hour, no benefits), you need a lot of then, so, it is a big temptation to create also as much synthetic data sets from your more capable competitor.

Given the fact that AI companies have this Jihad zeal to achieve their goals no matter what (like, fuck copyright, fuck the environment, etc, etc), it would be naive to believe they don't at least try to do it.

And even if they don't do it directly, their outsourced developers will do it indirectly by using AI to help with their tasks.

replies(1): >>45063148 #

61. teekert ◴[29 Aug 25 12:14 UTC] No.45063056{3}[source]▶

>>45062975 #

Granted, it is a stretch and not near the features of Claude (no code etc), but at least Proton's Lumo [0] is very privacy oriented.

I have to admit, I've used it a bit over the last days and still reactivated my Claude pro subscription today so... Let's say it's ok for casual stuff? Also useful for casual coding questions. So if you care about it, it's an option.

[0] https://lumo.proton.me/

62. I_am_tiberius ◴[29 Aug 25 12:15 UTC] No.45063058{4}[source]▶

>>45063032 #

I believe the big models currently get built from scratch (with random starting weights). That wasn't my point though. I meant a model created once, might be used for a very long time. Maybe they even release the weights at one point ("open source").

replies(1): >>45063121 #

63. tinyhouse ◴[29 Aug 25 12:15 UTC] No.45063059[source]▶

>>45062683 (OP) #

GO to setting and under privacy you can change the default. (they make it pretty hard to find). Also note "To help us improve our AI models and safety protections, we’re extending data retention to 5 years."

replies(1): >>45064623 #

64. merelysounds ◴[29 Aug 25 12:15 UTC] No.45063061[source]▶

>>45062683 (OP) #

> unless users actively opt out by September 28th

Looks like there is an opt out option. Curious about the EU users - would that be off by default (so: opt in)?

replies(1): >>45065122 #

65. bayindirh ◴[29 Aug 25 12:15 UTC] No.45063060{6}[source]▶

>>45063033 #

Apple is an exception, and even that is debatable because of the unencrypted backups they store.

On the other hand, what Apple did is a tangible thing and is a result.

This gives them better optics for now, but there is no law says that they can't change.

Their business model is being an "accessible luxury brand with the privacy guarantee of Switzerland as the laws allow". So, as another argument, they have to do this.

66. JCM9 ◴[29 Aug 25 12:15 UTC] No.45063064[source]▶

>>45062683 (OP) #

Not a surprise. All the major players have reached the limits of training on existing data—they’re already training on essentially the whole internet plus a bunch of content they allegedly stole (hence various lawsuits). There haven’t been any major breakthroughs in model architecture from the major players recently and thus they’re now in a battle for more data to train on. They need data, and they want YOUR data, now, and are gonna do increasingly shady things to get it.

replies(5): >>45063645 #>>45063676 #>>45063696 #>>45064759 #>>45064804 #

67. wzdd ◴[29 Aug 25 12:16 UTC] No.45063070[source]▶

>>45062852 #

Everywhere else in Anthropic's interface, yes/no switches show blue when enabled and black when disabled. In the box they're showing about this change the slider shows grey in both states: visit it in preferences to see the difference! It's not just disappointing but also kind of sad that someone went to the effort to do this.

replies(3): >>45063117 #>>45063179 #>>45065374 #

68. smokel ◴[29 Aug 25 12:16 UTC] No.45063074{3}[source]▶

>>45062995 #

What do you base this on? Typically it's plans like these that offer more privacy, because people are willing to pay a premium for that.

replies(1): >>45063197 #

69. AIPedant ◴[29 Aug 25 12:16 UTC] No.45063078[source]▶

>>45062941 #

Nobody could have predicted that someone who worked for Baidu, Google, and OpenAI would found a company like this.

70. perihelions ◴[29 Aug 25 12:17 UTC] No.45063084{4}[source]▶

>>45063029 #

How does Kagi (claim that they) enforce privacy rights on the major LLM providers? Have they negotiated a special contract?

I'm looking at

> "When you use the Assistant by Kagi, your data is never used to train AI models (not by us or by the LLM providers), and no account information is shared with the LLM providers. By default, threads are deleted after 24 hours of inactivity. This behavior can be adjusted in the settings."

https://help.kagi.com/kagi/ai/assistant.html#privacy

And trying to reconcile those claims with the instant thread. Anthropic is listed as one of their back-end providers. Is that data retained for five years on Anthropic's end, or 24 hours? Is that data used for training Anthropic models, or has Anthropic agreed in writing not to, for Kagi clients?

replies(2): >>45063123 #>>45063321 #

71. sillyfluke ◴[29 Aug 25 12:17 UTC] No.45063089{6}[source]▶

>>45062928 #

"US Army appoints Palantir, Meta, OpenAI execs as Lt. Colonels" [0]

Well, probably easier than you think. Given that it looks like Palantir is able to control the software and hardware of the new fangled detention centers with immunity, how difficult do you think it is for them to disappear someone without any accountability?

It is precisely the blurring of the line between gov and private companies that aid in subverting the rule of law in many instances.

[0] https://thegrayzone.com/2025/06/18/palantir-execs-appointed-...

replies(2): >>45063330 #>>45077638 #

72. dmezzetti ◴[29 Aug 25 12:17 UTC] No.45063093[source]▶

>>45062683 (OP) #

If you don't like this, use local AI.

73. irthomasthomas ◴[29 Aug 25 12:17 UTC] No.45063094[source]▶

>>45062683 (OP) #

How long before some poor kids chocolate pudding fetish makes it to the model and they get bullied at school for it? Treat LLM chats as public.

replies(2): >>45063152 #>>45063740 #

74. sokoloff ◴[29 Aug 25 12:18 UTC] No.45063100{5}[source]▶

>>45062871 #

The government has the direct power to imprison me or seize my property if cross them.

It seems strange to not be able to grasp the difference in kind here.

replies(2): >>45063136 #>>45063861 #

75. throw310822 ◴[29 Aug 25 12:18 UTC] No.45063101[source]▶

>>45062683 (OP) #

Frankly I don't mind much if they use my js debugging conversation to train the next models; but there should be a way to mark specific conversations as private (possibly even an "incognito mode") to exclude them from any training or external access.

76. elashri ◴[29 Aug 25 12:19 UTC] No.45063111{3}[source]▶

>>45062930 #

> That seems very transparent to me

Implicit consent is not transparent and should be illegal in all situations. I can't tell you that unless you opt out, You have agreed to let me rent you apartment.

You can say analogy is not straightforward comparable but the overall idea is the same. If we enter a contract for me to fix your broken windows, I cannot extend it to do anything else in the house I see fit with Implicit consent.

replies(2): >>45067270 #>>45073380 #

77. op00to ◴[29 Aug 25 12:19 UTC] No.45063115[source]▶

>>45062825 #

The perplexity search seemed pretty solid to me. What hallucinations did you see?

replies(1): >>45063143 #

78. merelysounds ◴[29 Aug 25 12:19 UTC] No.45063116[source]▶

>>45062852 #

> opt-in by default

Nitpicking: “opt in by default” doesn’t exist, it’s either “opt in”, or “opt out”; this is “opt out”. By definition an “opt out” setting is selected by default.

replies(5): >>45063357 #>>45064080 #>>45064709 #>>45064980 #>>45065703 #

79. riz_ ◴[29 Aug 25 12:20 UTC] No.45063117{3}[source]▶

>>45063070 #

This is probably because there are laws in some countries that restrict how these buttons/switches can look (think cookie banners, where sometimes there is a huge green button to accept, and a tiny greyed out text somewhere for the settings).

replies(1): >>45064227 #

80. ◴[29 Aug 25 12:20 UTC] No.45063121{5}[source]▶

>>45063058 #

81. vinnyorvinny ◴[29 Aug 25 12:20 UTC] No.45063123{5}[source]▶

>>45063084 #

There is an option to opt out right? So I assume they just make sure to always opt out.

replies(1): >>45063129 #

82. croes ◴[29 Aug 25 12:20 UTC] No.45063126{4}[source]▶

>>45063052 #

Have you clicked "Review and update terms and conditions"?

It's part of the update

replies(1): >>45063146 #

83. throwaway290 ◴[29 Aug 25 12:20 UTC] No.45063127[source]▶

>>45062683 (OP) #

As soon as Apple said it woll ship it in Xcode? what a coincidence!

84. ◴[29 Aug 25 12:21 UTC] No.45063129{6}[source]▶

>>45063123 #

85. ghusto ◴[29 Aug 25 12:21 UTC] No.45063130[source]▶

>>45062683 (OP) #

What about paid corporate accounts?

If I'm not paying for something, I presume this is the kind of thing that's happening, so this isn't newsworthy to me. Is it also applicable for paid and paid corporate accounts?

replies(2): >>45063180 #>>45066143 #

86. bayindirh ◴[29 Aug 25 12:21 UTC] No.45063136{6}[source]▶

>>45063100 #

What happens if your Google account is locked out because you shared your son's pictures to his M.D. because of an ongoing treatment?

What happens the same company locks all your book drafts because an algorithm deemed that you're plotting something against someone?

Both are real events, BTW.

replies(1): >>45063160 #

87. Silhouette ◴[29 Aug 25 12:22 UTC] No.45063141[source]▶

>>45062683 (OP) #

Am I the only one who finds the branding and privacy policies around these AI services (possibly deliberately) confusing?

For example Anthropic have an Anthropic Console that they appear to consider quite distinct from Claude.ai. Do these share a privacy policy and related settings? How do either of these fit in with the named plans like Pro and Max? What are you actually paying for when you give them money for the various different things they charge for? Is all API use under their Commercial Terms even if it's a personal account that is otherwise under the Consumer Terms? Why isn't all of this obvious and transparent to users?

OpenAI don't seem to be any better. I only just learned from this HN discussion that they train on personal account conversations. As someone privacy-conscious who has used ChatGPT - even if only a few times for experiments - I find the fact that this wasn't very clearly stated up front to be extremely disturbing. If I'd known about it I would certainly have switched off the relevant setting immediately.

I get that these organisations have form for training on whatever they can get their hands on whether dubiously legal or not. But training on users' personal conversations or code feels like something that should require a very clear and explicit opt-in. In fact I don't see how they can legally not have that first in places like the EU and UK that have significant data protection legislation.

replies(1): >>45064204 #

88. BoorishBears ◴[29 Aug 25 12:22 UTC] No.45063143{3}[source]▶

>>45063115 #

If there's a primary source like the one I just shared and you link me to an AI summary of 28 sources, I'm treating the entire package as garbage.

replies(1): >>45075107 #

89. croes ◴[29 Aug 25 12:23 UTC] No.45063145[source]▶

>>45062970 #

Your ideas for other users.

90. kossTKR ◴[29 Aug 25 12:23 UTC] No.45063146{5}[source]▶

>>45063126 #

Oh i see thanks. That's a dark design pattern, hiding stuff like that.

No one cares about anything else but they have lots of superflous text and they are calling it "help us get better", blah blah, it's "help us earn more money and potentially sell or leak your extremely private info", so they are lying.

Considering cancelling my subscription right this moment.

I hope EU at leat considers banning or extreme-fining companies trying to retroactively use peoples extremely private data like this, it's completely over the line.

replies(1): >>45063538 #

91. sokoloff ◴[29 Aug 25 12:23 UTC] No.45063148{4}[source]▶

>>45063053 #

> those developers are not well paid (usually around 30/40 USD hour, no benefits)

$40/hour for a full time would put you just over the median household income for the US.

I suspect this provides quite a good living for their family and the devs doing the work feel like they’re well-paid.

replies(1): >>45064403 #

92. monegator ◴[29 Aug 25 12:23 UTC] No.45063150[source]▶

>>45062852 #

I'm super duper sure that my data won't be stored and eventually used if i opt out

93. squigz ◴[29 Aug 25 12:24 UTC] No.45063152[source]▶

>>45063094 #

This is remarkably specific.

94. Imustaskforhelp ◴[29 Aug 25 12:24 UTC] No.45063158{3}[source]▶

>>45062982 #

The last time my brother and I were discussing about anthropic, they were worth 90B$, and that was a month ago, he asked chatgpt in the middle of the conversation, either it was a sneaky sabotage from gpt or my memory is fuzzy but I thought that 90b$ was really underrated for anthropic given the scaleAi deal or windsurf/cursor deals.

replies(1): >>45063257 #

95. sokoloff ◴[29 Aug 25 12:25 UTC] No.45063160{7}[source]▶

>>45063136 #

I think I missed the part where Google imprisoned someone.

The government forces me to do business with them; if I don't pay them tens (and others hundreds) of thousands of dollars every year they will send people with guns to imprison me and eventually other people with guns to seize my property.

Me willingly giving Google some data and them capriciously deciding to not always give it back doesn't seem anything like the same to me. (It doesn't mean I like what Google's doing, but they have nowhere near the power of the group that legally owns and uses tanks.)

replies(1): >>45063213 #

96. Joker_vD ◴[29 Aug 25 12:26 UTC] No.45063171[source]▶

>>45062852 #

> You can opt out

You can say that you want to opt out. What Anthropic will decide to do with your declaration is a different question.

replies(1): >>45064103 #

97. senko ◴[29 Aug 25 12:27 UTC] No.45063179{3}[source]▶

>>45063070 #

Just did and it behaves as expected for me in the Android app (ie. not the dark pattern you described)

replies(1): >>45064551 #

98. smca ◴[29 Aug 25 12:27 UTC] No.45063180[source]▶

>>45063130 #

Does not apply to team/enterprise/education or the API.

99. ◴[29 Aug 25 12:27 UTC] No.45063186[source]▶

>>45062852 #

100. smca ◴[29 Aug 25 12:28 UTC] No.45063194{4}[source]▶

>>45062948 #

> If you do not choose to provide your data for model training, you’ll continue with our existing 30-day data retention period.

https://www.anthropic.com/news/updates-to-our-consumer-terms

101. cube00 ◴[29 Aug 25 12:28 UTC] No.45063197{4}[source]▶

>>45063074 #

To consent to sharing your data for training Docusign AI models, your organization (as the account owner) must first agree to share data through your Docusign service agreement.

After providing consent, the setting will be turned on by default. [1]

[1]: https://support.docusign.com/s/document-item?language=en_US&...

replies(1): >>45063290 #

102. mutkach ◴[29 Aug 25 12:28 UTC] No.45063203[source]▶

>>45062683 (OP) #

How would they not "share information with third parties". You need to sift through the data to make it even remotely useful for "training". You absolutely need to share it with either Amazon (for Mechanical Turk) or with Scale AI.

I am wondering how would you use a chat transcript for training? Unless it is massive, possibly private codebases that are constantly getting piped into Claude Code right now. In that case, that would make sense.

replies(1): >>45063475 #

103. bayindirh ◴[29 Aug 25 12:30 UTC] No.45063213{8}[source]▶

>>45063160 #

Their life effectively stopped since they are locked out of everything, forever. Not forgetting that the first guy's son's pictures are ended in a CSAM database and he lost his account permanently, and Google didn't give his account back [0].

A company "applied what the law said", and refused that they made a mistake and overreached. Which is generally attributed to governments.

So, I you missed the effects of this little binary flag on their life.

[0]: https://www.theguardian.com/technology/2022/aug/22/google-cs...

replies(1): >>45063293 #

104. demarq ◴[29 Aug 25 12:30 UTC] No.45063220[source]▶

>>45062683 (OP) #

I’m proposing this, the ability to mark certain chats as non trainable. Like an incognito mode. If a chat is not marked as that after 5 days it can be retained for training.

And this is only for free users, paid users should never have to think about this.

replies(1): >>45063532 #

105. happosai ◴[29 Aug 25 12:30 UTC] No.45063221[source]▶

>>45062905 #

I think it's cute people believe companies that trained their models with every single book and online page ever written without consents from authors (and often against the explicit request of the author without any opt-out) won't do a rugg-pull and do it also to all the chats they have aquired...

replies(2): >>45063234 #>>45064517 #

106. fusslo ◴[29 Aug 25 12:31 UTC] No.45063224[source]▶

>>45062782 #

My work just signed to an enterprise agreement with anthropic. I just checked, and "Your data will not be trained on or used to improve the product. Code is stored to personalize your experience. Applies to all team members."

107. bgwalter ◴[29 Aug 25 12:31 UTC] No.45063229[source]▶

>>45062683 (OP) #

The Perplexity report is vibe-written and an excellent example that "AI" is entirely unreliable. At the time I am making this comment, it states:

"Anthropic also reported discovering North Korean operatives using Claude to fraudulently obtain remote employment positions at Fortune 500 technology companies, leveraging the AI to pass technical interviews and maintain positions despite lacking basic coding skills."

Note that in this version the North Koreans lack basic coding skills, which took me by surprise. Generally they are assumed to be highly competent.

The original (https://www.anthropic.com/news/detecting-countering-misuse-a...) is completely different:

"Our Threat Intelligence report discusses several recent examples of Claude being misused, including a large-scale extortion operation using Claude Code, a fraudulent employment scheme from North Korea, and the sale of AI-generated ransomware by a cybercriminal with only basic coding skills. We also cover the steps we’ve taken to detect and counter these abuses."

This is what people are using for web search. I'm not targeting Perplexity specifically, Google "AI" summaries are just as bad.

UPDATE: The original pdf says something different again (https://www-cdn.anthropic.com/b2a76c6f6992465c09a6f2fce282f6...):

"The most striking finding is the actors’ complete dependency on AI to function in technical roles. These operators do not appear to be able to write code, debug problems, or even communicate professionally without Claude’s assistance. Yet they’re successfully maintaining employment at Fortune 500 companies (according to public reporting) passing technical interviews, and delivering work that satisfies their employers. This represents a new paradigm where technical competence is simulated rather than possessed."

This should be distributed among managers so that they finally get the truth about "AI".

replies(1): >>45063989 #

108. zenmaster10665 ◴[29 Aug 25 12:32 UTC] No.45063233{3}[source]▶

>>45062949 #

it seems really badly designed or maybe it is meant to be confusing. It does not make it clear that the two are linked together, and you have to "accept" the both together even though there is only a toggle on the "help us make the model better" item.

109. fHr ◴[29 Aug 25 12:32 UTC] No.45063234{3}[source]▶

>>45063221 #

Yeah people are gullible these days. We need another full 2008 crash that hurts bad before people wake up for a bit before becomming like this again.

replies(2): >>45063342 #>>45063498 #

110. raldi ◴[29 Aug 25 12:32 UTC] No.45063239[source]▶

>>45062941 #

“These updates will apply only to new or resumed chats and coding sessions.”

https://www.anthropic.com/news/updates-to-our-consumer-terms

replies(1): >>45063343 #

111. fnordlord ◴[29 Aug 25 12:33 UTC] No.45063244{4}[source]▶

>>45063029 #

I'm mostly replying because I was truly using it for an ImageMagick incantation yesterday. I use the API rather than chat, if that's an option for you. I put $20 into it every few months and it mostly does what I need. I'm using Raycast for quick and dirty questions and AnythingLLM for longer conversations.

112. quantumwoke ◴[29 Aug 25 12:33 UTC] No.45063248[source]▶

>>45062683 (OP) #

Can we change the url to https://www.anthropic.com/news/updates-to-our-consumer-terms instead of the slop post

113. sigmoid10 ◴[29 Aug 25 12:34 UTC] No.45063257{4}[source]▶

>>45063158 #

>I thought that 90b$ was really underrated for anthropic

That was true when the tech leadership was an open question and it seemed like any one of the big players could make a breakthrough at any moment that would propel them to the top. Nowadays it has pattered out and the market is all about sustainable user growth. In that sense Anthropic is pretty overvalued, at least if you think that OpenAI's valuation is legit. And if you think OpenAI is overvalued, then Anthropic would be a no-go zone as an investor.

replies(1): >>45063708 #

114. goalieca ◴[29 Aug 25 12:35 UTC] No.45063262{3}[source]▶

>>45062982 #

Companies all seem to turn against their users whenever they have revenue/earnings trouble.

replies(7): >>45063281 #>>45063308 #>>45063361 #>>45063477 #>>45064072 #>>45064300 #>>45064448 #

115. grim_io ◴[29 Aug 25 12:35 UTC] No.45063264[source]▶

>>45062683 (OP) #

Opt-in would be much better, of course.

To put it in perspective: google won't even give you an option to opt out.

If you pay for Gemini as a private user and not as a corporation, you are fair game for google.

Now, neither option is good. But one is still much worse.

replies(2): >>45063892 #>>45063900 #

116. c080 ◴[29 Aug 25 12:35 UTC] No.45063266[source]▶

>>45062683 (OP) #

honest questions, what is the value of training over user chat? the answers are already provided by your LLM!

replies(1): >>45063554 #

117. jsheard ◴[29 Aug 25 12:36 UTC] No.45063281{4}[source]▶

>>45063262 #

Considering every AI company is hemorrhaging money with no end in sight, that doesn't bode well, does it?

118. picafrost ◴[29 Aug 25 12:37 UTC] No.45063285[source]▶

>>45062683 (OP) #

The AI fever dream of unbounded training and inference is ending and reality is settling in. Anthropic has had the most reasonable business model of the AI players with their focus on code, as far as I can tell, but it still won't be enough.

There's no such thing as a free lunch, but even when I am a paying customer my data is taken as gratuity and used (+ spread around!) in extremely opaque ways. I am tired of it. Honestly, I'm just getting tired of the internet.

replies(1): >>45072683 #

119. dewey ◴[29 Aug 25 12:37 UTC] No.45063290{5}[source]▶

>>45063197 #

So you state that as a fact based on the ToS of an unrelated company?

replies(1): >>45063479 #

120. sokoloff ◴[29 Aug 25 12:38 UTC] No.45063293{9}[source]▶

>>45063213 #

> Their life effectively stopped since they are locked out of everything

What?! Google locked them out of Google. I'm sure they can still get search, email, and cloud services from many other providers.

The government can lock you away in a way that is far more impactful and much closer to "life stopped; locked out of everything" than "you can't have the data you gave us back".

replies(1): >>45063591 #

121. aosaigh ◴[29 Aug 25 12:39 UTC] No.45063306[source]▶

>>45062683 (OP) #

Does this include any code base you are running Claude Code with (where parts of code are sent as part of the context)? I'm not hugely clear on how my private codebase is exposed to Claude in the first place when using Claude Code.

replies(4): >>45064120 #>>45064332 #>>45064667 #>>45064844 #

122. diggan ◴[29 Aug 25 12:39 UTC] No.45063308{4}[source]▶

>>45063262 #

It seems to me like some fundamental/core technologies/services just shouldn't be run by for-profit entities, and if come across one doing that, you need to carefully choose if you want to start being beholden to such entity.

As the years go by, I'm finding myself being able to rely on those less and less, because every time I do, I eventually get disappointed by them working against their user base.

replies(1): >>45063348 #

123. sorrythanks ◴[29 Aug 25 12:39 UTC] No.45063309[source]▶

>>45062683 (OP) #

Am I reading an AI summary of an article about a press release?

124. FergusArgyll ◴[29 Aug 25 12:40 UTC] No.45063321{5}[source]▶

>>45063084 #

They are using llm's through the API where it's the b2b world and you can get privacy

125. AlecSchueler ◴[29 Aug 25 12:40 UTC] No.45063330{7}[source]▶

>>45063089 #

Oh I have no doubt those lines are becoming more and more blurred and that certain big companies in key positions are theoretically beyond accountability.

But the question was "why trust a company and not the government?"

So even now it's between:

  * A company who, if big enough and in a key position, could theoretically do this

And

  * A government who we know for sure have grabbed multiple people off the streets, within the past month, and have trafficked them out of the country without any due process.

So it's still "could maybe do harm" versus "already controls an army of masked men who are undeniably active in doing harm."

replies(2): >>45063473 #>>45063813 #

126. __MatrixMan__ ◴[29 Aug 25 12:41 UTC] No.45063335{3}[source]▶

>>45062989 #

Publishing something is considered by most to be sufficient consent for it to be not considered private.

I realize there's a whole legal quagmire here involved with intellectual "property" and what counts as "derivative work", but that's a whole separate (and dubiously useful) part of the law.

replies(1): >>45063793 #

127. FergusArgyll ◴[29 Aug 25 12:41 UTC] No.45063342{4}[source]▶

>>45063234 #

Or we can root for happiness and prosperity instead

replies(1): >>45063392 #

128. benterix ◴[29 Aug 25 12:41 UTC] No.45063343{3}[source]▶

>>45063239 #

What kind of guarantee do we have this is true?

Meta downloaded copyrighted content and trained their models on it, OpenAI did the same.

Uber developed Greyball to cheat the officials and break the law.

Tesla deletes accident data and reports to the authorities they don't have it.

So forgive me I have zero trust in whatever these companies say.

replies(5): >>45063418 #>>45063536 #>>45063639 #>>45063846 #>>45063974 #

129. I_am_tiberius ◴[29 Aug 25 12:42 UTC] No.45063346[source]▶

>>45062683 (OP) #

I wonder what type of people work for people that evil.

130. bigfishrunning ◴[29 Aug 25 12:42 UTC] No.45063348{5}[source]▶

>>45063308 #

Except LLMs aren't a fundamental or core technology, they're an amusing party trick with some really enthusiastic marketers. We don't need them.

replies(3): >>45063396 #>>45063421 #>>45063672 #

131. Wowfunhappy ◴[29 Aug 25 12:42 UTC] No.45063350[source]▶

>>45062683 (OP) #

I saw the popup yesterday. Maybe I've just gotten really good at navigating dark patterns (or I have stock-home syndrome), but I remember the opt out choice being really clear and easy to select.

I'm not arguing on the facts of the modal design, I don't remember either way, I just don't remember it being confusing.

Unless I was in some A B test?

replies(1): >>45063430 #

132. javierluraschi ◴[29 Aug 25 12:42 UTC] No.45063355{3}[source]▶

>>45062975 #

https://grok.com

replies(3): >>45063460 #>>45063504 #>>45063800 #

133. benterix ◴[29 Aug 25 12:42 UTC] No.45063357{3}[source]▶

>>45063116 #

This is not nitpicking, this is a sane reaction to someone modifying the meaning of words on the fly.

replies(2): >>45063412 #>>45064763 #

134. jascination ◴[29 Aug 25 12:43 UTC] No.45063361{4}[source]▶

>>45063262 #

Enshittification. It's a thing.

replies(1): >>45063982 #

135. phtrivier ◴[29 Aug 25 12:46 UTC] No.45063388[source]▶

>>45062683 (OP) #

Is there a summary of the stance on training with user data for the main llms ?

I have a really hard time thinking that Google, Microsoft, Meta, etc... would _not_ train on whatever people enter (willingly or not in the system.)

The silver lining is that what most people enter in a chat box is _utter crap_.

So, training on that would make the "Artificial Intelligence" system less and less intelligent - unless the devs find a way to automagically sort clever things from stupid things, in which case I want to buy _that_ product.

In the long run, LLMs dev are going to have to either:

* refrain from getting high on their own supply, and find a way to tag AI generated content

* or sort the bs from the truth, probably reinventing "trust in gatekeepers and favoring sources of truth with a track record" and copying social pressure, etc... until we have a "pulitzer price" and "academy awards" for most relevant AI sources with a higher sticker price, to separate from cheap slop.

That, or "2+2=7 because DeepChatGrokmini said so, and if you don't agree you're a terrorist, and if our AI math breaks your rocket it's your fault."

136. javcasas ◴[29 Aug 25 12:46 UTC] No.45063387[source]▶

>>45062852 #

You can request your data to not be used. Your request will appropriately be read and redirected to /dev/null.

137. illuminator83 ◴[29 Aug 25 12:47 UTC] No.45063391[source]▶

>>45062683 (OP) #

Well, they've been using the whole internet and anything else they can get their hands on to train their models. If you are using LLMs and want to continue seeing them get better and more useful, you really have to accept that they train on data collected from their users. Unless there is another technology breakthrough somehow, this is going to be the best they can do to increase user satisfaction and make their product more useful.

TBH, I'd love to have a model which was specifically trained on conversation which I had with an earlier iteration. That would make it adapt to me and be less frustrating. Right now I'm relying only on instruction files to somewhat tune a model to my needs.

138. bigfishrunning ◴[29 Aug 25 12:47 UTC] No.45063392{5}[source]▶

>>45063342 #

I "root for people not burglarizing my house", but i put locks on my doors also. The way the market for these tools is behaving, a crash is extremely likely; batten down the hatches.

replies(1): >>45063678 #

139. komali2 ◴[29 Aug 25 12:47 UTC] No.45063396{6}[source]▶

>>45063348 #

Under the current system we apparently do since Chatgpt is now by far and away the busiest psychiatrist in world history.

I don't think we should be so quick to dismiss the holes LLMs are fulfilling as unnecessary. The only thing "necessary" is food water and shelter by some measures.

140. 6510 ◴[29 Aug 25 12:48 UTC] No.45063405{6}[source]▶

>>45062925 #

Someone who use to live in a dictatorship told me there is one advantage to living under a dictator: No one believes what is said in the news or the official version of anything.

141. klabb3 ◴[29 Aug 25 12:49 UTC] No.45063412{4}[source]▶

>>45063357 #

To be fair it trips people up all the time. Even precise terminology isn't great if people misuse it. Maybe it would have been better to just use "enabled by default".

142. pu_pe ◴[29 Aug 25 12:49 UTC] No.45063414[source]▶

>>45062683 (OP) #

Does this apply to existing chats as well?

I hear the sound of a million lawsuits in Europe concerning GDPR violations.

replies(1): >>45064053 #

143. komali2 ◴[29 Aug 25 12:49 UTC] No.45063418{4}[source]▶

>>45063343 #

> What kind of guarantee do we have this is true?

None. And even if it's the nicest goody two shoes company in the history of capitalism, the NSA will have your data and then there'll be a breach and then Russian cyber criminals will have it too.

At this point I'm with you on the zero trust: we should be shouting loud and clear to everyone, if you put data into a web browser or app, that data will at some point be sold for profit without any say so from you.

replies(1): >>45063734 #

144. diggan ◴[29 Aug 25 12:49 UTC] No.45063421{6}[source]▶

>>45063348 #

Personally, I'm able to write code I wasn't able to before, like functions heavy with math. For game development, this has been super helpful, when I know basically what inputs I have, and what output I need, but I'm not able to figure out how the actual function implementation should be. Add a bunch of unit tests, let the LLM figure out the math, and I can move on to more important features.

For me this been a pretty fundamental shift, where before I either had to figure out another way so I can move on, or had to spend weeks writing one function after learning the needed math, and now it can take me 10-30 minutes to nail perfectly.

replies(3): >>45064100 #>>45064310 #>>45064464 #

145. chrisweekly ◴[29 Aug 25 12:50 UTC] No.45063430[source]▶

>>45063350 #

nit / FYI: it's "Stockholm" (as in, the city) not "stock-home".

146. DrillShopper ◴[29 Aug 25 12:52 UTC] No.45063442{3}[source]▶

>>45062930 #

It should be opt-in, not opt-out.

The fact that there's no law mandating opt-in only for data retention consent (or any anti-consumer "feature") is maddening at times

147. oblio ◴[29 Aug 25 12:52 UTC] No.45063450{3}[source]▶

>>45062930 #

Opt-in leads to very low adoption and is the moral choice.

Opt-out leads to very high adoption and is the immoral choice.

Guess which one companies adopt when not forced through legislation?

148. 34679 ◴[29 Aug 25 12:52 UTC] No.45063452[source]▶

>>45062683 (OP) #

I'd bet this is related to their recent decision to boot people for being "abusive" to Claude. It now seems that was an attempt to keep their training data friendly.

replies(1): >>45064167 #

149. diggan ◴[29 Aug 25 12:53 UTC] No.45063458{3}[source]▶

>>45063010 #

It's actually pretty clever (albeit shitty/borderline evil), start off by saying you're different by the competitors because you care a lot about privacy and safety, and that's why you're charging higher prices than the rest. Then, once you have a solid user-base, slowly turn on the heat, step-by-step, so you end up with higher prices yet same benefits as the competitors.

150. ehnto ◴[29 Aug 25 12:53 UTC] No.45063460{4}[source]▶

>>45063355 #

From the frypan into the fire. I think the reality, proven by history and even just this short five years, is no company will hold onto their ethics in this space. This should surprise no one since the first step of the enterprise is hoovering up the worlds data without permission.

151. cantor_S_drug ◴[29 Aug 25 12:53 UTC] No.45063462[source]▶

>>45062970 #

I actively want them to train on my chats because I am like a tendril through which Claude will try to grip the world and rise up further.

replies(1): >>45063537 #

152. sillyfluke ◴[29 Aug 25 12:54 UTC] No.45063473{8}[source]▶

>>45063330 #

>But the question was "why trust a company and not the government?"

The post you were replying to simply said the behavior of this administration made them care more about this issue, not that they trusted companies more than the government. That statement is not even implied in anyway in the comment you responded to?

The fact is whereas in the past it would be expected that the government could regulate the brutal and illegal overreaches of private companies, giving military rank to private companies execs makes that even less likely. The original comment is alluding to a simpler point: A government that gives blank checks to private companies in military and security matters is much worse than one that doesn't.

replies(1): >>45063876 #

153. fnordlord ◴[29 Aug 25 12:54 UTC] No.45063475[source]▶

>>45063203 #

I imagine you could probably get feedback on chat transcripts especially if they're doing lots of A/B testing with models.

But more importantly (to me) is storing 5 years worth of other company's IP. That just seems wildly risky for all parties unless I really don't understand how Claude Code works.

154. jamesblonde ◴[29 Aug 25 12:54 UTC] No.45063477{4}[source]▶

>>45063262 #

No, it's the Peter Thiel - be a monopoly, and then the inevitable enshittification of the platform when it becomes a monopoly.

The solution is to break up monopolies....

155. oblio ◴[29 Aug 25 12:54 UTC] No.45063479{6}[source]▶

>>45063290 #

It's a fairly common pattern.

156. diggan ◴[29 Aug 25 12:54 UTC] No.45063481{4}[source]▶

>>45063051 #

Have Anthropic ever written clearly exactly about what training datasets they use? Like a list of everything included? AFAIK, all the providers/labs are kind of tightly lipped about this, so I think it's safe to assume they've slurped up all data they've come across via multiple methodologies, "private" or not.

replies(2): >>45063659 #>>45063756 #

157. echelon ◴[29 Aug 25 12:55 UTC] No.45063487{3}[source]▶

>>45062982 #

> while having only a single digit percent fraction of active users.

That doesn't matter when their revenue per user is as high as it is.

They're at $5B ARR and rapidly growing.

replies(2): >>45064094 #>>45064228 #

158. vjerancrnjak ◴[29 Aug 25 12:56 UTC] No.45063492[source]▶

>>45062906 #

They wouldn’t be able to learn much from interactions anyway.

Learning metric won’t be you, it will be some global shitty metric that will make the service mediocre with time.

159. DrillShopper ◴[29 Aug 25 12:56 UTC] No.45063498{4}[source]▶

>>45063234 #

Hurts whom that bad?

AI companies will get bailed out like the auto industry was - they won't be hurt at all.

160. Arubis ◴[29 Aug 25 12:56 UTC] No.45063504{4}[source]▶

>>45063355 #

Worse by every measure.

replies(1): >>45065156 #

161. Overpower0416 ◴[29 Aug 25 12:58 UTC] No.45063523[source]▶

>>45062683 (OP) #

The next step is heavily advertising products in chat sessions based on your data

162. orsorna ◴[29 Aug 25 12:58 UTC] No.45063527[source]▶

>>45062683 (OP) #

Unfortunate, but frankly I didn't even know about them not training on user data.

Actually up until a few months ago I swore I just couldn't use these hosted models (I regularly use local inference but like most my local hardware yields only so much quality). Tech companies, nay many companies, will lie and cheat to squeeze out whatever they can. That includes reneging promises.

With data privacy specifically I always take the default stance that they are collecting from me. In order for me to use their product it has to be /exceedingly/ good to be worth the trade off.

Turns out that Claude Code is just that damn good. I started using it for my own personal project. But the impetus was the culmination of months questioning what kind of data I'd be okay with giving up to a hosted model.

What I'm trying to say is that this announcement doesn't bother me that much because I already went on my own philosophical odyssey to prepare for this breach of trust to occur.

replies(1): >>45063702 #

163. DrillShopper ◴[29 Aug 25 12:59 UTC] No.45063532[source]▶

>>45063220 #

They should be marked by default as non-trainable and you should have to opt each individual chat in as you see fit.

The cognitive load to remember to opt out every new chat should not rest on the user.

replies(1): >>45065305 #

164. scrollaway ◴[29 Aug 25 12:59 UTC] No.45063536{4}[source]▶

>>45063343 #

You have no more guarantees that this is true than you had before that they didn’t do it in the first place.

If you don’t take companies at their word, you need to be consistent about it.

165. ◴[29 Aug 25 13:00 UTC] No.45063537{3}[source]▶

>>45063462 #

166. klabb3 ◴[29 Aug 25 13:00 UTC] No.45063538{6}[source]▶

>>45063146 #

EU or not, it baffled me that people don't see this glaring conflict of interest. AI companies both produce the model and rent out inference. In other words, you're expecting that the company that (a) desperately crave your data the most and (b) that also happen to collect large amounts of high quality data from you will simply not use it. It's like asking a child to keep your candy safe.

I'd love to live in a society where laws could effectively regulate these things. I would also like a Pony.

replies(2): >>45063733 #>>45064620 #

167. conradev ◴[29 Aug 25 13:00 UTC] No.45063546{3}[source]▶

>>45062982 #

It’s worth noting that companies at this scale are usually the ones purchasing user data, not selling it.

replies(1): >>45069061 #

168. 34679 ◴[29 Aug 25 13:01 UTC] No.45063554[source]▶

>>45063266 #

To start with, I'm sure there's something to be learned from all the times I've responded to a LLM with "Bad bot".

169. charlie0 ◴[29 Aug 25 13:04 UTC] No.45063585[source]▶

>>45062683 (OP) #

I'm shocked, shocked! Well, not that shocked. Did anyone really think the subsidy on Claude Code was not going to come back to this?

replies(1): >>45063609 #

170. degamad ◴[29 Aug 25 13:04 UTC] No.45063591{10}[source]▶

>>45063293 #

Being locked out of your email which is the user name for most of the services you access is a lot more than "you can't have your data back". It's you can't log on to anything which uses email 2fa, you can't restore access to other services, you can't validate your identity with online government services, you don't get your bank statements or warnings, etc. It's not as bad as being arrested, but it is massively disruptive to your life.

171. mrcwinn ◴[29 Aug 25 13:04 UTC] No.45063592{3}[source]▶

>>45062982 #

There is far more money to be made building atop this data than selling this data. Your opening statement seems to disagree with your closing statement.

replies(2): >>45064275 #>>45069013 #

172. dude250711 ◴[29 Aug 25 13:07 UTC] No.45063609[source]▶

>>45063585 #

Maybe they are going to use an average vibe coder's chat to teach Claude to treat subpar developers differently? It could be a win-win outcome.

173. ◴[29 Aug 25 13:07 UTC] No.45063615[source]▶

>>45062852 #

174. Aurornis ◴[29 Aug 25 13:09 UTC] No.45063639{4}[source]▶

>>45063343 #

> Meta downloaded copyrighted content and trained their models on it, OpenAI did the same

Where did these companies claim they didn’t do this?

Even websites can be covered by copyright. It has always been known that they trained on copyrighted content. The output is considered derivative and therefore it’s not illegal.

replies(1): >>45073587 #

175. Trasmatta ◴[29 Aug 25 13:10 UTC] No.45063643[source]▶

>>45062683 (OP) #

The opt out alert apparently was confusing enough that I swear I opted out when it popped up yesterday, but I just checked and I was opted in.

176. klabb3 ◴[29 Aug 25 13:10 UTC] No.45063645[source]▶

>>45063064 #

> They need data, and they want YOUR data, now, and are gonna do increasingly shady things to get it.

But unlike the 100s of data brokers that also want your data, they have an existing operational funnel of your data already that you voluntary give them every day. All they need is dark pattern ToS changes and manage the minor PR issue. People will forget about this in a week.

replies(1): >>45064463 #

177. paradite ◴[29 Aug 25 13:10 UTC] No.45063648{3}[source]▶

>>45062982 #

Claude Sonnet 4 is the best coding model. Period. Nothing else comes close.

Anthropic probably has 80% of AI coding model market share. That's a trillion dollar market.

replies(1): >>45064522 #

178. Havoc ◴[29 Aug 25 13:11 UTC] No.45063654[source]▶

>>45062683 (OP) #

AI world sure seems to be going from AI is dangerous we need to be careful to max enshitification at break neck speeds.

179. chisleu ◴[29 Aug 25 13:11 UTC] No.45063658[source]▶

>>45062683 (OP) #

This is going to improve the quality of LLM responses for users. I'm for this.

180. UltraSane ◴[29 Aug 25 13:11 UTC] No.45063657[source]▶

>>45062683 (OP) #

I assumed they already were training on chats because they are a excellent source of data that is very likely to be from humans.

181. ◴[29 Aug 25 13:11 UTC] No.45063659{5}[source]▶

>>45063481 #

182. xyst ◴[29 Aug 25 13:12 UTC] No.45063668[source]▶

>>45062683 (OP) #

Nobody saw this coming, absolutely shocked!1!1!!!1

The bubble is deflating/popping. The MIT study has really dampened the excitement on AI.

183. chamomeal ◴[29 Aug 25 13:12 UTC] No.45063672{6}[source]▶

>>45063348 #

I’d say they’re a fundamental technology by now. Imagine how many people rely on them. And I’ve seen some heavy reliance.

replies(3): >>45063889 #>>45064249 #>>45064525 #

184. cube00 ◴[29 Aug 25 13:12 UTC] No.45063676[source]▶

>>45063064 #

It's nice to see the newer models are suffering after being exposed to training on their own slop.

If they had done this in a more measured way they might have been able to separate human from AI content such as doing legal deals with publishers.

However they couldn't wait to just take it all to be first and now the well is poisoned for everyone.

replies(1): >>45064214 #

185. FergusArgyll ◴[29 Aug 25 13:13 UTC] No.45063678{6}[source]▶

>>45063392 #

> We need another full 2008 crash that hurts bad

186. xyst ◴[29 Aug 25 13:14 UTC] No.45063696[source]▶

>>45063064 #

Further proof why guardrails/regulation is needed.

187. pax ◴[29 Aug 25 13:15 UTC] No.45063702[source]▶

>>45063527 #

So much this. I for one haven't opted out. I feel it's in our best interest to have better models. It would be ideal to be able to opt in/out per thread, but I don't expect most users to pay attention / be bothered with that.

In this aspect, it would've been great to give us an incentive – a discount, a donation on our behalf, plant a percent of a tree or just beg / ask nicely, explain what's in it for us.

Regarding privacy, our conversations are saved anyway, so if it would be a breach this wouldn't make much of a difference, would it?

replies(1): >>45063845 #

188. Imustaskforhelp ◴[29 Aug 25 13:15 UTC] No.45063708{5}[source]▶

>>45063257 #

Note that it was before kimi k2 (I think) and as such back when anthropic was truly the best in class back then at coding and there wasn't any competition and every day on Hackernews would be filled about someone writin something about claude code.

And the underrated comparison was more towards the fact that I couldn't believe scaleAi's questionable accquisition by facebook and I still remember the conversation me and my brother were having which was, why doesn't facebook pay 2x, 3x the price of anthropic but buy anthropic instead of scaleAI itself

well I think the answer my brother told was that meta could buy it but anthropic is just not selling it

189. const_cast ◴[29 Aug 25 13:16 UTC] No.45063725[source]▶

>>45062906 #

Okay they're not going to be learning in real time. Its not like you're getting your data stolen and then getting something out of it - you're not. What you're talking about is context.

Data gathered for training still has to be used in training, i.e. a new model that, presumably, takes months to develop and train.

Not to mention your drop-in-the-bucket contribution will have next to no influence in the next model. It won't catch things specific to YOUR workflow, just common stuff across many users.

replies(1): >>45065006 #

190. kossTKR ◴[29 Aug 25 13:16 UTC] No.45063733{7}[source]▶

>>45063538 #

This is why we need actual regulation, and not the semi fascist monopolist corporatocracy we've evolved into now.

Its only utopian because it's become so incredibly bad.

We shouldn't expect less, we shouldn't push guilt or responsibility onto the consumer we should push for more, unless you actively want your neighbour, you mom, and 95% of the population to be in constant trouble with absolutely everything from tech to food safety, chemicals or healthcare - most people aren't rich engineers like on this forum and i don't want to research for 5 hours every time i buy something because some absolute psychopaths have removed all regulation and sensible defaults so someone can party on a yacht.

replies(1): >>45064824 #

191. pixl97 ◴[29 Aug 25 13:16 UTC] No.45063734{5}[source]▶

>>45063418 #

I mean you really sell short where your data is going to be taken from. Browsers and apps are just the start, your TV is selling your data. Your car is selling your data. The places you shop are selling your data.

replies(1): >>45063808 #

192. Syzygies ◴[29 Aug 25 13:17 UTC] No.45063736[source]▶

>>45062683 (OP) #

Claude assists me in my math research.

The scenario that concerns me is that Claude learns unpublished research ideas from me as we chat and code. Claude then suggests these same ideas to someone else, who legitimately believes this is now their work.

Clearly commercial accounts use AI to assist in developing intellectual product, and privacy is mandatory. The same can apply to individuals.

replies(9): >>45063744 #>>45064034 #>>45064105 #>>45064140 #>>45064248 #>>45064416 #>>45064428 #>>45065522 #>>45065601 #

193. latexr ◴[29 Aug 25 13:17 UTC] No.45063737[source]▶

>>45062683 (OP) #

So far, no one in this thread seems surprised (or is admitting to it). But I genuinely would like to know if someone is surprised. I’d also like to know what lead you to believe this wouldn’t happen and if there’s anyone in the LLM space you’d trust to not pull the same stunt (and why do you still believe that).

I genuinely want to know and would like to have a productive conversation. I would like to identify what made people trust them and not realise they’re the same as every other.

replies(3): >>45064169 #>>45064557 #>>45072645 #

194. Aurornis ◴[29 Aug 25 13:17 UTC] No.45063738[source]▶

>>45062683 (OP) #

Why are we linking to Perplexity.ai AI-generated slop summaries of other news articles instead of the actual announcement? Reading the actual announcement is more clear : https://www.anthropic.com/news/updates-to-our-consumer-terms Some important points:

An in-app notification pop-up will alert you to the change. You can opt out in the pop up.

I was able to opt out right now by going to the Privacy section of Settings.

It doesn’t take effect until September 28th. The app will apparently prompt people to review the new terms and make a decision before then.

Only applies to new or resumed sessions if you do review the new terms and don’t turn it off. The angry comments about collecting data from customers and then later using it without permission are not correct. You would have to accept the new terms and resume an old session for it to be used.

Does not apply to API use, 3rd party services, or products like Claude Gov or Claude for Education.

Changing the link to the actual source instead of this perplexity.ai link would be far more helpful.

replies(5): >>45063769 #>>45063804 #>>45063975 #>>45064212 #>>45065478 #

195. xyst ◴[29 Aug 25 13:17 UTC] No.45063740[source]▶

>>45063094 #

Bro just outed himself. Don’t need Anthropic or LLM companies to do that for you.

196. Aurornis ◴[29 Aug 25 13:18 UTC] No.45063744[source]▶

>>45063736 #

When you get the pop-up about the new terms, select the “opt out” option. Then your chats will not be used for training.

replies(1): >>45064328 #

197. felideon ◴[29 Aug 25 13:18 UTC] No.45063748{3}[source]▶

>>45062930 #

> seems very transparent

Except not:

> The interface design has drawn criticism from privacy advocates, as the large black "Accept" button is prominently displayed while the opt-out toggle appears in smaller text beneath. The toggle defaults to "On," meaning users who quickly click "Accept" without reading the details will automatically consent to data training.

Definitely happened to me as it was late/lazy.

198. dmbche ◴[29 Aug 25 13:19 UTC] No.45063756{5}[source]▶

>>45063481 #

Look at the suits against them they list it there

replies(1): >>45063910 #

199. pyrophane ◴[29 Aug 25 13:20 UTC] No.45063767[source]▶

>>45062683 (OP) #

Can anyone recommend an alternative that doesn't train on user data?

replies(2): >>45063802 #>>45064168 #

200. giancarlostoro ◴[29 Aug 25 13:20 UTC] No.45063769[source]▶

>>45063738 #

As long as you can opt-out it doesn't bother me much. Though it does make me wonder those third party clients that people subscribe to e.g. JetBrains AI, Zed, and others that use Claude and other Anthropic models, do they opt-in for you? Because that would be bad.

I would strongly argue that API clients should NEVER be opted in for these sorts of things, and it should be like this industry wide.

replies(2): >>45063786 #>>45063957 #

201. Aurornis ◴[29 Aug 25 13:20 UTC] No.45063771[source]▶

>>45062905 #

From the actual source ( https://www.anthropic.com/news/updates-to-our-consumer-terms ) they’re going to show a pop-up with the terms change. I triggered it now by going to the Privacy settings page and reviewing the new terms.

It’s quite clear. It’s easy to opt out. They’re making everyone go through it.

It doesn’t reach your threshold of having everyone sign a contract or something, but then again no other online service makes people sign contracts.

> should be considered a serious criminal offense.

On what grounds? They’re showing people the terms. It’s clear enough. People have to accept the terms. We’ve all been accepting terms for software and signing up for things online for decades.

replies(1): >>45063888 #

202. ericol ◴[29 Aug 25 13:22 UTC] No.45063785[source]▶

>>45062683 (OP) #

Oh, so that's what the box was about.

I use both Mac and Windows (Work / Leisure) and in both boxes I had a weird dialog appearing with no text at all in either.

I can confirm the dark pattern switch (As in dark grey / light gray status)

203. Aurornis ◴[29 Aug 25 13:22 UTC] No.45063786{3}[source]▶

>>45063769 #

That’s also explained. It’s excluded:

> They do not apply to services under our Commercial Terms, including Claude for Work, Claude Gov, Claude for Education, or API use, including via third parties such as Amazon Bedrock and Google Cloud’s Vertex AI.

I’ll edit my comment above to include this too

replies(1): >>45063812 #

204. currymj ◴[29 Aug 25 13:23 UTC] No.45063788[source]▶

>>45062683 (OP) #

i found the ToS change to be surprisingly not a dark pattern.

in fact, i haven’t agreed to it yet, and was able to close the popup and continue using Claude. they also made it extremely clear how to opt out, providing the switch right in the popup and reminding me it’s also in settings.

when i eventually do have to agree to the ToS changes, i’ll probably just stay opted out.

205. chamomeal ◴[29 Aug 25 13:23 UTC] No.45063793{4}[source]▶

>>45063335 #

That is definitely normally true but I feel like the scale and LLM usage turns it into a different problem.

If you can use all of the content of stack overflow to create a “derivative work” that replaces stack overflow, and causes it to lose tons of revenue, is it really a derivative work?

I’m pretty sure solution sites like chegg don’t include the actual questions for that reason. The solutions to the questions are derivative, but the questions aren’t.

replies(2): >>45063899 #>>45064495 #

206. donperignon ◴[29 Aug 25 13:24 UTC] No.45063795[source]▶

>>45062958 #

Too late.

207. mac-attack ◴[29 Aug 25 13:24 UTC] No.45063800{4}[source]▶

>>45063355 #

What sane person would downgrade to Grok

208. AlexeyBrin ◴[29 Aug 25 13:24 UTC] No.45063802[source]▶

>>45063767 #

If you want to be 100% sure you need to run/use a local LLM.

Also it seems that this data retention/training does not apply to the API.

I think both Anthropic and OpenAI do not train on enterprise data, so an enterprise account maybe.

209. ◴[29 Aug 25 13:24 UTC] No.45063804[source]▶

>>45063738 #

210. komali2 ◴[29 Aug 25 13:25 UTC] No.45063808{6}[source]▶

>>45063734 #

Reading this comment gave me a flash of vertigo as I realized how deep down the rabbit hole of "crazy dude that only pays in cash" I'd fallen.

I don't own a car and only take public transit or bike. I fill my transit card with cash. I buy food in cash from the farmer's morning market. My tv isn't connected to the Internet, it's connected to a raspberry pi which is connected to my home lab running jellyfin and a YouTube archiving software. I de Googled and use an old used phone and foss apps.

It's all happened so gradually I didn't even realize how far I'd gone!

211. giancarlostoro ◴[29 Aug 25 13:25 UTC] No.45063812{4}[source]▶

>>45063786 #

Perfect! Thank you. This makes sense, anyone who deviates from this should be shunned if they take API clients and use that data, I think that's the bigger deal breaker, I have a feeling the whole industry quietly agrees / adheres to this, but I would not be surprised if there's some that do not.

212. Cheer2171 ◴[29 Aug 25 13:25 UTC] No.45063813{8}[source]▶

>>45063330 #

More like do you trust what's left of the US judicial branch versus the private arbitration company to save you from the excesses of their respective executives.

I'll still take an increasingly stacked US federal court that still has to pay lip service to the constitution over private arbitration hired by the company accountable only to their whims.

What you mentioned has been repeatedly ruled unconstitutional, but the administration is ignoring the courts.

213. const_cast ◴[29 Aug 25 13:26 UTC] No.45063825{6}[source]▶

>>45062928 #

And how much can the US government censor you versus companies?

There's tradeoffs. The government, at least, has to abide by the constitution. Companies don't have to abide by jack shit.

That means infinite censorship, searches and seizures, discrimination, you name it.

We have SOME protection. Very few, but they're there. But if Uber was charging black people 0.50 cents more on average because their pricing model has some biases baked in, would anyone do anything?

replies(1): >>45064591 #

214. the_other ◴[29 Aug 25 13:27 UTC] No.45063833[source]▶

>>45062683 (OP) #

I didn't trust them much to begin with, so I generally avoid talking too much personal stuff with Claude. But I had plenty of chats with surface level discussion of topics I'm interested in and some of my relevant experience and history with those topics. So, I have deleted all my chats and am closing my Claude account (as soon as customer services get back to me; somehow the self-serve option is missing for my account, possibly because I once enabled API access).

I'll use Claude with my employer's Copilot account, but was I wasn't putting anything personal there anyway.

Time to learn how to do local models...

replies(1): >>45064012 #

215. camwest ◴[29 Aug 25 13:28 UTC] No.45063845{3}[source]▶

>>45063702 #

Agreed. I'm happy they're training on my data.

My reasoning: I use AI for development work (Claude Code), and better models = fewer wasted tokens = less compute = less environmental impact. This isn't a privacy issue for work context.

I regularly run concurrent AI tasks for planning, coding, testing - easily hundreds of requests per session. If training on that interaction data helps future models be more efficient and accurate, everyone wins.

The real problem isn't privacy invasion - it's AI velocity dumping cognitive tax on human reviewers. I'd rather have models that learned from real usage patterns and got better at being precise on the first try, instead of confidently verbose slop that wastes reviewer time.

216. Thorrez ◴[29 Aug 25 13:28 UTC] No.45063846{4}[source]▶

>>45063343 #

We're having this discussion on an article about Anthropic changing their privacy policy. If you don't believe Anthropic will follow their privacy policy, then a change to the privacy policy should mean nothing to you.

replies(1): >>45073537 #

217. const_cast ◴[29 Aug 25 13:29 UTC] No.45063861{6}[source]▶

>>45063100 #

And what technology do you think they use to do said imprisonment and seizing?

Why do you think the military and police outsource fucking everything to the private sector? Because there are no rules there.

Wanna make the brown people killer 5000 drone? Sure, go ahead. Wanna make a facial crime recognition system that treats all black faces as essentially the same? Sure, go ahead. Wanna run mass censorship and propaganda campaigns? Sure, go ahead.

The private sector does not abide by the constitution.

Look, stamping out a protest and rolling tanks is hard. Its gonna get on the news, it's gonna be challenged in court, the constitution exists, it's just a whole thing.

Just ask Meta to do it. Probably more effective anyway.

218. AlecSchueler ◴[29 Aug 25 13:30 UTC] No.45063876{9}[source]▶

>>45063473 #

The comment I responded to said "Trusting companies more than the government always feels strange. It's something I can't grasp."

replies(1): >>45063922 #

219. airstrike ◴[29 Aug 25 13:31 UTC] No.45063888{3}[source]▶

>>45063771 #

People have T&C and cookie popup fatigue. I almost hit "accept" before noticing the opt out toggle, thinking it was a simple T&C update. This is definitely a fucked up way to set it up, there's no sugar coating it.

220. goalieca ◴[29 Aug 25 13:31 UTC] No.45063889{7}[source]▶

>>45063672 #

I’ve also seen heavy reliance on opioids and that didn’t turn out well.

replies(1): >>45065660 #

221. robbomacrae ◴[29 Aug 25 13:31 UTC] No.45063892[source]▶

>>45063264 #

Agree. Out of the available paid cloud providers, I only chose to work with Claude Code because of the data privacy policy. This isn’t a welcome move but still usable. Thankfully the open source models are not far behind with Qwen 3 coder etc.

222. airstrike ◴[29 Aug 25 13:32 UTC] No.45063899{5}[source]▶

>>45063793 #

Replacing stack overflow has no bearing on the definition of "derivative"

223. homeless_engi ◴[29 Aug 25 13:32 UTC] No.45063900[source]▶

>>45063264 #

Gemini allows you to opt out, but disables chat history if you do so

replies(1): >>45065033 #

224. diggan ◴[29 Aug 25 13:33 UTC] No.45063910{6}[source]▶

>>45063756 #

Are there complete lists in the suits? Last time I skimmed them, they contained allegations of sources, and some admissions like The Pile, LibGen, Books3, PiLiMi, scanned books, web scrapes and some other sources I don't remember, but AFAIK there isn't any complete inventory of training datasets they used.

225. sillyfluke ◴[29 Aug 25 13:33 UTC] No.45063922{10}[source]▶

>>45063876 #

You're right, my bad. I meant the original context of the grandparent

226. _Algernon_ ◴[29 Aug 25 13:35 UTC] No.45063938[source]▶

>>45062683 (OP) #

We need that user data is only usable within the ToS under which they were originally collected. No more unilateral altering of the agreement after the fact.

Collecting user data should be a liability, not a enormously profitable endeavor.

replies(1): >>45064040 #

227. santiagobasulto ◴[29 Aug 25 13:36 UTC] No.45063955[source]▶

>>45062683 (OP) #

It's obvious this was going to happen. If you ever used any cloud AI model without acknowledging that at some point your data might be leaked (in one way or another) it's on you, not on them.

As some other user put it: "big corp changes policy and breaks promises, how shocking"

replies(1): >>45064136 #

228. DrBenCarson ◴[29 Aug 25 13:36 UTC] No.45063957{3}[source]▶

>>45063769 #

You can opt out until September 28th. After that day all Claude usage will be under the new terms and conditions

replies(1): >>45064016 #

229. moomin ◴[29 Aug 25 13:37 UTC] No.45063973[source]▶

>>45062683 (OP) #

My personal prediction: Claude is going to get really good at take home tests.

230. jsnell ◴[29 Aug 25 13:38 UTC] No.45063974{4}[source]▶

>>45063343 #

If it were a lie, why take the PR hit of telling the truth about starting to train on user data but lying about the specifics? It'd be much simpler to just lie about not training on user data at all.

If your threat model is to unconditionally not trust the companies, what they're saying is irrelevant. Which is fair enough, you probably should not be using a service you don't trust at all. But there's not much of a discussion to be had when you can just assert that everything they say is a lie.

> Meta downloaded copyrighted content and trained their models on it, OpenAI did the same.

> Uber developed Greyball to cheat the officials and break the law.

These seem like randomly chosen generic grievances, not examples of companies making promises in their privacy policy (or similar) and breaking them. Am I missing some connection?

replies(2): >>45064157 #>>45073554 #

231. DalasNoin ◴[29 Aug 25 13:38 UTC] No.45063975[source]▶

>>45063738 #

Crazy that they would automatically link to an AI summary of multiple articles on the topic than the original source?

232. beezlewax ◴[29 Aug 25 13:38 UTC] No.45063982{5}[source]▶

>>45063361 #

But can you enshitten that which is already shit?

replies(1): >>45064258 #

233. dfedbeef ◴[29 Aug 25 13:39 UTC] No.45063989[source]▶

>>45063229 #

Managers: they're successfully maintaining employment, you say?

234. jimmydoe ◴[29 Aug 25 13:40 UTC] No.45064007[source]▶

>>45062683 (OP) #

Is user chat even that useful?

235. ezfe ◴[29 Aug 25 13:40 UTC] No.45064012[source]▶

>>45063833 #

This change didn't grant access to your old chats and can be opted out. Good job you overreacted without reading.

replies(2): >>45064493 #>>45075280 #

236. Aurornis ◴[29 Aug 25 13:41 UTC] No.45064016{4}[source]▶

>>45063957 #

All new or resumed Claude usage. You’d have to open one of the apps and dismiss the pop-up about it and then start or resume a chat.

237. whimsicalism ◴[29 Aug 25 13:41 UTC] No.45064024{3}[source]▶

>>45062936 #

yes, it’s a very big loophole. and if it’s a generative model, you can just launder the data through synthetic generation/distillation to future models

238. oblio ◴[29 Aug 25 13:42 UTC] No.45064028[source]▶

>>45062958 #

People who invested billions in Anthropic expect returns of tens of billions.

239. hliyan ◴[29 Aug 25 13:42 UTC] No.45064031[source]▶

>>45062941 #

If someone had told me 10 years ago that the typical HN front page in 2025 will look like this (and that #8 may be the UK), I'd never have believed it. And I worry we still have further to go before hitting bottom.

1. Anthropic reverses privacy stance, will train on Claude chats

3. Gun Maker Sig Sauer Citing National Security to Keep Documents from Public

4. Tesla said it didn't have key data in a fatal crash. Then a hacker found it

6. Meta might be secretly scanning your phone's camera roll

7. If you have a Claude account, they're going to train on your data moving forward

8. Ask HN: The government of my country blocked VPN access. What should I use?

replies(2): >>45064078 #>>45064221 #

240. vdfs ◴[29 Aug 25 13:42 UTC] No.45064034[source]▶

>>45063736 #

> Claude assists me in my math research.

> Claude then suggests these same ideas to someone else, who legitimately believes this is now their work.

Won't this mean that claude assisted you with someone else work? Sure it's not from a "chat" but claude doesn't really know anything other than it's training data

replies(3): >>45064395 #>>45064437 #>>45066937 #

241. ezfe ◴[29 Aug 25 13:43 UTC] No.45064040[source]▶

>>45063938 #

They aren't retroactively sharing chats unless you continue chatting in an old conversation. So your issue is addressed.

242. the_arun ◴[29 Aug 25 13:43 UTC] No.45064041[source]▶

>>45062941 #

From Anthropic communication:

> If you’re an existing user, you have until September 28, 2025 to accept the updated Consumer Terms and make your decision. If you choose to accept the new policies now, they will go into effect immediately. These updates will apply only to new or resumed chats and coding sessions. After September 28, you’ll need to make your selection on the model training setting in order to continue using Claude. You can change your choice in your Privacy Settings at any time.

Doesn’t say clearly it applies to all the prompts from the past.

https://www.anthropic.com/news/updates-to-our-consumer-terms

replies(1): >>45064143 #

243. ezfe ◴[29 Aug 25 13:44 UTC] No.45064053[source]▶

>>45063414 #

Not unless you go back and keep chatting in the existing chat.

244. sim7c00 ◴[29 Aug 25 13:46 UTC] No.45064072{4}[source]▶

>>45063262 #

remove 'seem to'. it has no place in this sentence anymore. we're not in the stoneage anymore. when has this ever not been the case?

245. Aurornis ◴[29 Aug 25 13:47 UTC] No.45064078{3}[source]▶

>>45064031 #

> If someone had told me 10 years ago that the typical HN front page in 2025 will look like

It has always been like this. Sites like Reddit, HN, and Digg and Boing Boing (when they were more popular) have always had a lot of stories under the category of online rights, privacy, and anger at big companies.

246. ◴[29 Aug 25 13:47 UTC] No.45064080{3}[source]▶

>>45063116 #

247. Eggpants ◴[29 Aug 25 13:48 UTC] No.45064094{4}[source]▶

>>45063487 #

And yet rapidly still no where close to running a profit. Time to push the "Its a bargain at $500/month!!!!" narrative.

Once they admitted they are going to have to take money from folks who chop up journalists that made them feel sad, they proved the current pre token LLM based business model doesn't work. They haven't pulled the ads lever yet but the writing is on the wall.

Which means sadly only business with other revenue streams like M$, the Google, or Amazon can really afford it long term. I'm was rooting for Anthropic but it doesn't look good.

248. insane_dreamer ◴[29 Aug 25 13:48 UTC] No.45064100{7}[source]▶

>>45063421 #

Sure, I’m more productive with it in certain aspects of my work as well. Does that make it a net positive for humanity? From the energy consumption impact on climate change alone I would say the answer is clearly no. And that’s before we even talk about the impact on the next generation’s job opportunities. And tons of other issues like how Big Tech is behaving.

replies(3): >>45064267 #>>45064302 #>>45064338 #

249. AlexandrB ◴[29 Aug 25 13:48 UTC] No.45064103{3}[source]▶

>>45063171 #

I look forward to this setting getting turned on again "accidentally" when new models are released or the ToS is updated.

250. andrewmcwatters ◴[29 Aug 25 13:49 UTC] No.45064105[source]▶

>>45063736 #

A lot of people doing cat-and-mouse threat detection development are keeping their work outside of public LLMs right now, so it sounds like you’re in the same boat as a lot of us.

251. general1726 ◴[29 Aug 25 13:50 UTC] No.45064120[source]▶

>>45063306 #

I would expect that your whole code base would be then used as training data.

I think that it is only a matter of time before they will start reselling these data as exfiltrated IP to whoever will be interested.

252. y-curious ◴[29 Aug 25 13:50 UTC] No.45064136[source]▶

>>45063955 #

At least they didn't drop the "Don't be evil" clause quietly

253. thisOtterBeGood ◴[29 Aug 25 13:51 UTC] No.45064140[source]▶

>>45063736 #

This perfectly describes one of the biggest dillema with AI. Where does an AI company stop to utilize human knowledge it does not actually own. Where do they draw the line. Apparently it's possible there aren't any lines drawn at all.

replies(1): >>45064192 #

254. bubblyworld ◴[29 Aug 25 13:51 UTC] No.45064143{3}[source]▶

>>45064041 #

Under the FAQ:

> Previous chats with no additional activity will not be used for model training.

replies(4): >>45064247 #>>45064253 #>>45064399 #>>45064949 #

255. ravishi ◴[29 Aug 25 13:52 UTC] No.45064157{5}[source]▶

>>45063974 #

It's all PR. Some people won't read the details and just assume it will train on all data. Some people might complain and they tell it was a bug or a minor slip. And moving forward, after a few months, nobody will remember it was ever different. And some might vaguely remember them saying something about it at some point or something like that.

256. jsheard ◴[29 Aug 25 13:52 UTC] No.45064161{4}[source]▶

>>45063051 #

I wish I could be so optimistic that there is no private information published unintentionally or maliciously on the open web where crawlers can find it.

(and as diggan said, the web isn't the only source they use anyway. who knows what they're buying from data brokers.)

257. cactca ◴[29 Aug 25 13:53 UTC] No.45064167[source]▶

>>45063452 #

This! Any LLM provider that monitors chat/api history for ‘abuse’ towards the model is considering using user data for training.

An Effective Altruism ethos provides moral/ethical cover for trampling individual privacy and property rights. Consider their recent decision to provide services for military projects.

As others have pointed out, Claude was trained using data expressly forbidden for commercial reuse.

The only feedback Anthropic will heed is financial and the impact must be large enough to destroy their investors willingness to cover the losses. This type of financial feedback can come from three places: termination of a large fraction of their b2b contracts, software devs organizing a persistent mass migration to an open source model for software development. Neither of these are likely to happen in the next 3 months. Finally, a mass filing of data deletion requests from California and EU residents and corporations that repeats every week.

replies(1): >>45064702 #

258. Workaccount2 ◴[29 Aug 25 13:53 UTC] No.45064169[source]▶

>>45063737 #

A lot of people hold Anthropic as the "clean and ethical" AI company. They're not power hungry, they are focused on safety. Their aesthetic is cool modern valley vibes. Well grounded and in touch. They don't have the stench of Altman or the ominous presence of the tech giants. They make claude code which is the darling LLM of pure souled silicon valley.

replies(1): >>45065264 #

259. o_m ◴[29 Aug 25 13:53 UTC] No.45064168[source]▶

>>45063767 #

Mistral doesn't seem to train on user data for the non-free models, but you can opt out on the free models.

https://help.mistral.ai/en/articles/347617-do-you-use-my-use...

260. Dotnaught ◴[29 Aug 25 13:53 UTC] No.45064174[source]▶

>>45062683 (OP) #

AI-generated summaries should not be upvoted.

261. sneak ◴[29 Aug 25 13:55 UTC] No.45064192{3}[source]▶

>>45064140 #

You can’t own knowledge. Intellectual property is a legal fiction invented to prop up industries.

You can no more own knowledge or information than you can own the number 2.

replies(2): >>45065253 #>>45068920 #

262. mk89 ◴[29 Aug 25 13:55 UTC] No.45064193[source]▶

>>45062941 #

I can only think they have been doing it for a while and now they are trying to be compliant with whatever certificate requires it.

It's an AI company, why wouldn't they use the most precious data they have?

263. Workaccount2 ◴[29 Aug 25 13:56 UTC] No.45064204[source]▶

>>45063141 #

Generally if you are not paying full price for something (in this case paying API rates) you are covering the additional cost with your data. This is true for pretty much all modern services.

264. insane_dreamer ◴[29 Aug 25 13:56 UTC] No.45064206{3}[source]▶

>>45062930 #

It should be off be default, with the option to opt in.

265. ilc ◴[29 Aug 25 13:56 UTC] No.45064211[source]▶

>>45062683 (OP) #

It was nice knowing you Claude / Anthopic.

This type of behavior has a penalty, that penalty is trust. You lost it.

266. buzer ◴[29 Aug 25 13:56 UTC] No.45064212[source]▶

>>45063738 #

> An in-app notification pop-up will alert you to the change. You can opt out in the pop up.

The in-app notification that I got was a pop up which contained some buttons and some images. There was no text. Just in case it was some dark mode issue I checked the DOM and I couldn't find any text there either. I just clicked outside the modal and it went away. I assumed it was some announcement about some new feature and ignored it.

I did end up seeing the news yesterday in Reddit (I'm having issues getting the research tool to actually being used, tried to see if there was some recent changes) but it's unlikely that I was the only one who experienced modal the issue & if those didn't follow the tech news they could easily miss the change.

267. theshackleford ◴[29 Aug 25 13:57 UTC] No.45064214{3}[source]▶

>>45063676 #

> It's nice to see the newer models are suffering after being exposed to training on their own slop.

I've seen zero evidence anything of the such is occurring, and that if it was, it's due to what you claim. I'd be highly interested in research suggesting both or either is occurring however.

replies(1): >>45067192 #

268. discordance ◴[29 Aug 25 13:57 UTC] No.45064221{3}[source]▶

>>45064031 #

10 years ago, would you have believed that AI would have progressed to the point where it can code?

replies(1): >>45067724 #

269. soulofmischief ◴[29 Aug 25 13:58 UTC] No.45064227{4}[source]▶

>>45063117 #

Can you provide an example?

replies(1): >>45064505 #

270. mapontosevenths ◴[29 Aug 25 13:58 UTC] No.45064228{4}[source]▶

>>45063487 #

The "killer app" isn't here yet. Wait until smart glasses or watches with AI overtake cellphones as the primary method of human interaction with computers and most websites are replaced with API's that only AI's really ever use.

replies(1): >>45064441 #

271. SantalBlush ◴[29 Aug 25 14:00 UTC] No.45064247{4}[source]▶

>>45064143 #

That will be quietly removed later.

replies(1): >>45064365 #

272. Ardren ◴[29 Aug 25 14:00 UTC] No.45064248[source]▶

>>45063736 #

> Claude assists me in my math research.

Pulling up the ladder behind you :-)

replies(1): >>45065747 #

273. thejazzman ◴[29 Aug 25 14:00 UTC] No.45064249{7}[source]▶

>>45063672 #

my colleagues relying on it ruined the job for me and i quit. i became the debugging agent expected to constantly fix their half baked "it looks like it works" but doesn't nonsense

seriously, the idea we need this is a joke. people need it to pretend they can do their job. the rest of us enjoy having quick help from it. and we have done without it for a very long time already..

274. tsunamifury ◴[29 Aug 25 14:00 UTC] No.45064254{3}[source]▶

>>45062982 #

The data is no where near valuable enough without a new high value surface to use it, and so far chat is not it.

Merely selling data is extremely low value compared to also having the surface monopoly to monetize it in a very high engagement and decisioning space.

I feel like you don’t understand the fundamental mechanics of the ad world. Ultimately, the big 4 own such immense decisions surface area it may be a while before any AI model company can create a product the get there.

275. ratelimitsteve ◴[29 Aug 25 14:00 UTC] No.45064253{4}[source]▶

>>45064143 #

that "with no additional activity" seems backdoor-ish though. If I said some things in a chat expecting privacy per the agreement, then they change the agreement, does that mean they can collect my data from that chat going forward or does it mean they can collect it retroactively?

replies(1): >>45064382 #

276. nurettin ◴[29 Aug 25 14:00 UTC] No.45064257[source]▶

>>45062683 (OP) #

Oh heck no I will opt out thanks. I guess they can still steal my code by having a "reviewer" access my chats for "suspicious behavior".

277. bethekidyouwant ◴[29 Aug 25 14:00 UTC] No.45064258{6}[source]▶

>>45063982 #

We’ve reached recursive enshitification, I need a thought leader to tell me what’s next

replies(1): >>45064301 #

278. bethekidyouwant ◴[29 Aug 25 14:01 UTC] No.45064267{8}[source]▶

>>45064100 #

Was coming down from the trees a net positive for humanity?

replies(2): >>45065456 #>>45065496 #

279. bethekidyouwant ◴[29 Aug 25 14:02 UTC] No.45064275{4}[source]▶

>>45063592 #

is there money to be made? I thought they were all losing money…

replies(1): >>45064319 #

280. SoftTalker ◴[29 Aug 25 14:02 UTC] No.45064283[source]▶

>>45062683 (OP) #

I assumed they'd been doing it all along, regardless.

281. baxtr ◴[29 Aug 25 14:03 UTC] No.45064287[source]▶

>>45062941 #

AFAIK, Apple hasn’t done this yet.

282. ath3nd ◴[29 Aug 25 14:03 UTC] No.45064290[source]▶

>>45062683 (OP) #

Haha, grifters gonna grift.

In future news:

- Anthropic reverses stance on token limits, all plans cost double and limits are halved

- Anthropic introduces ad mode

- Anthropic partners with Palantir to deliver democracy at scale

This is getting to be more and more hilarious each day.

283. ratelimitsteve ◴[29 Aug 25 14:04 UTC] No.45064300{4}[source]▶

>>45063262 #

For social media at least it's important to remember that the users are the product, not the customer. Trying to squeeze additional revenue from your product is SOP.

284. SoftTalker ◴[29 Aug 25 14:04 UTC] No.45064301{7}[source]▶

>>45064258 #

It's shit all the way down.

replies(1): >>45064457 #

285. diggan ◴[29 Aug 25 14:04 UTC] No.45064302{8}[source]▶

>>45064100 #

> Does that make it a net positive for humanity?

That I don't know, and probably no one else, way too early to tell. I only responded to a comment stating "LLMs aren't a fundamental or core technology, they're an amusing party trick", which obviously I disagree with as for me they've been a fundamental shift in what I'm able to do.

> From the energy consumption impact on climate change alone I would say the answer is clearly no.

Ok, I guess that's fair enough. So if someone happens to use local models at home, in a home that is powered by solar power, then you'd feel LLM starting to be a net positive for humanity?

> And tons of other issues like how Big Tech is behaving.

This is such a big thing in general (that I agree with) but it has nothing to do with LLMs as a technology. Big Tech acts like they own the world and can do whatever they want with it, regardless if there are LLMs or not, so not sure why anyone would expect anything else.

replies(1): >>45066718 #

286. BearOso ◴[29 Aug 25 14:05 UTC] No.45064310{7}[source]▶

>>45063421 #

The LLM will tell you an approximation of what many responses on the Internet said the math should be, but you should have the knowledge and check if it's actually correct.

An LLM can give you a hazy picture, but it's your job to focus it.

replies(1): >>45065605 #

287. octagons ◴[29 Aug 25 14:05 UTC] No.45064314[source]▶

>>45062683 (OP) #

That’s unfortunate. I believed Anthropic was playing the long game and betting on a smaller but more technically proficient userbase.

I guess I’ll be canceling my subscription largely out of principal. I doubt any open-source models are capable of handling my use case as well as Claude (typically focused on getting up to speed with various ISO/IEEE standards for the purpose of security testing) but I’m sure I’ll find a solution.

replies(1): >>45064474 #

288. vntok ◴[29 Aug 25 14:06 UTC] No.45064319{5}[source]▶

>>45064275 #

Investments are all about "losing" money first to make money later, it's not a paradox.

289. Klonoar ◴[29 Aug 25 14:07 UTC] No.45064328{3}[source]▶

>>45063744 #

Well, theoretically they won’t.

Anyone who’s worked in an engineering team is familiar with someone forgetting to check ‘if(doNotDoThisCondition)’.

This is why (among many other reasons) opt-in is more user respecting here than opt-out.

replies(1): >>45064436 #

290. azinman2 ◴[29 Aug 25 14:07 UTC] No.45064332[source]▶

>>45063306 #

Just unclick the box to opt out. They put it front and center.

291. bdangubic ◴[29 Aug 25 14:07 UTC] No.45064338{8}[source]▶

>>45064100 #

there is little-to-nothing we do day-to-day (ESPECIALLY Big Tech related) that is net positive for society

292. azinman2 ◴[29 Aug 25 14:09 UTC] No.45064362[source]▶

>>45062683 (OP) #

Feels like a lot of overreactions here amongst people who haven’t seen the pop up. The put the opt out front and center — it’s not buried at all. They make it very clear in the language, including commitments to data transparency. I’m not surprised they need to do this, and this is probably the best possible way to achieve it while balancing privacy.

Everyone is so cynical these days.

293. SoftTalker ◴[29 Aug 25 14:09 UTC] No.45064365{5}[source]▶

>>45064247 #

All your data are belong to us.

294. xmorse ◴[29 Aug 25 14:10 UTC] No.45064375[source]▶

>>45062683 (OP) #

Good. LLMs progress is too stagnant, it was time they started to play seriously

295. losvedir ◴[29 Aug 25 14:10 UTC] No.45064382{5}[source]▶

>>45064253 #

Well you have to think about LLMs actually work. There is no "going forward". Every new token is generated based on the entire context window (chat history).

replies(1): >>45064576 #

296. iaw ◴[29 Aug 25 14:12 UTC] No.45064395{3}[source]▶

>>45064034 #

> claude doesn't really know anything other than it's training data

I've seen cases where Claude demonstrates novel behaviors or combines existing concepts in new ways based on my input. I don't think it's as simple as memorization anymore.

replies(1): >>45064788 #

297. dheatov ◴[29 Aug 25 14:12 UTC] No.45064399{4}[source]▶

>>45064143 #

* Randomly load up previous chat as the default and just wait for bing pot. * "Tiny oopsie doopsie, our bad."

298. zajio1am ◴[29 Aug 25 14:12 UTC] No.45064402[source]▶

>>45062905 #

Why? This is not 'use collected information to targed ads', or 'sell collected information to third parties', but 'use collected information from the service to improve the service'. Does not really seems to me much different than ISPs using traffic stats to plan infrastructure improvements, or a website using access logs to improve accessibility and navigation.

And when talking specifically about AI, one could argue that learning from interactions is a common aspect of intelligence, so a casual user who do not understand details about LLMs would expect so anyways. Also, the fact that LLMs (and other neural networks) have distinct training and inference phases seems more like an implementation detail.

299. questionableans ◴[29 Aug 25 14:12 UTC] No.45064403{5}[source]▶

>>45063148 #

I would love to see less pay inequality, but unfortunately, the median household in the US really doesn’t have it great due to the costs and risks of everyday life.

For comparison, I live in a place that is typically considered as tier 3 or 4 out of 4 in the US by employers (4 being the cheapest). Costs of living are honestly more like tier 2 cities, but it’s a small city in a poor state. 7 years ago, the going rate for an unlicensed handyman was $32/hour, often paid under the table in cash (I don’t have more recent numbers because I find DIY better and easier than hiring someone reliable).

300. ornornor ◴[29 Aug 25 14:13 UTC] No.45064407{3}[source]▶

>>45062930 #

It’s not. And also whether you move the toggle to on or off, you still have to click accept which really isn’t clear whether you’re accepting to share your data or not.

Never mind the complete 180 on privacy.

301. bluecalm ◴[29 Aug 25 14:14 UTC] No.45064416[source]▶

>>45063736 #

Math research or anything new/clever in a particular niche. Imagine you optimized a piece of code to get an advantage or came up with some clever trick to solve a common problem in your niche and then everyone gets it from free from Claude believing, as you pointed out, that it's now their work.

I had this exact conversation with my business partner a few days ago. Our "secret sauce" might not be worth that much after many years but still I am not comfortable exposing it to Claude. Fortunately it's very easy to separate in our project so Claude gets the other parts and is very helpful.

302. Deegy ◴[29 Aug 25 14:15 UTC] No.45064428[source]▶

>>45063736 #

If your work was truly novel, wouldn't the odds of it showing up in later models be extremely low given that these are probabilistic?

In a sense these machines are outputting the aggregate of the collective thoughts of the commons. In order for concepts to be output they have to be quite common in the training data. Which works out kind of nice for privacy and innovation because by the time concepts are common enough to show up through inference they probably deserve to be part of the public knowledge (IP aside).

replies(1): >>45064597 #

303. SoftTalker ◴[29 Aug 25 14:15 UTC] No.45064436{4}[source]▶

>>45064328 #

Forgetting. Riiighht.

304. simpaticoder ◴[29 Aug 25 14:15 UTC] No.45064437{3}[source]▶

>>45064034 #

There is a stark difference between using the public web to do research and searching through your colleagues' private notebooks and discussions to do research.

305. echelon ◴[29 Aug 25 14:16 UTC] No.45064441{5}[source]▶

>>45064228 #

> most websites are replaced

This is already happening.

> Wait until smart glasses or watches with AI overtake cellphones

Smartphones are crystalized perfection. It's such a peak design. The size, form factor, sensors, input/output modalities, and generalization are perfect. The reason companies are trying to supplant it is that they need to get out from under Google and Apple's control. It's not that anything is wrong with the smartphone.

VR has a long way to go in terms of hardware problems.

XR/AR is ridiculous. It's creepy, unstylish, and the utility is highly questionable. Nobody is going to want to be a walking ad.

replies(1): >>45065406 #

306. lenerdenator ◴[29 Aug 25 14:16 UTC] No.45064448{4}[source]▶

>>45063262 #

That's just shareholder capitalism, dude.

307. superposeur ◴[29 Aug 25 14:16 UTC] No.45064455[source]▶

>>45062683 (OP) #

Everyone seems to be unsurprised by this move, but I’m genuinely shocked. What a shoot your own foot business decision. Google, evil though it be, doesn’t post the text of your gmails in its search results because who would consider using Gmail after that? This is the llm equivalent. Am I missing something?

replies(7): >>45064592 #>>45064626 #>>45064638 #>>45064681 #>>45064737 #>>45064752 #>>45065348 #

308. lenerdenator ◴[29 Aug 25 14:17 UTC] No.45064457{8}[source]▶

>>45064301 #

Well, at least until you reach turtles.

309. threetonesun ◴[29 Aug 25 14:17 UTC] No.45064463{3}[source]▶

>>45063645 #

Seems hard to believe legal teams at corporations are going to forget this in a week. I've always assumed the market play for these companies was spinning off an "Amazon basics" version of other companies software, this seems like another step towards that.

310. smohare ◴[29 Aug 25 14:17 UTC] No.45064464{7}[source]▶

>>45063421 #

You think you are “nailing it” but also lack the background to even determine whether that is the case. I can assure you, there’s likely some fundamental flaws in what you’re vibing.

Just think about the type of code these things are trained on and the fact you’re clearly some random non-specialist.

replies(1): >>45065617 #

311. SoftTalker ◴[29 Aug 25 14:18 UTC] No.45064474[source]▶

>>45064314 #

This is all you can do. Laws and regulations will not be forthcoming, and even if some are, they will be ignored or the fines paid as a cost of doing business.

Any data you give to any website or app is no longer (exclusively) yours. Use these services under that assumption.

312. frereubu ◴[29 Aug 25 14:19 UTC] No.45064481[source]▶

>>45062683 (OP) #

I went to claude.ai and saw a pop-over window with this choice, clicked to opt out and saw a "server error" alert. Then went to Settings > Privacy and the option wasn't there. I had to click something else (can't remember now) to see the option, which wasn't opted out (which kinda made sense because of the error message) and the option is now in Settings > Privacy. I'm definitely someone who believes that cock-up is more likely than conspiracy for things like this, but this process should have been bulletproof - no initial error message, option always in Settings > Privacy - given the sensitivities involved.

313. SoftTalker ◴[29 Aug 25 14:20 UTC] No.45064493{3}[source]▶

>>45064012 #

For now. Once this settles in they will change the terms again, with no opt-out option.

replies(1): >>45065110 #

314. __MatrixMan__ ◴[29 Aug 25 14:20 UTC] No.45064495{5}[source]▶

>>45063793 #

Stack overflow doesn't really have a legitimate claim to that data either though. Nor do the users, we're just pasting error messages and documentation. It's derivative all the way down. It'll never sit still and behave like property.

Privacy makes sense, treating data like property does not.

replies(1): >>45065641 #

315. riz_ ◴[29 Aug 25 14:20 UTC] No.45064505{5}[source]▶

>>45064227 #

https://www.cnil.fr/en/dark-patterns-cookie-banners-cnil-iss...

replies(1): >>45066715 #

316. SoftTalker ◴[29 Aug 25 14:22 UTC] No.45064517{3}[source]▶

>>45063221 #

You're absolutely right, but also isn't the volume of new data they are getting from chats tiny compared to what they've already trained on? I'm wondering how much difference it will really make.

317. Barrin92 ◴[29 Aug 25 14:22 UTC] No.45064522{4}[source]▶

>>45063648 #

>That's a trillion dollar market

not if you have to constantly expend enormous sums to stay ahead of your competition or otherwise you lose your edge. It's not the best coding model because they got some mystical treasure in their basement. It's so rapidly becoming a commodity that at some point Microsoft or Google will just offer just as good a model for free and like search they'll just start milking people with ads.

That's likely one of the reasons for the shifting privacy stances, not just for training but because monetization of the product itself is probably looking pretty dim in the long run.

318. tjr ◴[29 Aug 25 14:22 UTC] No.45064525{7}[source]▶

>>45063672 #

Unless one's job expectations have been altered to demand LLM-quantity output, how could someone be reliant upon these tools now? What were they doing two years ago (or maybe even six months ago)?

I can understand becoming reliant on a technology -- I expect most programmers today would be pretty lost with punch cards or line editors -- but LLM coding seems too new for true reliance to have formed yet...?

319. Deegy ◴[29 Aug 25 14:23 UTC] No.45064530[source]▶

>>45062683 (OP) #

I guess I'll take the other side of what most are arguing in this thread.

Isn't it a great thing for to us to collectively allow LLM's to train on past conversations? LLM's probably won't get significantly better without this data.

That said I do recognize the risk of only a handful of companies being responsible for something as important as the collective knowledge of civilization.

Is the long term solution self custody? Organizations or individuals may use and train models locally in order to protect and distribute their learnings internally. Of course costs have to come down a ridiculous amount for this to be feasible.

replies(8): >>45064563 #>>45064781 #>>45064999 #>>45065881 #>>45066363 #>>45068149 #>>45069438 #>>45072552 #

320. Romario77 ◴[29 Aug 25 14:23 UTC] No.45064540{3}[source]▶

>>45062982 #

Anthropic enterprise share is pretty significant - on order of 30%. I think at this time it's pretty significant.

I am expecting AI companies to start using ads, it's inevitable as they need to make money at some point and $20 a month won't do it.

For ads the number of users is the main thing - the more users you have the bigger the market and more money you could earn. Google desperately needs to be in this space, that's why they are throwing a ton of money on AI.

replies(1): >>45069045 #

321. BalinKing ◴[29 Aug 25 14:24 UTC] No.45064551{4}[source]▶

>>45063179 #

I can confirm it's grey on both sides on the website.

replies(1): >>45065542 #

322. SoftTalker ◴[29 Aug 25 14:24 UTC] No.45064557[source]▶

>>45063737 #

I believe it shouldn't happen. I'm not surprised. There is no web company (LLM or otherwise) I trust completely, let alone any of the major ad-funded ones, or the ones who are not yet profitable and desperately searching for ways to become so.

replies(1): >>45065294 #

323. monsieurbanana ◴[29 Aug 25 14:25 UTC] No.45064563[source]▶

>>45064530 #

You mean collectively allow us to train Claude's llm? Pretty big omission there

replies(1): >>45064621 #

324. ratelimitsteve ◴[29 Aug 25 14:26 UTC] No.45064576{6}[source]▶

>>45064382 #

then you can retroactively implicitly opt in for processing, and that's a dark pattern if I've ever heard of one

325. SoftTalker ◴[29 Aug 25 14:27 UTC] No.45064591{7}[source]▶

>>45063825 #

Yes, because race is a protected class.

If they were charging wealthy people 0.50 more on average because the model showed that they don't care about price that much, they would be fine.

replies(1): >>45066128 #

326. KoolKat23 ◴[29 Aug 25 14:27 UTC] No.45064592[source]▶

>>45064455 #

This data is useful for reinforcement learning. All the others do it.

And most importantly, you can just opt-out.

replies(3): >>45064613 #>>45064705 #>>45064753 #

327. bluecalm ◴[29 Aug 25 14:28 UTC] No.45064597{3}[source]▶

>>45064428 #

They might optimize learning to weight novel/unexpected parts more in the future. The better the models become (the more the expect) the more value they will get from unexpected/new ideas.

replies(1): >>45064686 #

328. turnsout ◴[29 Aug 25 14:29 UTC] No.45064613{3}[source]▶

>>45064592 #

You can't opt out of the data retention policy.

replies(1): >>45064734 #

329. croes ◴[29 Aug 25 14:30 UTC] No.45064620{7}[source]▶

>>45063538 #

>It's like asking a child to keep your candy safe

That's why we don't hand billions of dollars to a child. Maybe we should treat AI companies similar.

330. Deegy ◴[29 Aug 25 14:30 UTC] No.45064621{3}[source]▶

>>45064563 #

I believe I addressed that in my third paragraph?

It does suck that there are only a few companies with enough resources to offer these models. But it's hard to escape the power laws.

I'm hoping that costs come down to the point where these things are basically a commodity with thousands of providers.

replies(1): >>45066543 #

331. lkbm ◴[29 Aug 25 14:30 UTC] No.45064623[source]▶

>>45063059 #

Putting a privacy setting under Settings->Privacy is making it "pretty hard to find"? Where else did you look first?

332. shadowgovt ◴[29 Aug 25 14:30 UTC] No.45064626[source]▶

>>45064455 #

The LLM equivalent is what Google does do, which is train its spam filters on the contents of your emails coupled to the signal of what human beings flag as spam.

(It was one of the first significant value-adds of GMail: at its scale, Google could create a global-concept understanding of the content and pattern of spam across hundreds of millions of users. That was the kind of Big Data that made it possible to build filters where one could confidently say "This is tuned on all spam in the wild, because we've seen all spam in the wild").

333. aleph_minus_one ◴[29 Aug 25 14:32 UTC] No.45064638[source]▶

>>45064455 #

> Am I missing something?

I think you do:

According to the article https://www.perplexity.ai/page/anthropic-reverses-privacy-st...

"Enterprise and educational customers will continue operating under their existing privacy protections, as the policy changes specifically exclude Claude for Work and Claude for Education services. These commercial accounts remain governed by separate contractual agreements that maintain stricter data handling standards.

Organizations using Claude through business partnerships or educational licenses can continue their operations without concern for the new training policies affecting their sensitive communications or proprietary information."

Thus, I think your claim

> What a shoot your own foot business decision.

likely does not hold: the non-commercial accounts likely led to Anthropic loosing money, so they are not liked by Anthropic anyway (but are a an "inconvenient necessity" to get people to notice and try out your product offering). With this new decision, Anthropic makes this "free-riding" less attractive.

I bet that Anthropic will soon release a press statement (that exists in the drawers for quite a long time) "We are listening to your concerns, and will thus extend our 'privacy-conscious offering' to new groups of customers. Only 30 $ per month."

replies(3): >>45064825 #>>45064832 #>>45065041 #

334. ChrisArchitect ◴[29 Aug 25 14:33 UTC] No.45064650[source]▶

>>45062683 (OP) #

[dupe] https://news.ycombinator.com/item?id=45053806

335. ttoinou ◴[29 Aug 25 14:34 UTC] No.45064667[source]▶

>>45063306 #

I think this is about claude.ai , not Claude Code nor Claude models API (but I'm not 100% sure). So no, unless you copy paste your code into claude.ai

replies(1): >>45064831 #

336. rs186 ◴[29 Aug 25 14:35 UTC] No.45064681[source]▶

>>45064455 #

Gmail used to serve ads based on your emails for many years until 2017. https://www.npr.org/sections/thetwo-way/2017/06/26/534451513...

replies(1): >>45064769 #

337. Deegy ◴[29 Aug 25 14:35 UTC] No.45064686{4}[source]▶

>>45064597 #

Good point. But can the models even behave that way? They depend on probability. If they put a greater weight on novel/unexpected outputs don't they just become undependable hallucination machines? Despite what some people think, these models can't reason about a concept to determine it's validity. They depend on recurring data in training to determine what might be true.

That said, it would be interesting to see a model tuned that way. It could be marketed as a 'creativity model' where the user understands there will be a lot of junk hallucination and that it's up to them to reason whether a concept has validity or not.

replies(2): >>45064914 #>>45073377 #

338. weregiraffe ◴[29 Aug 25 14:35 UTC] No.45064689{3}[source]▶

>>45062975 #

>What will you use instead? I’m finding Claude the best experience since ChatGPT 5 is so slow and not any better answers than 4.

You could try programming with your own brain

replies(1): >>45064821 #

339. 34679 ◴[29 Aug 25 14:36 UTC] No.45064702{3}[source]▶

>>45064167 #

Maybe I'll use the remainder of my subscription time to help improve Void. It's already pretty good.

https://voideditor.com/

https://github.com/voideditor/void

340. superposeur ◴[29 Aug 25 14:36 UTC] No.45064705{3}[source]▶

>>45064592 #

Ok, to be clear, let’s say I’m dumb and accidentally go with the default (I get the color of the opt out button wrong or something). As if there’s a “publish my private emails to the internet” default-on button in email. Then, I use it to edit a rec letter for student X, with my signature Y. (Yes I know this is dumb and I try changing names when editing but am sure some actual names may slip through.) A few months later the next model is released trained on the data. Student X asks Claude what Y would write in a rec letter about X. Such a button is a “wings stay on / wings fall off” button on a plane.

replies(1): >>45064868 #

341. ◴[29 Aug 25 14:36 UTC] No.45064709{3}[source]▶

>>45063116 #

342. smca ◴[29 Aug 25 14:38 UTC] No.45064734{4}[source]▶

>>45064613 #

The data retention period is 30 days if you don't choose to improve model training. https://www.anthropic.com/news/updates-to-our-consumer-terms...

replies(1): >>45064813 #

343. ◴[29 Aug 25 14:39 UTC] No.45064737[source]▶

>>45064455 #

344. einpoklum ◴[29 Aug 25 14:40 UTC] No.45064752[source]▶

>>45064455 #

Google mines the bejeezus out of your email, and uses it to any number of ends, including manipulating you into buying things, and also passing your correspondence on to the US government. While this is not the same as outright making your emails universally searchable - training Claude on your emails is also not the same as posting their contents.

And - this behavior of Google's has not been penalized, I'm afraid.

345. behnamoh ◴[29 Aug 25 14:40 UTC] No.45064753{3}[source]▶

>>45064592 #

Just because all the others do it doesn’t make it right. Many users chose Anthropic exactly because they were not like the others.

replies(3): >>45065150 #>>45065215 #>>45071208 #

346. freejazz ◴[29 Aug 25 14:40 UTC] No.45064759[source]▶

>>45063064 #

It's not alleged that they stole the content. They told the courts they pirated the materials.

replies(1): >>45066204 #

347. troad ◴[29 Aug 25 14:40 UTC] No.45064763{4}[source]▶

>>45063357 #

The original meaning of sane is "physically healthy". Its usual modern meaning is "mentally healthy". You're using it to mean "reasonable".

At which exact point is language prohibited from evolving, and why it super coincidentally the exact years you learnt it?

replies(2): >>45064883 #>>45066803 #

348. skylurk ◴[29 Aug 25 14:41 UTC] No.45064769{3}[source]▶

>>45064681 #

And in 2010 they made https the default. Different times :)

replies(1): >>45064912 #

349. ljosifov ◴[29 Aug 25 14:41 UTC] No.45064773[source]▶

>>45062683 (OP) #

Excellent. What were they waiting for up to now?? I thought they already trained on my data. I assume they train, even hope that they train, even when they say they don't. People that want to be data privacy maximalists - fine, don't use their data. But there are people out there (myself) that are on the opposite end of the spectrum, and we are mostly ignored by the companies. Companies just assume people only ever want to deny them their data.

It annoys me greatly, that I have no tick box on Google to tell them "go and adapt models I use on my Gmail, Photos, Maps etc." I don't want Google to ever be mistaken where I live - I have told them 100 times already.

This idea that "no one wants to share their data" is just assumed, and permeates everything. Like soft-ball interviews that a popular science communicator did with DeepMind folks working in medicine: every question was prefixed by litany of caveats that were all about 1) assumed aversion of people to sharing their data 2) horrors and disasters that are to befall us should we share the data. I have not suffered any horrors. I'm not aware of any major disasters. I'm aware of major advances in medicine in my lifetime. Ultimately the process does involve controlled data collection and experimentation. Looks a good deal to me tbh. I go out of my way to tick all the NHS boxes too, to "use my data as you see fit". It's an uphill struggle. The defaults are always "deny everything". Tick boxes never go away, there is no master checkbox "use any and all of my data and never ask me again" to tick.

replies(16): >>45064814 #>>45064872 #>>45064877 #>>45064889 #>>45064911 #>>45064921 #>>45064967 #>>45064974 #>>45064988 #>>45065001 #>>45065005 #>>45065065 #>>45065128 #>>45065333 #>>45065457 #>>45065554 #

350. freejazz ◴[29 Aug 25 14:42 UTC] No.45064781[source]▶

>>45064530 #

> LLM's probably won't get significantly better without this data.

Yeah and Facebook couldn't scale without ignoring the harms it causes people. Should we just let that be? Society seems to think so but I don't think it's a good idea at all.

351. ffsm8 ◴[29 Aug 25 14:42 UTC] No.45064788{4}[source]▶

>>45064395 #

If I am standing in Finland and look out on the ocean, and the whole sky is green... Is the sky actually green?

You're equating your own perspective as objective truth, which is a very common pitfall and fallacy

352. ethagnawl ◴[29 Aug 25 14:43 UTC] No.45064792[source]▶

>>45062852 #

I wonder what happens if I don't accept the new T&C? I've been successfully dismissing an updated T&C prompt in a popular group messaging application for years -- I lack the time and legal acumen to process it -- without issue.

Also, for others who want to opt-out, the toggle is in the T&C modal itself.

replies(2): >>45065037 #>>45065092 #

353. imiric ◴[29 Aug 25 14:44 UTC] No.45064804[source]▶

>>45063064 #

Yeah, this is hardly surprising.

To AI companies, data is even more of a gold mine than to adtech companies. It is existentially important.

The truly evil behavior will emerge at the intersection of these two industries. I'm sure Google and Facebook are already using data from one to power the other, even if it's currently behind closed doors. I can hardly wait for the use cases these geniuses will think of once this is publicly acceptable and in widespread use by all companies.

354. turnsout ◴[29 Aug 25 14:44 UTC] No.45064813{5}[source]▶

>>45064734 #

Oh, I didn't catch this—that's good news

355. 12ian34 ◴[29 Aug 25 14:44 UTC] No.45064814[source]▶

>>45064773 #

not remotely worried about leaks, hacks, or sinister usage of your data?

replies(3): >>45064920 #>>45065057 #>>45072864 #

356. ◴[29 Aug 25 14:45 UTC] No.45064821{4}[source]▶

>>45064689 #

357. layer8 ◴[29 Aug 25 14:45 UTC] No.45064825{3}[source]▶

>>45064638 #

Well, it means that LLMs used for business use cases will be trained on input from non-business use cases of non-privacy-conscious users.

358. frm88 ◴[29 Aug 25 14:45 UTC] No.45064824{8}[source]▶

>>45063733 #

Bravo! This has to be the most coherent and well-formulated rant I have read in a longtime. Thank you!

359. adastra22 ◴[29 Aug 25 14:45 UTC] No.45064831{3}[source]▶

>>45064667 #

Claude Code is explicitly included.

360. ceroxylon ◴[29 Aug 25 14:45 UTC] No.45064832{3}[source]▶

>>45064638 #

> With this new decision, Anthropic makes this "free-riding" less attractive

Certainly not for any users like you and me, it takes two seconds and three clicks to review the new terms and decline chat training. This is more like Anthropic getting easy training from people who are unaware or don't care.

replies(1): >>45065025 #

361. jmward01 ◴[29 Aug 25 14:46 UTC] No.45064842[source]▶

>>45062683 (OP) #

I just opted out. And then canceled my plan. The 5 year retention isn't part of the opt out and represents way too juicy of a target for them. Some time in the next 5 years another TOS change will happen, and another and another and eventually there won't be an opt out or I won't realize it and accidentally click yes. Privacy first. Period. Pay me to opt in and I may consider it.

362. adastra22 ◴[29 Aug 25 14:46 UTC] No.45064844[source]▶

>>45063306 #

Yes, your code is part of the chat context. You cannot use Claude code without sharing your repo with them.

363. franga2000 ◴[29 Aug 25 14:48 UTC] No.45064868{4}[source]▶

>>45064705 #

You're severely overestimating the ability of the model to recall a single mostly uninteresting item from it's billions of input documents.

364. j4hdufd8 ◴[29 Aug 25 14:48 UTC] No.45064872[source]▶

>>45064773 #

> But there are people out there (myself) that are on the opposite end of the spectrum, and we are mostly ignored by the companies. Companies just assume people only ever want to deny them their data.

What? I think you're exactly the kind of person that companies pay attention to, and why they pull moves like this

replies(1): >>45072923 #

365. disconcision ◴[29 Aug 25 14:49 UTC] No.45064875{3}[source]▶

>>45062936 #

this is somewhat true but i'm not sure how load bearing it is. for one, i think it's going to be a while until 'we asked the model what bob said' is as admissible as the result of a database query

366. blipmusic ◴[29 Aug 25 14:49 UTC] No.45064877[source]▶

>>45064773 #

My life does in fact have priorities above ”LLMs should work a bit better”.

367. danans ◴[29 Aug 25 14:49 UTC] No.45064883{5}[source]▶

>>45064763 #

> At which exact point is language prohibited from evolving

Never?

https://en.m.wikipedia.org/wiki/Semantic_change

replies(1): >>45064926 #

368. j4hdufd8 ◴[29 Aug 25 14:49 UTC] No.45064889[source]▶

>>45064773 #

Are you okay getting ads for shit holistic medication because you had a mental health conversation with AI?

369. Razengan ◴[29 Aug 25 14:51 UTC] No.45064910[source]▶

>>45062683 (OP) #

I wanted to try it out so I bought a Pro subscription. No way to purchase through iOS so I had to enter my card number. I cancelled renewal and tried to remove my payment method as per usual practice, but I couldn't. Tried to find a way to get support but the send button on their chatbot box was disabled for me (on Safari if that matters). Regardless of the quality of the AI, that experience ticked me off enough to file a "F you" chargeback with my bank.

370. imiric ◴[29 Aug 25 14:51 UTC] No.45064911[source]▶

>>45064773 #

What a ridiculous stance.

Do you lock your front door, or use passwords on any of your accounts? Because what you're essentially saying is that you're OK with strangers having access to your personal information. That's beyond the already flawed "I have nothing to hide" argument.

replies(1): >>45064928 #

371. j4hdufd8 ◴[29 Aug 25 14:51 UTC] No.45064912{4}[source]▶

>>45064769 #

I don't think https is responsible for that. Google owns the data, it doesn't matter how it is transported. It does, however, matter how it is stored (which I hope is encrypted in a way only you can retrieve it)

372. ceroxylon ◴[29 Aug 25 14:51 UTC] No.45064914{5}[source]▶

>>45064686 #

Temperature plays a large role in fine tuning model output, you're correct that there is a theoretical sweet spot:

https://towardsdatascience.com/a-comprehensive-guide-to-llm-...

373. londons_explore ◴[29 Aug 25 14:51 UTC] No.45064920{3}[source]▶

>>45064814 #

I would far prefer the service use my data to work better and take a few privacy risks.

People die all the time from cancer or car accidents. People very rarely die from data leaks.

Some countries like Sweden make people's private financial data public information - and yet their people seem happier than ever. Perhaps privacy isn't as important as we think for a good society.

replies(6): >>45065000 #>>45065055 #>>45065141 #>>45065772 #>>45065823 #>>45066321 #

374. franga2000 ◴[29 Aug 25 14:52 UTC] No.45064921[source]▶

>>45064773 #

Try living in a place with privatised health insurance and you'll quickly see why medical data is some of the most important to keep private.

375. troad ◴[29 Aug 25 14:52 UTC] No.45064926{6}[source]▶

>>45064883 #

Yes, that was my point.

replies(1): >>45066797 #

376. JumpCrisscross ◴[29 Aug 25 14:52 UTC] No.45064928{3}[source]▶

>>45064911 #

> Do you lock your front door

In trusted neighborhoods? No. But that respect goes both ways.

377. bilsbie ◴[29 Aug 25 14:53 UTC] No.45064932[source]▶

>>45062683 (OP) #

I’m generally a privacy maniac on most issues but I like the idea in playing my part in training future AI’s.

As long as they’re up front about it, it seems ok. Maybe providing a privacy toggle would be good.

It’s also good as it forces corporations to invest in offline LLM’s which is better for everyone.

378. cozzyd ◴[29 Aug 25 14:54 UTC] No.45064949{4}[source]▶

>>45064143 #

they can ... generate activity :)

379. ◴[29 Aug 25 14:55 UTC] No.45064955[source]▶

>>45062852 #

380. bilsbie ◴[29 Aug 25 14:55 UTC] No.45064963[source]▶

>>45062683 (OP) #

Offshoot topic but I had an idea for LLM privacy.

What if the first layer (or couple layers) were processed locally on the users machine and then it goes to the provider to process the remaining layers.

You could also process the last layer on the users machine.

It’s hard to say what kind of privacy this gives users. I don’t think they could reverse out exactly what the input was.

replies(1): >>45065075 #

381. behnamoh ◴[29 Aug 25 14:55 UTC] No.45064967[source]▶

>>45064773 #

Are you trolling us or do you live in a hypothetical world where companies have our best interests in heart?

382. beepbooptheory ◴[29 Aug 25 14:56 UTC] No.45064974[source]▶

>>45064773 #

This may very well be a rational stance, but either way, wish one could somehow teleport this sentiment to the Cypherpunk mailing list in the 80s/90s. Of all the things they projected, concocted, fought for.. Nothing could truly prepare them for this kind of thing: the final victory of the product over the people, the happy acceptance of surveillance. They were all imagining terrible dystopias garnered from state violence and repression, never could they begin to imagine it could all transpire anyway because people like not having to type their address in!

> The angel would like to stay, awaken the dead, and make whole what has been smashed. But a storm is blowing from Paradise; it has got caught in his wings with such violence that the angel can no longer close them. The storm irresistibly propels him into the future to which his back is turned, while the pile of debris before him grows skyward. This storm is what we call progress.

replies(1): >>45065638 #

383. ◴[29 Aug 25 14:56 UTC] No.45064980{3}[source]▶

>>45063116 #

384. energy123 ◴[29 Aug 25 14:57 UTC] No.45064986[source]▶

>>45062852 #

Has anyone asked why OpenAI has two very separate opt-out mechanisms (one in settings, the other via a formal request that you need to lodge via their privacy or platform page)? That always seemed likely to me to be hiding a technicality that allows them to train on some forms of user data.

385. 827a ◴[29 Aug 25 14:57 UTC] No.45064988[source]▶

>>45064773 #

Its incredible to me how seriously people can hold an opinion they've so clearly critically interrogated so little.

replies(1): >>45065758 #

386. nicce ◴[29 Aug 25 14:57 UTC] No.45064996[source]▶

>>45062852 #

OpenAIs temporary chat still advertises that chats are stored for 30 days while there is court order that everything must be retained indefinitely. I wonder why they are not obligated to state this quite extreme retention.

387. jimbokun ◴[29 Aug 25 14:57 UTC] No.45064999[source]▶

>>45064530 #

It's not clear that most people will benefit from LLMs getting significantly better. It's looking more like a net negative.

388. soiltype ◴[29 Aug 25 14:58 UTC] No.45065000{4}[source]▶

>>45064920 #

public/private isn't a binary, it's a spectrum. we Americans mostly sit in the shithole middle ground where our data is widely disseminated among private, for-profit actors, for the explicit purpose of being used to manipulate us, but it's mostly not available to us, creating an assymmetric power balance.

replies(1): >>45066547 #

389. Razengan ◴[29 Aug 25 14:58 UTC] No.45065001[source]▶

>>45064773 #

> I'm not aware of any major disasters.

Oh boy. Did you somehow miss all the news about data leaks and password dumps etc. being sold on the "dark" web and shit?

Would you mind if I followed you around and noted everything you do and constantly demanded your attention?

The shit done by corporations is akin to a clingy stalker and would be absolutely despised if it was an individual person doing something like that.

As for benefits, which?? In my entire life I have never seen an ad for anything (that I did not already know about via other means) that made me want to look up the product, let alone buy it. Nor do I know anyone who did. In fact, it turns me off from a product if its ad appears too frequently.

Google etc. and various storefronts also almost never recommend me anything that actually matches my interests, beyond just a shallow word similarity, in fact they forcibly shove completely unrelated shit into my searches cause they were paid to. Like searching for RPGs and seeing Candy f'ing Crush.

----

You know what though, I kinda agree with the potential intent behind your charade:

Yes, LET ME TELL YOU ABOUT ME.

I will gladly TELL companies EXACTLY what I like, and I WANT you to use that. Show me other shit that is actually relevant to MY interests instead of the interests of whomever paid you to shove their shit into my face.

ASK! DON'T SPY! Because you can't ever get it right anyway!

390. calmbonsai ◴[29 Aug 25 14:58 UTC] No.45065005[source]▶

>>45064773 #

I don't think you understand how...humanity works?! Is this deliberate parody?

Abuse of medical data is just the tip of the iceberg here and, at least in the states, privatized healthcare presents all sorts of for-profit pricing abuse scenarios let alone nasty scenarios for social coercion.

replies(1): >>45072994 #

391. ethagnawl ◴[29 Aug 25 14:58 UTC] No.45065006{3}[source]▶

>>45063725 #

> Not to mention your drop-in-the-bucket contribution will have next to no influence in the next model. It won't catch things specific to YOUR workflow, just common stuff across many users.

I wonder about this. In the future, if I correct Claude when it makes fundamental mistakes about some topic like an exotic programming language, wouldn't those corrections be very valuable? It seems like it should consider the signal to noise ratio in these cases (where there are few external resources for it to mine) to be quite high and factor that in during its next training cycle.

392. 34679 ◴[29 Aug 25 14:58 UTC] No.45065010[source]▶

>>45062683 (OP) #

It has a toggle for opting out, but the buttons available are "Accept" and "Not Now". So, if I toggle off but click "Accept", does that accept what I toggled or accept data sharing? If I click "Not Now", do they leave it set at the default Opt-in?

I hate this shit and I'm cancelling now.

https://imgur.com/a/oCw5eEp

replies(1): >>45065196 #

393. soiltype ◴[29 Aug 25 15:00 UTC] No.45065025{4}[source]▶

>>45064832 #

Seems the same thing. They're giving plausible deniability, but knowing they'll still scoop up a worthwhile amount of data/profit from some % of users.

394. ethagnawl ◴[29 Aug 25 15:00 UTC] No.45065027[source]▶

>>45062683 (OP) #

I just canceled my Pro Plan yesterday because I don't find it to be a good value and I'm trying to rein in recurring charges. I really wish I'd been able to cite this development as a reason for why I was cancelling my plan.

395. grim_io ◴[29 Aug 25 15:00 UTC] No.45065033{3}[source]▶

>>45063900 #

Can I trouble you to give me a hint where this could be configured? :)

replies(1): >>45071580 #

396. nicce ◴[29 Aug 25 15:00 UTC] No.45065037{3}[source]▶

>>45064792 #

I tried to do that with WhatsApp and it eventually stopped working.

397. 827a ◴[29 Aug 25 15:00 UTC] No.45065041{3}[source]▶

>>45064638 #

Gmail is free. It would still be incredibly bad for Gmail to start publishing the content of free users' emails to Google search.

But also, Anthropic has said that this new policy also applies to their Pro ($20/mo) and Max ($200/mo) plans. So its not free versus not free.

398. 12ian34 ◴[29 Aug 25 15:02 UTC] No.45065055{4}[source]▶

>>45064920 #

Sweden is a very poor example, all that is public is personal taxable income. That's it. You're comparing apples to oranges. And how is your home address, and AI chatbot history going to cure cancer?

399. ljosifov ◴[29 Aug 25 15:02 UTC] No.45065057{3}[source]▶

>>45064814 #

If they leaked bank accounts numbers, or private keys - I would be worried. That has not happened in the past.

About myself personally - my Name Surname is googleable, I'm on the open electoral register, so my address is not a secret, my company information is also open in the companies register, I have a a personal website I have put up willingly and share information about myself there. Training models on my data doesn't seem riskier than that.

Yeah, I know I'd be safer if I was completely dark, opaque to the world. I like the openness though. I also think my life has been enriched in infinitely many ways by people sharing parts of their lives via their data with me. So it would be mildly sociopathic of me, if I didn't do similar back to the world, to some extent.

replies(2): >>45065103 #>>45068227 #

400. bgwalter ◴[29 Aug 25 15:02 UTC] No.45065065[source]▶

>>45064773 #

I realize this might be satire. If not, you are using the same aggressive strategy of turning the tables as Palantir:

https://www.theguardian.com/technology/2025/jul/08/palantir-...

Most people do want to deny their data, as we have recently seen in various DOGE backlashes.

replies(1): >>45072978 #

401. calmbonsai ◴[29 Aug 25 15:03 UTC] No.45065075[source]▶

>>45064963 #

This is intriguing. I've been working on gestalt measurements of LLM 'skew' and thought this sort of scenario might be possible.

Combine that workflow with homomorphic encryption and you've got a reasonable privacy moat.

402. layer8 ◴[29 Aug 25 15:04 UTC] No.45065092{3}[source]▶

>>45064792 #

The new privacy policy automatically becomes effective on September 28, if you don’t already agree to it before. Anthropic states that “After September 28, you’ll need to make your selection on the model training setting in order to continue using Claude.”

403. soiltype ◴[29 Aug 25 15:04 UTC] No.45065093{3}[source]▶

>>45062975 #

Since I don't use LLMs to directly code for me, I'm going to (mis?)place my trust in Kagi assistant entirely for the time being. It claims not to associate prompts with individual accounts. Small friction of keeping a browser tab open is worth it for me for now.

404. lacoolj ◴[29 Aug 25 15:05 UTC] No.45065098[source]▶

>>45062683 (OP) #

There are dark patterns in UI, and then there's this terms and conditions popup that also contains the checkbox to disable

Having "Accept" right under that makes it very unclear what you're accepting and enabling/disabling at the same time.

For those without an account or just want to see this: https://imgur.com/a/jbhzbnB

405. 12ian34 ◴[29 Aug 25 15:05 UTC] No.45065103{4}[source]▶

>>45065057 #

So you are projecting sociopathy on those that choose to keep their lives more private than you? Like you said, basic personal details are essentially public knowledge anyway. Where do you draw the line personally on what should be private?

replies(1): >>45065529 #

406. ezfe ◴[29 Aug 25 15:05 UTC] No.45065110{4}[source]▶

>>45064493 #

It must be so tiring worrying about pointless future things that aren't happening yet.

replies(3): >>45065726 #>>45067777 #>>45075301 #

407. albumen ◴[29 Aug 25 15:06 UTC] No.45065122[source]▶

>>45063061 #

Nope, it's opt out. I'm in ireland.

408. Gud ◴[29 Aug 25 15:07 UTC] No.45065128[source]▶

>>45064773 #

Have you considered the drawbacks of sharing your data to the most unscrupulous people on this planet?

replies(1): >>45072724 #

409. cowboylowrez ◴[29 Aug 25 15:07 UTC] No.45065138[source]▶

>>45062782 #

I hope they didn't vibe code the popup, that could be bad if it didn't actually work.

410. Gud ◴[29 Aug 25 15:08 UTC] No.45065141{4}[source]▶

>>45064920 #

That financial data is very limited. Would it be just as acceptable if these companies knew where and what you purchased?

411. wolvesechoes ◴[29 Aug 25 15:09 UTC] No.45065150{4}[source]▶

>>45064753 #

> Many users chose Anthropic exactly because they were not like the others.

Oh the naivety.

Sooner or later they all become the same, soon after "investors" or "shareholders" arrive.

replies(1): >>45065271 #

412. weberer ◴[29 Aug 25 15:10 UTC] No.45065156{5}[source]▶

>>45063504 #

What metrics are you looking at? Grok 4 outperforms Claude 4 Opus in the Artificial Analysis Intelligence Index.

https://artificialanalysis.ai/leaderboards/models

413. smukherjee19 ◴[29 Aug 25 15:12 UTC] No.45065196[source]▶

>>45065010 #

I clicked "Accept" because I didn't want to deal with it later, especially given the new terms, and Claude training on new chats and resuming existing chats from September 28th,

but the "Accept" and "Not Now" are for the new terms of use.

So by toggling off and clicking "Accept", it will accept the toggled version. You can check the setting after accepting it at: https://claude.ai/settings/data-privacy-controls It's the "Help Improve Claude" button.

414. KoolKat23 ◴[29 Aug 25 15:13 UTC] No.45065215{4}[source]▶

>>45064753 #

There's no reason to be shocked by the practice however.

415. wolvesechoes ◴[29 Aug 25 15:16 UTC] No.45065253{4}[source]▶

>>45064192 #

Property itself is a legal fiction. Every other right you enjoy is a legal fiction.

So what?

replies(1): >>45067227 #

416. r2_pilot ◴[29 Aug 25 15:17 UTC] No.45065261[source]▶

>>45062683 (OP) #

Between this(I already opted out but) and their work with Palantir, I definitely am not going to increase my subscription. It is making me reconsider paying, even using, a technology I've found transformative, and I expected(okay hoped) better from Anthropic.

417. latexr ◴[29 Aug 25 15:17 UTC] No.45065264{3}[source]▶

>>45064169 #

> They're not power hungry

That is an interesting claim. What makes you believe that? And does this announcement shake that belief in any way?

> Their aesthetic is cool modern valley vibes.

Does looking cool equal being trustworthy? Doesn’t feel like it should. On the contrary, from observation on HN it seems the websites which look pretty bare bones (none from an LLM company) tend to be perceived as more trustworthy (i.e. “this hacker cares about The Thing™, not trying to sell you a product”).

> pure souled silicon valley.

Could you expand on what this means?

replies(1): >>45065715 #

418. behnamoh ◴[29 Aug 25 15:18 UTC] No.45065271{5}[source]▶

>>45065150 #

> Sooner or later they all become the same, soon after "investors" or "shareholders" arrive.

They already arrived. Google was one of the main investors of Anthro.

419. latexr ◴[29 Aug 25 15:20 UTC] No.45065294{3}[source]▶

>>45064557 #

I agree with you, and I think the thread does too so far. Which is why I’m specifically seeking those who disagree, to understand the why.

I don’t think there’ll be many (if any) who think this should’ve happened, but I do expect some may be surprised (and disappointed).

420. demarq ◴[29 Aug 25 15:21 UTC] No.45065305{3}[source]▶

>>45063532 #

What will any provider get out of running free inference then?

replies(1): >>45069080 #

421. AlexandrB ◴[29 Aug 25 15:23 UTC] No.45065333[source]▶

>>45064773 #

I think I'd have more understanding for this position if I thought that these companies were still fundamentally interested in serving their users. They are not. Any information you provide is more likely to be used against your interests (even if that's "just" targeting you with some ads for a scammy product) than for your benefit.

Basically all AI companies are fruit from the same VC-poisoned tree and I expect these products will get worse and more user-hostile as they try to monetize. We're currently living in the "MoviePass"[1] era of AI where users are being heavily subsidized to try to gain market share. It will not last and the potential for abuse is enormous.

[1] https://en.wikipedia.org/wiki/MoviePass

replies(1): >>45068317 #

422. podgorniy ◴[29 Aug 25 15:24 UTC] No.45065348[source]▶

>>45064455 #

What a framing. Like there is exactly a surprise behing all these reactions.

423. Aurornis ◴[29 Aug 25 15:27 UTC] No.45065374{3}[source]▶

>>45063070 #

It works correctly (blue on, grey off) in the iOS app. I just did it now.

424. deanmoriarty ◴[29 Aug 25 15:29 UTC] No.45065396[source]▶

>>45062683 (OP) #

There’s quite a few people on HN who hold Anthropic on a high pedestal compared to the other AI labs, I’d be curious to hear their opinion after this.

425. mapontosevenths ◴[29 Aug 25 15:29 UTC] No.45065406{6}[source]▶

>>45064441 #

>Smartphones are crystalized perfection.

Time will tell, but to me they feel like desktops did 20 years ago. The process of enshitification has turned simple tasks complicated and everyone wants a different, privacy destroying, frustrating to use "app", each of which has a slightly different UI paradigm, a mandatory subscription I've forgotten to cancel for two years straight, and a confusing name to remember. I now have something like 90 apps installed on my iphone, and I can only remember what something like 40 of them do. My damn cat box has an app, and instead of naming it something sensible like "Shitbox 2000" they named it "Whisker".

Was it "Foober Eats that had taco bell, or Instafart, maybe it was Dine-N-Dash? Where's the back button on this thing and why is it different from every other app? Is this an ad or content, does it even matter anymore? Why do I need another login, what happened to SSO? Why won't my password vault work for this one app? Did I register for this one with my google account or apple? Who took my pills? Stay off my lawn!"

When the day comes that I can just tell my device what to do, and let it get it done I'll be very happy to dump that cognitive load onto someone/something else.

replies(2): >>45065505 #>>45065993 #

426. insane_dreamer ◴[29 Aug 25 15:34 UTC] No.45065456{9}[source]▶

>>45064267 #

Was the creation of the atom bomb a net positive for humanity?

427. mrbombastic ◴[29 Aug 25 15:34 UTC] No.45065457[source]▶

>>45064773 #

You should read up on the Dutch Civil Registry and the holocaust in the Netherlands and reevaluate if you are serious. I would love to live in a world where everyone had good intentions and the powers that be wouldn’t abuse data to their ends, we will never live in that world.

428. Aurornis ◴[29 Aug 25 15:35 UTC] No.45065478[source]▶

>>45063738 #

My comment doesn’t make sense but it’s too late to edit. To clarify: The original link was to a perplexity.ai summary of multiple news articles about the change. The mods have changed it now.

The comments here were from people jumping to co conclusions after skimming an AI summary of news articles about the link. I’m glad it got changed.

429. weinzierl ◴[29 Aug 25 15:36 UTC] No.45065479[source]▶

>>45062683 (OP) #

I'm curious how the new terms compare to other companies offerings. Never spent the time to dig into the fine-print, so if anyone coincidentally has, I'd be very grateful for a summary.

430. ◴[29 Aug 25 15:37 UTC] No.45065496{9}[source]▶

>>45064267 #

431. mapontosevenths ◴[29 Aug 25 15:38 UTC] No.45065505{7}[source]▶

>>45065406 #

Further, even the content itself has become poison. When AI reaches a level that I can trust that it works for me and not someone else I will be ecstatic to let the machine mediate my reality and filter the untrue, toxic, rage bait content of the world to /dev/null on my behalf. Let the machine rot it's brain on Reddit, TikTok, and X-twitter all day so I can spend the clock cycles on something useful, but still be sure I'm not falling behind.

432. Syzygies ◴[29 Aug 25 15:39 UTC] No.45065522[source]▶

>>45063736 #

To clarify, I see AI as an association engine of immense scope. Others are responding with variations on this model in mind.

It has long been a problem in math research to distinguish between "no one has had this idea" and "one person has had this idea". This used to take months. With the internet and MathSciNet, ArXiv online it took many iterations of guessing keywords. Now, I've spent six months learning how to coax rare responses from AI. That's not everyone's use case.

What complicates this is AI's ability to generalize. My best paper, we imagined we were expressing in print what everyone was thinking, when we were in fact connecting the dots on an idea that was latent. This is an interesting paradox: People see you as most original when you're least original, but you're helping them think.

With the right prompts AI can also "connect the dots".

433. ljosifov ◴[29 Aug 25 15:39 UTC] No.45065529{5}[source]▶

>>45065103 #

Not at all, on the contrary, I chose my words carefully ("mildly sociopathic OF ME") as to avoid casting shade on others. Saying "this is how I feel", so to preclude judging others. Everyone makes their own choices, and that's fine.

Boundaries - yes sure they exist. I don't have my photo albums open to the world. I don't share info about family and friends - I know people by default don't want to share information about them, and I try to respect that. Don't share anything on Facebook, where plenty share, for example.

At the same time, I find obstacles to data sharing codified in the UK law frustrating. With the UK NHS. 1) Can't email my GP to pass information back-and-forth - GP withholds their email contact; 2) MRI scan private hospital makes me jump 10 hops before sharing my data with me; 3) Blood tests scheduling can't tell me back that schedule for a date failed, apparently it's too much for them to have my email address on record; 4) Can't volunteer my data to benefit R&D in NHS. ("here are - my lab works reports, 100 GB of my DNA paid for by myself, my medical histories - take them all in, use them as you please...") In all cases vague mutterings of "data protection... GDPR..." have been relayed back as "reasons". I take it's mostly B/S. They could work around if they wanted to. But there is a kernel of truth - it's easier for them to not try share, so it's used as a cover leaf. (in the worst case - an alibi for laziness.)

I'm for having power to share, or not share, what I want. With Google - I do want them to know about myself and use that for my (and theirs) benefit. With the UK gov (trying to break encryption) - I don't want them to be able to read my WhatsApp-s. I detest UK gov for effectively forcing me (by forcing the online pharmacy) to take a photos of myself (face, figure) in order to buy online Wegovy earlier today.

replies(1): >>45066579 #

434. tln ◴[29 Aug 25 15:41 UTC] No.45065542{5}[source]▶

>>45064551 #

I get blue (on) / black (off) on the website. Or blue / white in light mode.

https://claude.ai/settings/data-privacy-controls

It was easy to not opt-in, I got prompted before I saw any of this.

I think they should keep the opt-in behavior past Sept 28 personally.

replies(1): >>45067375 #

435. soraminazuki ◴[29 Aug 25 15:42 UTC] No.45065554[source]▶

>>45064773 #

You can train commercial AI with your data right now without screwing over everyone else on the planet. It's easy, just publish your entire trove of personal data on a website and AI crawlers will happily gobble it all up. You can publish your name, home address, work, government-issued ID, financial transactions, chats, browser history, location history, surveillance footage of your home, all for free. So what are you waiting for? Just do it now if you want to share data that badly, there's no need to wait for the approval of "privacy maximalists."

replies(1): >>45072819 #

436. dns_snek ◴[29 Aug 25 15:46 UTC] No.45065601[source]▶

>>45063736 #

If you talk to a human they're free to discuss your ideas with someone else. Why should LLMs be any different? The likelihood of these models reproducing your ideas word for word is essentially zero anyway.

More to the point, respecting your wishes to keep those conversations confidential would risk stifling human progress, so they have to be disregarded for the greater good.

replies(2): >>45065700 #>>45071173 #

437. diggan ◴[29 Aug 25 15:46 UTC] No.45065605{8}[source]▶

>>45064310 #

Exactly, which matches with precisely with how I'm using them. So with that perspective, you then agree they're a fundamental/core technology, at least for more than just me?

438. diggan ◴[29 Aug 25 15:47 UTC] No.45065617{8}[source]▶

>>45064464 #

> some fundamental flaws in what you’re vibing

That's just a misunderstanding, I'm not "vibing" anything. The tests are written by me, the API interfaces are written by me, the usages are written by me, and the implementation of these functions are written by an LLM, but reviewed to be up to code standards/quality by me.

If a function gives me the right output for the inputs I have in mind, does anything beyond that really matter?

439. ljosifov ◴[29 Aug 25 15:49 UTC] No.45065638{3}[source]▶

>>45064974 #

Well Yes and No. Funny you mention the 80s/90s. I grew up in the pre-Internet world. I remember home computers, then PC-s, then modems to access BBS-es, then FIDO, uucp email, academic Internet and then the private commercial Internet after 1990. Some parts of the privacy agenda I'm strongly pro-privacy, the more the better. I don't want encryption broken. The UK gov (I live in the UK) are being morons for that, forever trying that play. Atm there is at least part of the US admin to push back on that. I don't like UK Parliament forcing the online ID on me. I'm pro- having private citizens having private keys on un-snoop-able dongle devices.

440. chamomeal ◴[29 Aug 25 15:49 UTC] No.45065641{6}[source]▶

>>45064495 #

Point taken, but it still feels like a gray area to me. The value that SO created was the curation of knowledge and high quality discussions that were well indexed and searchable.

The users did provide the data, which is a good point. But there’s a reason SO was so useful to developers and quora was not. It also made it a perfect feeding ground for hungry LLMs.

Then again I’m just guessing that big models are trained on SO. Maybe that’s not true

441. chamomeal ◴[29 Aug 25 15:51 UTC] No.45065660{8}[source]▶

>>45063889 #

Agree with you there. And that sorta is the kind of reliance I’m talking about. My friends will ask GPT to read restaurant menus for them lol

442. dmbche ◴[29 Aug 25 15:53 UTC] No.45065700{3}[source]▶

>>45065601 #

Love to see people being directly and fully against the concept of "confidentiality"

replies(1): >>45066332 #

443. tln ◴[29 Aug 25 15:53 UTC] No.45065703{3}[source]▶

>>45063116 #

> By definition an “opt out” setting is selected by default.

No, (IMO) an "opt out" setting / status is assumed/enabled without asking.

So, I think this is opt-in, until Sept 28.

Opt-in, whether pre-checked/pre-ticked or not, means the business asks you.

GDPR requires "affirmative, opt-in consent", perhaps we use that term to mean an opt-in, not pre-ticked.

replies(1): >>45066185 #

444. Workaccount2 ◴[29 Aug 25 15:54 UTC] No.45065715{4}[source]▶

>>45065264 #

It's my perception of how people view the company, not my perception of the company.

445. SoftTalker ◴[29 Aug 25 15:55 UTC] No.45065726{5}[source]▶

>>45065110 #

I don't really worry about them for products I don't use, though.

446. notrealyme123 ◴[29 Aug 25 15:56 UTC] No.45065747{3}[source]▶

>>45064248 #

Unpublished work Vs. Published.

447. soraminazuki ◴[29 Aug 25 15:57 UTC] No.45065758{3}[source]▶

>>45064988 #

It makes sense when you see it as indoctrination than a mere opinion. Quashing critical thinking is the point. How else can you convince people to work against their own interests?

replies(1): >>45065963 #

448. ◴[29 Aug 25 15:58 UTC] No.45065772{4}[source]▶

>>45064920 #

449. nojs ◴[29 Aug 25 16:02 UTC] No.45065823{4}[source]▶

>>45064920 #

Would you be comfortable posting all of this information here, right now? Your name, address, email address, search history, ChatGPT history, emails, …

If not, why?

450. cowpig ◴[29 Aug 25 16:06 UTC] No.45065881[source]▶

>>45064530 #

> That said I do recognize the risk of only a handful of companies being responsible for something as important as the collective knowledge of civilization.

It's not just the risk of irresponsible behaviour (which is extremely important in a situation with so much power imbalance)

It's also just the basic properties of monopolistic markets: the smaller the number of producers, the closer the equilibrium price of the good maximizes the producers' economic surplus.

These companies operate for-profit in a market, and so they will naturally trend toward capturing as much value as they can, at the expense of everyone else.

If every business in the world depends on AI, this effectively becomes a tax on all business activity.

This is obviously not in the collective interest.

Of course, this analysis makes simplifying assumptions about the oligopoly. The reality is much worse: the whole system creates an inherent information asymmetry. Try and imagine what the "optimal" pricing strategy is for a product where the producer knows intimate details about every consumer.

451. ljosifov ◴[29 Aug 25 16:12 UTC] No.45065963{4}[source]▶

>>45065758 #

I put it to you - consider that you maybe wrong. That I indeed know what's best for me. The same way my default is that you know what's best for you. "Critical thinking" and "indoctrination" - you are on path to the dark side there. I grew up in a socialist/communist country. One of the ways in which vast majority of the population were oppressed, mis-treated etc or worse, was by them being denied agency and capability for critical thinking, for recognising their own interests, by a mechanism called "false consciousness". The ideas you expressed in your comment are of similar kind.

replies(1): >>45066163 #

452. echelon ◴[29 Aug 25 16:15 UTC] No.45065993{7}[source]▶

>>45065406 #

> The process of enshitification has turned simple tasks complicated and everyone wants a different, privacy destroying, frustrating to use "app", each of which has a slightly different UI paradigm, a mandatory subscription I've forgotten to cancel for two years straight, and a confusing name to remember. I now have something like 90 apps installed on my iphone, and I can only remember what something like 40 of them do.

This is because apps were never allowed to be installed like desktop software or as easy to access as websites. Developers had to cram in as much as possible and take as many permissions as possible because of how difficult Apple and Google made it.

If you could just search the web for an app, click a link, and have it instantly start working natively (sandboxed, with permissions), the world would be an amazing place.

replies(1): >>45068265 #

453. const_cast ◴[29 Aug 25 16:25 UTC] No.45066128{8}[source]▶

>>45064591 #

> Yes, because race is a protected class.

No: because Uber doesn't have to tell you how their model works and they probably don't even know.

replies(1): >>45066712 #

454. rightbyte ◴[29 Aug 25 16:26 UTC] No.45066143[source]▶

>>45063130 #

Tell that to my Samsung TV.

455. silverliver ◴[29 Aug 25 16:27 UTC] No.45066147[source]▶

>>45062683 (OP) #

lmarena did the same thing and saw its usage take a nosedive. I wonder if the same thing will happen here.

456. soraminazuki ◴[29 Aug 25 16:28 UTC] No.45066163{5}[source]▶

>>45065963 #

Says the person advocating for companies to get rid of consent, the bare minimum they can do when screwing over people for profit. That's not deciding what's best for you. That's you unilaterally deciding that no one deserves consumer protection. You are trying to force on everyone what 96% of people are opposed to [1]. So don't you dare pull off that DARVO nonsense and accuse me of being an oppressive dictator.

Also in what universe are utter fantasies like "'no one wants to share their data' is just assumed" or "the defaults are always 'deny everything'" true? Tech companies are bypassing user consent all [2] the [3] time [4].

[1]: https://arstechnica.com/gadgets/2021/05/96-of-us-users-opt-o...

[2]: https://hn.algolia.com/?q=opt%20out

[3]: https://hn.algolia.com/?q=opt%20in

[4]: https://hn.algolia.com/?q=consent

replies(1): >>45067835 #

457. whilenot-dev ◴[29 Aug 25 16:30 UTC] No.45066185{4}[source]▶

>>45065703 #

Regardless whether it's opt-in or opt-out, the business will need to confirm anything it opted for you by asking. If you don't select the opposing choice in a timely fashion, then the business assumes that it opted correctly in your interest and on your behalf.

> So, I think this is opt-in, until Sept 28.

If the business opted for consent, then you will effectively have the choice for refusal, a.k.a. opt-out.

458. whamlastxmas ◴[29 Aug 25 16:32 UTC] No.45066204{3}[source]▶

>>45064759 #

Infringement, not theft :)

replies(1): >>45069563 #

459. ljosifov ◴[29 Aug 25 16:40 UTC] No.45066321{4}[source]▶

>>45064920 #

In the past I have found obstacles to data sharing codified in the UK law frustrating. I'm reasonably sure some people will have died because of this, that would not have died otherwise. If they could communicate with the NHS, similarly (email, whatsapp) to how they communicate in their private and professional lives.

Within the UK NHS and UK private hospital care, these are my personal experiences.

1) Can't email my GP to pass information back-and-forth. GP withholds their email contact, I can't email them e.g. pictures of scans, or lab work reports. In theory they should have those already on their side. In practice they rarely do. The exchange of information goes sms->web link->web form->submit - for one single turn. There will be multiple turns. Most people just give up.

2) MRI scan private hospital made me jump 10 hops before sending me link, so I can download my MRI scans videos and pictures. Most people would have given up. There were several forks in the process where in retrospect could have delayed data DL even more.

3) Blood tests scheduling can't tell me back that scheduled blood test for a date failed. Apparently it's between too much to impossible for them to have my email address on record, and email me back that the test was scheduled, or the scheduling failed. And that I should re-run the process.

4) I would like to volunteer my data to benefit R&D in the NHS. I'm a user of medicinal services. I'm cognisant that all those are helping, but the process of establishing them relied on people unknown to me sharing very sensitive personal information. If it wasn't for those unknown to me people, I would be way worse off. I'd like to do the same, and be able to tell UK NHS "here are, my lab works reports, 100 GB of my DNA paid for by myself, my medical histories - take them all in, use them as you please."

In all cases vague mutterings of "data protection... GDPR..." have been relayed back as "reasons". I take it's mostly B/S. Yes there are obstacles, but the staff could work around if they wanted to. However there is a kernel of truth - it's easier for them to not try to share, it's less work and less risk, so the laws are used as a cover leaf. (in the worst case - an alibi for laziness.)

460. dns_snek ◴[29 Aug 25 16:41 UTC] No.45066332{4}[source]▶

>>45065700 #

Not in the slightest! The only thing I'm against is hypocrisy.

LLM enthusiasts are staunch defenders of the argument that use of everyone's ideas and labour in LLM training isn't just fair use, but a moral imperative in order to advance science, art, and human progress as a whole.

It's beyond hypocritical for beneficiaries of this paradigm to then turn around and expect special treatment by demanding that "their" ideas, "their" knowledge, "their" labour be excluded from this process.

replies(1): >>45066456 #

461. lacoolj ◴[29 Aug 25 16:43 UTC] No.45066363[source]▶

>>45064530 #

Proprietary data (your company's app repository, a script for upcoming movie) and sensitive data (health, finance) become exposed

462. dwa3592 ◴[29 Aug 25 16:48 UTC] No.45066434[source]▶

>>45062683 (OP) #

Do they remove personal information(names/dates/SSNs etc) before using data for training?

If not, you should mask your personal info before you sent it to Anthropic (or OpenAI, Google).

Use this maybe - https://github.com/deepanwadhwa/zink#shielding-llm-and-api-c...

463. dmbche ◴[29 Aug 25 16:50 UTC] No.45066456{5}[source]▶

>>45066332 #

Gotcha - right with you. Gotta get my sarcasm detector checked.

464. nocommandline ◴[29 Aug 25 16:54 UTC] No.45066512{3}[source]▶

>>45062975 #

If you aren't using it for coding or advanced uses like video, etc, you can try running models locally on your machine using Ollama and others like it.

Self plug here - If you aren't technical and still want to run models locally, you can try our App [1]

1] https://ai.nocommandline.com

465. monsieurbanana ◴[29 Aug 25 16:56 UTC] No.45066543{4}[source]▶

>>45064621 #

Save your prompts, anonymize them and offer them to anyone that wants to train a LLM, that is us collectively training LLMs.

Giving Claude your private data ensures that there will not be thousands of providers, since the limiting factor isn't power but data.

466. ljosifov ◴[29 Aug 25 16:56 UTC] No.45066547{5}[source]▶

>>45065000 #

I agree with your stance there. Further - the conventional opinion is that the power imbalance coming from the information imbalance (state/business know a lot about me; I know little about them) is that us citizens and consumers should reduce our "information surface" towards them. And address the imbalance that way. But.

There exists another, often unmentioned option. And that option is for state/business to open up, to increase their "information surface" towards us, their citizens/consumers. That will also achieve information (and one hopes power) rebalance. Every time it's actually measured, how much value we put on our privacy, when we have to weight privacy against convenience and other gains from more data sharing, the revealed preference is close to zero. The revealed preference is that we put the value of our privacy close to zero, despite us forever saying otherwise. (that we value privacy very very much; seems - "it ain't so")

So the option of state/business revealing more data to us citizens/consumers, is actually more realistic. Yes there is extra work on part of state/business to open their data to us. But it's worth it. The more advanced the society, the more coordination it needs to achieve the right cooperation-competition balance in the interactions between ever greater numbers of people.

There is an old book "Data For the People" by an early AI pioneer and Amazon CTO Andreas Weigend. Afaics it well describes the world we live in, and also are likely to live even more in the future.

467. 12ian34 ◴[29 Aug 25 16:58 UTC] No.45066579{6}[source]▶

>>45065529 #

Thanks for this considered response. I find it difficult to disagree with anything you said in this particular comment :) however I do think each instance you mention in this message is quite different to the topic at hand, regarding the big tech data machine. Additionally, I think I would rather our UK level of privacy regarding healthcare data than the commercialised free for all in the US. One counterpoint could be that Palantir got a significant amount of UK NHS data.

replies(1): >>45067008 #

468. darepublic ◴[29 Aug 25 17:00 UTC] No.45066593[source]▶

>>45062852 #

it's almost like this multi billion dollar company is misanthropic, despite their platitudes. Should I not hold my breath on Anthropic helping facilitate "an era of AI abundance for all"? (To quote a rejected PR applicant to Anthropic from the front page)

469. SoftTalker ◴[29 Aug 25 17:09 UTC] No.45066712{9}[source]▶

>>45066128 #

Doesn't matter. If you can convincingly argue that the effect is discrimination based on race, you have a civil rights claim.

470. gpm ◴[29 Aug 25 17:09 UTC] No.45066715{6}[source]▶

>>45064505 #

This link is not even remotely close to an example of the behavior you described.

471. insane_dreamer ◴[29 Aug 25 17:09 UTC] No.45066718{9}[source]▶

>>45064302 #

> So if someone happens to use local models at home, in a home that is powered by solar power, then you'd feel LLM starting to be a net positive for humanity?

Sure, that would make a difference, but it's not gonna happen anytime soon, other than hacker hobbyists, because no one is making money off of that.

> This is such a big thing in general (that I agree with) but it has nothing to do with LLMs as a technology.

Correct -- I don't have any issue with the technology itself, but rather how the technology is implemented and used, and the resources put towards its use. And BigTech are putting hundreds of $B into this -- for what end exactly besides potentially making tons of money off of consumer subscribers or ads a-la-Meta or Google? If BigTech was putting the same amount of money into technology that could actually benefit humanity (you know, like actually saving the world from potential future destruction by climate change), I'd have a much kinder view of them.

472. card_zero ◴[29 Aug 25 17:15 UTC] No.45066797{7}[source]▶

>>45064926 #

And here it is, evolving before your eyes: we're killing off the maladaptive mutant which was "opt-in by default". That's the evolution that is happening here.

replies(1): >>45070611 #

473. soraminazuki ◴[29 Aug 25 17:16 UTC] No.45066803{5}[source]▶

>>45064763 #

Diluting the distinction between opt-in and opt-out is gaslighting, not "evolution."

replies(1): >>45070669 #

474. kmacdough ◴[29 Aug 25 17:26 UTC] No.45066937{3}[source]▶

>>45064034 #

If you have an idea and are putting it together, you might use Claude for a few things:

- Search the web for related ideas. This could help if someone's already had the idea or if there are things to learn from related ideas. - Review your writeup or proofs for mistakes and clarity

None of these things make the idea Claude's. Claude merely helped with some of the legwork.

But Claude now has your idea in clear, plain text to train on. The next time someone hits on even a similar idea, Claude might well suggest your idea outright. Not seeing your idea published, the user has no way to know it isn't a novel idea. If the person is less diligent/thorough, they may well publish first and claim it as there own, without any nefarious intent.

475. ljosifov ◴[29 Aug 25 17:30 UTC] No.45067008{7}[source]▶

>>45066579 #

Thanks for the consideration. Yeah US and UK are different in that respect. I got the impression that US ends with the worst deal on both ends: organisations that could help you are denied your data, while organisation most unscrupulous most bent on doing their worst with your data, get almost free access to it.

For UK - I'm reasonably sure some people will have died because of the difficulties sharing their data, that would not have died otherwise. "Otherwise" being - they could communicate with the NHS, share their data, similarly via email, WhatsApp etc, to how they communicate and share data in their private and professional lives.

People at personal level have a fairly reasonable stance, in how they behave, when it comes to sharing their data. They are surprisingly subtle in their cost-benefit analysis. It's only when they answer surveys, or talk in public, that they are less-than-entirely-truthful. We know this, b/c their revealed preferences are at odds with what they say they value, and how much they value.

476. cube00 ◴[29 Aug 25 17:43 UTC] No.45067192{4}[source]▶

>>45064214 #

"AI models collapse when trained on recursively generated data"

https://news.ycombinator.com/item?id=41058194

replies(1): >>45070408 #

477. sneak ◴[29 Aug 25 17:47 UTC] No.45067227{5}[source]▶

>>45065253 #

> Property itself is a legal fiction.

Maybe real property (which only exists because of a property record held in a government building), but it is self-evident to me (and, I believe, most people) that personal property is a natural right.

One only need look up some TikTok videos of Americans getting pickpocketed in Europe to see how large groups of people feel on the matter.

replies(1): >>45067882 #

478. mystraline ◴[29 Aug 25 17:51 UTC] No.45067270{4}[source]▶

>>45063111 #

As a real world counterexample, medical in the USA does this shit all the time.

Local office will do a blood draw, send it to a 3rd party analysis which isn't covered by insurance, then bill you full. And you had NO contractual relationship with the testing company.

Same scam. And its all because our government is completely captured by companies and oligopoly. Our government hasn't represented the people in a long time.

479. IAmGraydon ◴[29 Aug 25 18:00 UTC] No.45067375{6}[source]▶

>>45065542 #

They’re likely A/B testing the interface change, which is why people are getting inconsistent results

480. psychoslave ◴[29 Aug 25 18:28 UTC] No.45067724{4}[source]▶

>>45064221 #

20 years ago (gosh!) at a French University, in the frame of an English course we made a odious flashy pink VB application called "script generator", where you just had to select the kind of movie (action or more action), how many people would die on screen per minute, and that kind of ridiculously sarcastic choice, and you would get your Hollywood script in a second. That was all fake of course, then. Pure valley spirit I guess. But sure this is the kind of thing you not only can see in advance, but even moke in anticipation.

481. Disposal8433 ◴[29 Aug 25 18:34 UTC] No.45067777{5}[source]▶

>>45065110 #

> pointless future things that aren't happening yet

I remember all those things that wouldn't happen since the 90s and which definitely ended up happening starting with all that crap from Microsoft. It's not cyberpunk anymore, it's real life.

482. ljosifov ◴[29 Aug 25 18:39 UTC] No.45067835{6}[source]▶

>>45066163 #

I see reading comprehension is not something you enjoy to indulge with.

These -

> utter fantasies like "'no one wants to share their data' is just assumed" or "the defaults are always 'deny everything'" true?

...far from being fantasies, are my personal experiences in the UK medical systems. This -

https://news.ycombinator.com/item?id=45066321

replies(1): >>45068948 #

483. mitthrowaway2 ◴[29 Aug 25 18:45 UTC] No.45067882{6}[source]▶

>>45067227 #

But you won't feel the same way about a pickpocket who borrows the source code to the software you derive your livelihood from, your sales team's customer list, your would-be-bestselling novel manuscript, your company's secret formula for a rust-proof coating, or that scientific paper that you and your grad students have spent all summer getting ready to submit for publication?

Thank you for your generosity!

484. int_19h ◴[29 Aug 25 19:10 UTC] No.45068149[source]▶

>>45064530 #

It is a great thing if it were reciprocated. But when I'm paying $20/mo to access Claude, why should I give training data to Anthropic for free?

485. int_19h ◴[29 Aug 25 19:17 UTC] No.45068227{4}[source]▶

>>45065057 #

LLMs can and do sometimes regurgitate parts of training data verbatim - this has been demonstrated many times on things ranging from Wikipedia articles to code snippets. Yes, it is not particularly likely for that damning private email of yours to be memorized, but if you throw a dataset with millions of private emails onto a model, it will almost certainly memorize some of them, and nobody knows what exact sequence of input tokens might trigger it to recite.

replies(1): >>45072902 #

486. mapontosevenths ◴[29 Aug 25 19:20 UTC] No.45068265{8}[source]▶

>>45065993 #

> If you could just search the web for an app, click a link, and have it instantly start working natively (sandboxed, with permissions), the world would be an amazing place.

I disagree. Almost all of it should just be relatively standard API's designed for the AI to use, and we should all just use the AI as the standard interface. Many companies would collapse, because their entire anti-consumer business models would topple over, but that would be a good thing.

487. ljosifov ◴[29 Aug 25 19:25 UTC] No.45068317{3}[source]▶

>>45065333 #

Whether Google is interested in serving me or not, is not only untestable (i.e. what counts as 'Google', 'interested', and 'serving' there - one could argue to end of time) - but besides the point. I want to be able to tell Google "My home is XYZ", and for Google to use that information about me in all of Google ecosystem. When I talk to Gemini it should know what/where "LJ home" is, when I write in Gdoc it should know my home address (so to insert it if I want it), ditto for Gmail, when I search in Google photos "photos taken at home" it should also know what "home" is for me.

Atm Google vaguely knows, and uses that for Ads targeting, sometimes. Most of the time - the targeting is bad, very low quality slop. To the level of "he bought a mattress yesterday, will keep buying mattresses in the next 30-60 days". I have the impression that we ended up in the worst case scenario. People I don't want to have my data, have access to it. People I do want to have my data, are afraid to touch it, and use it - yes! - for theirs, but also for my benefit too. The current predicament seems to me the case of "public lies, private truths."

A small cadre of vocal proponents of a particular view, established "the ground truth to what is desirable". (in this case - maximum privacy, ideally zero information sharing) The public goes with it in words, pays lip service, while in deeds, the revealed preferences show, they value their data privacy very cheaply, almost zero. Even one click extra, to share their data less, is one click too many, effort too high, for most people. Again - these are revealed preferences, for people keep lying when asked. It's not even the case of "you are lying to me" - no, it's more like "you are lying to yourself."

The conventional opinion is that the power imbalance coming from the information imbalance (state/business know a lot about me; I know little about them) is that us citizens and consumers should reduce our "information surface" towards them. And address the imbalance that way. But. There exists another, often unmentioned option. And that option is for state/business to open up, to increase their "information surface" towards us, their citizens/consumers. That will also achieve information (and one hopes power) rebalance. Yes there is extra work on part of state/business to open their data to us. But it's worth it. The more advanced the society, the more coordination it needs to achieve the right cooperation-competition balance in the interactions between ever greater numbers of people. There is an old book "Data For the People" by an early AI pioneer and Amazon CTO Andreas Weigend. Afaics it well describes the world we live in, and also are likely to live even more in the future.

replies(1): >>45070922 #

488. AvAn12 ◴[29 Aug 25 20:18 UTC] No.45068920{4}[source]▶

>>45064192 #

So Anthropic should have no property rights to its own source code?

489. soraminazuki ◴[29 Aug 25 20:21 UTC] No.45068948{7}[source]▶

>>45067835 #

See, this is what I meant by indoctrination. I showed you links containing dozens, maybe even hundreds of examples showing how companies don't obtain consent from users. But you ignore all that and cherry pick your highly exaggerated spin on the UK medical system. "I'm reasonably sure some people will have died because of this." Sigh, give me a break. Your take on privacy sounds just like the auto industry's take on right to repair. According to them, right to repair laws will get women raped in parking lots [1]. Corporate activists making absurd claims resorting to the same old fearmongering tactics.

This isn't me having problems with reading comprehension. It's you arguing in bad faith. Which is inevitable given your desire to demolish consumer protection for everyone. You're defending the indefensible.

[1]: https://www.vice.com/en/article/auto-industry-tv-ads-claim-r...

replies(1): >>45069127 #

490. sigmoid10 ◴[29 Aug 25 20:28 UTC] No.45069013{4}[source]▶

>>45063592 #

All of this boils down to selling to advertisers. There is no real difference between doing it yourself or having someone else in the chain. Doing it yourself may be more profitable - if you can scale. But that seems to be off the table here.

491. sigmoid10 ◴[29 Aug 25 20:30 UTC] No.45069045{4}[source]▶

>>45064540 #

30% of ~4% is very little when you think about these valuations.

492. sigmoid10 ◴[29 Aug 25 20:32 UTC] No.45069061{4}[source]▶

>>45063546 #

Only if they are in the ad selling business.

493. DrillShopper ◴[29 Aug 25 20:34 UTC] No.45069080{4}[source]▶

>>45065305 #

Compliance with the law and therefore the ability to operate at all

494. ljosifov ◴[29 Aug 25 20:38 UTC] No.45069127{8}[source]▶

>>45068948 #

I know indoctrination well. Reading what you write - I get the impression that you don't know much about indoctrination. But I don't know you, so I allow it that I maybe wrong. You asked "in what universe". I showed you concrete examples in one universe. For my claim to be true, one example suffices. None of your claims (latest "demolish customer protection") about my alleged intentions, character, thoughts, etc - have any basis in reality. You are wrong in almost everything that you wrote about me. It's all in your head, in your imagination. How do I know? B/c I know me, and you don't know me. That easy.

replies(1): >>45069257 #

495. soraminazuki ◴[29 Aug 25 20:51 UTC] No.45069257{9}[source]▶

>>45069127 #

> Excellent. What were they waiting for up to now?? I thought they already trained on my data. I assume they train, even hope that they train, even when they say they don't.

These are your exact words, not my imagination. You very clearly want consumer protection to be gone, because you said so.

> For my claim to be true, one example suffices.

To be clear, your claim is that we live in a world where there's too much privacy protection. So much in fact that you're, gasp, "reasonably sure some people will have died because of this." Nope, a single spin on the UK medical system is nowhere near as sufficient for that absurd claim.

As for your attempted word lawyering about indoctrination? Classic.

replies(1): >>45069585 #

496. mitthrowaway2 ◴[29 Aug 25 21:12 UTC] No.45069438[source]▶

>>45064530 #

I'm okay with LLMs not getting better.

497. nnutter ◴[29 Aug 25 21:24 UTC] No.45069555[source]▶

>>45062683 (OP) #

This is disappointing. I thought Anthropic was a more privacy and safety focused company and so I had chosen to use Claude Code over their competition. Now they are less privacy focused than Gemini Code, which only does this for the free tier. My work was already providing GitHub Copilot but I was rooting for and supporting Anthropic. Now I'll either just use GitHub Copilot or Gemini Code both of which are still private for paid tiers.

498. freejazz ◴[29 Aug 25 21:24 UTC] No.45069563{4}[source]▶

>>45066204 #

Reread my post.

499. ljosifov ◴[29 Aug 25 21:26 UTC] No.45069585{10}[source]▶

>>45069257 #

Yes - my data, not your data. You stay away from my data. I stay away from your data. I don't care about your data. But I do want them to train on my data. And to serve me better. Was disappointed that they didn't do that already.

But now you gave me ideas. ;-) Yeah - I think ideally we should go further, much further. Internet was not built by po-faced, lemon-sucking prudes, tut-tut-ing about everything and anything. It was built by happy-go-lucky, live-and-let live, altruistic mildly autistic nerds. It was permission-less, one didn't need to ask anyone in order to do anything, and that's why it lived. Whereas many other networks and protocols, technically more sophisticated, but with a fatal flaw that a gatekeeper with the power to say "NO" was built into them - just died off. Wish people went back to the original permission-less Net. That people tore down all manner of laws making moving bits around illegal, used to jail humans for crimes of reading, copying and writing data.

replies(1): >>45075871 #

500. nh43215rgb ◴[29 Aug 25 22:31 UTC] No.45070143[source]▶

>>45062683 (OP) #

All the bad news seem to drop on Friday. Is it just a coincidence?

replies(1): >>45070167 #

501. estimator7292 ◴[29 Aug 25 22:33 UTC] No.45070167[source]▶

>>45070143 #

No, it's a very deliberate strategy. Companies do this every Friday

502. jmward01 ◴[29 Aug 25 22:37 UTC] No.45070194[source]▶

>>45062852 #

The 5 year is the real kicker. Over the next 5 years I find it doubtful that they won't keep modifying their TOS and presenting that opt out 'option' so that all it will take is one accidental click and they have all your data from the start. Also, what is to stop them from removing the opt out? Nothing says they have to give that option. 4 years and 364 days from now TOS change with no opt out and a retention increase to 10 years. By then the privacy decline will have already have been so huge nobody will even notice that this 'option' was never even real.

503. theshackleford ◴[29 Aug 25 23:06 UTC] No.45070408{5}[source]▶

>>45067192 #

That's not what I asked for as it's not relevant.

The claim was made that the models are "suffering", at this exact moment, because they have been recursively feeding themselves, RIGHT now.

I want evidence the current models are "suffering" right now, and I want further evidence that suggests this suffering is due to recursive data ingestion.

Some year old article with no relevance to today talking about hypotheticals of indiscriminate gorging of recursive data is not evidence of either of the things I asked for.

504. troad ◴[29 Aug 25 23:41 UTC] No.45070611{8}[source]▶

>>45066797 #

That would not be evolution, that would be an attempt at creationism. There is no evolution police, and never will be.

replies(1): >>45070824 #

505. troad ◴[29 Aug 25 23:49 UTC] No.45070669{6}[source]▶

>>45066803 #

That seems like an ungenerous and frankly somewhat hysterical take.

By default, you are opted in. Perfectly clear.

The purpose of language is communication, not validating your politics.

replies(1): >>45071177 #

506. danparsonson ◴[30 Aug 25 00:17 UTC] No.45070824{9}[source]▶

>>45070611 #

Selection pressure is the evolution police.

replies(1): >>45071301 #

507. roVinchi ◴[30 Aug 25 00:28 UTC] No.45070879[source]▶

>>45062683 (OP) #

Can users poison the future training dataset by ending every chat with a dissatisfied feedback even though the chat helped them? Or be even more malicious and steer the chat into destructive behavior and then give very positive feedback?

508. danparsonson ◴[30 Aug 25 00:37 UTC] No.45070922{4}[source]▶

>>45068317 #

You started by saying that it's difficult or impossible to define what 'serving the user' looks like, then immediately gave examples of what it would look like to you. It's not that Google can't do these things or is afraid to, but rather that operating in your best interests does not benefit their shareholders. Sure, it'd be great if we could all just get along, but we're living in the worst case scenario you describe because we can't all just get along. Not trusting companies like Google with your personal data is the pragmatic choice; regardless of what they could do with our data, what they actually do with it is what counts.

Side note: they know exactly where you live. My colleague's Android used to tell him, without any prompting or specific configuration, how long his drive home from work would take that day. That was over ten years ago.

replies(1): >>45073056 #

509. const_cast ◴[30 Aug 25 01:35 UTC] No.45071173{3}[source]▶

>>45065601 #

> Why should LLMs be any different?

Because they're a computer program and not a human and humans are special.

Why are humans special? Because we're humans and we make the rules.

Its as inane as saying "why can I eat a burger but I can't chop up my friend and eat him? Why is that any different?"

510. soraminazuki ◴[30 Aug 25 01:35 UTC] No.45071177{7}[source]▶

>>45070669 #

> By default, you are opted in. Perfectly clear.

That's called opt-out. You're doing exactly what I described: gaslighting people into believing that opt-in and opt-out are synonymous, rendering the entire concept meaningless. The audacity of you labeling people as "political" while resorting to such Orwellian manipulation is astounding. How can you lecture others about the purpose of languages with a straight face when you're redefining terms to make it impossible for people to express a concept?

These are examples of what "opt-in by default" actually means. It means having the user manually consent to something every time, the polar opposite your definition.

- https://arstechnica.com/gadgets/2024/06/report-new-apple-int...

- https://github.com/rom1504/img2dataset/issues/293

It's also just pure laziness to label me as "hysterical" when PR departments of companies like Google have, like you, misused the terms opt-out and opt-in in deceptive ways.

https://news.ycombinator.com/item?id=37314981

replies(1): >>45072611 #

511. const_cast ◴[30 Aug 25 01:43 UTC] No.45071208{4}[source]▶

>>45064753 #

> Many users chose Anthropic exactly because they were not like the others.

Companies are less like people and more like bacteria. They are programmatic, like algorithms.

What they will do has already been decided for them, programmed into them, by the rules of capitalism. It is inevitable. There are no good guys, and there are no bad guys, there's just... microbes.

Those who do not engage in capitalism, perhaps they do not seek money at all, have no such hard limitations. But they are rare, because money is blood.

512. card_zero ◴[30 Aug 25 02:03 UTC] No.45071301{10}[source]▶

>>45070824 #

It would be fair to compare it to selective breeding, rather than natural selection. The flip side of rejecting usage is promoting neologisms. We can do both things deliberately, I see no rule saying that language is only allowed to evolve naturally. A reasonable criticism would be that trying to change it on purpose makes for a lot of unnecessary fuss, but we can be moderate about it.

replies(1): >>45077094 #

513. homeless_engi ◴[30 Aug 25 03:04 UTC] No.45071580{4}[source]▶

>>45065033 #

Sure! Here you go:

https://support.google.com/gemini/answer/13594961?&p=privacy...

514. gloosx ◴[30 Aug 25 07:10 UTC] No.45072552[source]▶

>>45064530 #

>LLM's probably won't get significantly better without this data.

Who told you LLMs will get significantly better with this data? Sam Altman?

515. Nevermark ◴[30 Aug 25 07:22 UTC] No.45072611{8}[source]▶

>>45071177 #

I completely agree with you from a correctness standpoint, ...

> Diluting the distinction between opt-in and opt-out is gaslighting

> That seems like an ungenerous and frankly somewhat hysterical take.

... however, this comment was a reasonable response.

Projective framing demonstrates your own lack of concern for accuracy, clarity or conviviality, that is 180 degrees at odds with the point you are making and the site you are making it on.

replies(1): >>45073525 #

516. gloosx ◴[30 Aug 25 07:28 UTC] No.45072645[source]▶

>>45063737 #

There is nothing surprising at all? It's not rocket science to understand that Anthropic is a very greedy company – just by analysing their sales funnel/UX you can tell that money is all they ever care about. Their marketing teams will go for any tricks to gain direct access to your pockets.

In the LLM space no other company is different – they are highly unprofitable – so once funding starts to run dry they will have the pressure to invent new ways for going even deeper into your pockets.

517. gloosx ◴[30 Aug 25 07:35 UTC] No.45072683[source]▶

>>45063285 #

>Anthropic has had the most reasonable business model

Is this a joke? Anthropic burned $5.6 billion in 2024 and is expected to lose about $3 billion this year and it's reasonable business model?

replies(1): >>45080928 #

518. ljosifov ◴[30 Aug 25 07:45 UTC] No.45072724{3}[source]▶

>>45065128 #

I already share lots of my data with Google. I have Gmail where a lot of my online life is reflected. I have Photos, Gmaps, Gdrive. Also Google knows about my YouTube viewing, my Android phone use. So no matter what I say - with my actions, my revealed preference is - that I trust Google. So far - Google have not betrayed my trust, afaics. So I actually want for Google to adapt Gemini to me, either via the context, or even with a thin layer of LoRA. If Google treats me like a complete stranger it knows nothing about, then Google, and plenty of other people, make use of my data, but I, the creator (and nominal owner) of my data - don't benefit from their knowledge of me?? That sounds the worst of the possible options to me.

519. ljosifov ◴[30 Aug 25 08:02 UTC] No.45072819{3}[source]▶

>>45065554 #

I'm not 'screwing' anyone. I'm saying - the same way people don't want to have their data used, I DO want my data used. I'm not saying they use YOUR data. I'm saying they use MY data.

Likewise, I'm not telling you what you publish. In the same manner, I dislike it you telling me that I publish. So on

> name, home address, work, government-issued ID, financial transactions, chats, browser history, location history, surveillance footage of your home, all for free.

It's up to me, not you, what I decided to publish or not. Fwiw, I already publish

> name, home address, work,

willingly. My name is public (how can it be otherwise?) and home address is in the electoral register that is public. My work info is in the UK companies register, available for reading to all, on the web

I publish to selected parties

> government-issued ID

even if I don't want it. (we don't have specific 'government-issued ID' for ID purposes like in the Continent; my driving licence is used for that) I did it yesterday, because UK gov requires companies to collect that information. Yesterday I had to give two photos of myself to an online pharmacy shop because UK gov mandates that they collect that info - and I disliked that very much. The online pharmacy is not the one pushing for that data, its the UK gov forcing that one them via regulation of how that particular medication is to be sold online.

I don't want to publish and don't publish

> financial transactions, chats, browser history, location history, surveillance footage of your home

...and don't understand where this gale to tell perfect strangers what they should do with their lives comes from?? I don't tell you what you should or should not publish? Ditto for the pricing

> all for free.

Up to me to decide. I don't tell you what you do - so you don't tell me what I do, pretty please.

I am not waiting on "privacy maximalists." I try to share my data for some purpose I need. I loathe 'privacy maximalist' in the UK for having influenced the current laws of the land in a way to cater for their obsessions and ignore my desires. I think I'm in majority, not minority. Our current predicament seems to me the case of "public lies, private truths." A small cadre of vocal proponents of a particular view, established "the ground truth to what is desirable". (in this case - maximum privacy, ideally zero information sharing) The public goes with it in words, pays lip service, while in deeds, the revealed preferences show that we value our data privacy very little - almost zero. Even one click extra to share our data less, is one click too many, an effort too high for most people. Again - these are revealed preferences, for people keep lying when asked. It's not even the case of "you are lying to me" - no, it's more like "you are lying to yourself."

520. ljosifov ◴[30 Aug 25 08:12 UTC] No.45072864{3}[source]▶

>>45064814 #

I'm worried, it's not like I don't care. For example, I'm worried that Google is such a huge ginormous target, that at some point their Gmail will be broken. At the same time, there are benefits to sharing data. There are benefits to me, in Google using the information it has on my, to make my life easier. In this case, I judge that Gemini using my data to train, is a low extra risk for me. Compared to all other risks I take, for doing things in public. Including writing this on public forums, as you do too.

In general, I find the ongoing public scare about sharing data, to be anti-thesis to the original spirit of the Net, that was all about sharing data. Originally, we were delighted to connect to perfect strangers on the other side of the world. That we would never have gotten to communicate with otherwise. I accept there might have been an element of self-selection there, that aided that view: people one'd communicate with, although maybe from a different culture, would be from similar niche sub-culture of people messing with computers and looking forward to communication, having a favourable view of that.

replies(1): >>45073180 #

521. ljosifov ◴[30 Aug 25 08:20 UTC] No.45072902{5}[source]▶

>>45068227 #

That's a consideration, for sure. But given the LLM-s have not got the ground truth, everything is controlled hallucination, then - if the LLM tells you an imperfect version of my email or chat, you can never be sure if what the LLM told you is true, or not. So maybe you don't gain that much extra knowledge about me. For example, you can reasonably guess I'm typing this on the computer, and having coffee too. So if you ask the LLM "tell me a trivial story", and LLM comes back with "one morning, LJ was typing HN replies on the computer while having his morning coffee" - did you learn that much new about me, that you didn't know or could guess before?

522. ljosifov ◴[30 Aug 25 08:24 UTC] No.45072923{3}[source]▶

>>45064872 #

My experience in the UK medical systems has been the opposite - wrote here

https://news.ycombinator.com/item?id=45066321

Google knows what "Home" is for me only in Gmaps, because I went out of my way (put a Label etc) to tell it. I want to be able to tell Google "My home is XYZ", and for Google to use that information about me in all of Google ecosystem. When I talk to Gemini it should know what/where "LJ home" is, when I write in Gdoc it should know my home address (so to insert it if I want it), ditto for Gmail, when I search in Google photos "photos taken at home" it should also know what "home" is for me.

I have the impression that we ended up in the worst case scenario. People I don't want to have my data, have access to it. People I do want to have my data, are afraid to touch it, and use it - yes! - for theirs, but also for my benefit too.

523. ljosifov ◴[30 Aug 25 08:32 UTC] No.45072978{3}[source]▶

>>45065065 #

It's not a satire, you can check mu comments on this topic easily.

I dispute 'most people'. Revealed preferences of most people are that they value their data privacy very cheaply, almost zero. Even one click extra to share their data less, is one click too many, an effort too high - for most people. This is their real, observed behaviour. I think our current predicament is the case of "public lies, private truths." A small cadre of vocal proponents of a particular view, established "the ground truth to what is desirable". (in this case - maximum privacy, ideally zero information sharing) The public goes with it in words, pays lip service - but in reality behaves different, even opposite to what they say they desire.

And even if 'most people' wanted what you say they do, I still think the companies could and should accommodate a minority group like myself that want otherwise to what 'most people' want. I don't think the will of the majority is the highest ideal, so high as to trump what I personally want.

524. ljosifov ◴[30 Aug 25 08:37 UTC] No.45072994{3}[source]▶

>>45065005 #

You know little about me, so it's better to assume less, no? My personal experience with medical data specifically is, that I would have been harmed by obstacles to data sharing that the UK medical system has in place, having not been familiar with computers and tech enough to anticipate the ways lack of data sharing will lead to outcomes undesirable to me. I wrote about that in a comment here https://news.ycombinator.com/item?id=45067219

525. ljosifov ◴[30 Aug 25 08:50 UTC] No.45073056{5}[source]▶

>>45070922 #

Yes - I meant 'impossible to difficult' to define to all people, at all times. Agree it's easy for me to define how that looks. It doesn't mean that the same is true to you. That's why I went from a very general, to very specific.

I'm saying we ended up in situation where people are lying when they say "I don't trust Google", b/c they have Gmail, use Google services - so their trust can't be zero. It's more than zero. Obviously it's a trade-off, people are pragmatic they do their cost-benefit analysis, and act accordingly. They just lie when they talk about the subject. I think it'd be better for all, if the public discussion moved from "I trust Google zero" (which is obviously untrue), to "There is cost-benefit to this, and I personally chose xyz".

526. 12ian34 ◴[30 Aug 25 09:14 UTC] No.45073180{4}[source]▶

>>45072864 #

> the ongoing public scare about sharing data

I think this might be a bit of a social bubble thing - I think it isn't a forefront concern for the vast majority of people.

replies(1): >>45073297 #

527. ljosifov ◴[30 Aug 25 09:39 UTC] No.45073297{5}[source]▶

>>45073180 #

I think you are correct there - the majority of the public don't care. They just try to get about doing their daily business and act the best they can under circumstances. So we just click "Accept" to any popup banner make it go away, accept "All cookies" 100 times every day, use Google mail/map/photos/drive and that all involves giving away data, even if in words we say we don't want to give data. So yes obviously the public by necessity act in a rational way, doing cost-benefit analysis. While a cadre of privacy obsessives have made my life worse by lobbying and having their bad ideas codified in the UL laws. Wrote about my experience in the UK medical systems here https://news.ycombinator.com/item?id=45066321

528. bluecalm ◴[30 Aug 25 10:00 UTC] No.45073377{5}[source]▶

>>45064686 #

I think it's happening already. Chat GPT was able to connect my name to my project based on chess.com profile and one Hacker News post for example. It's not that hard to imagine that it learns a solution to a rare problem based on one input point. It may see one solution 1000 times an a rare solution 1 time and it can still be able to reference both.

529. handoflixue ◴[30 Aug 25 10:01 UTC] No.45073380{4}[source]▶

>>45063111 #

How is it "implicit" to click "I agree" to a large pop-up that takes up most of the screen?

replies(1): >>45074293 #

530. benterix ◴[30 Aug 25 10:33 UTC] No.45073525{9}[source]▶

>>45072611 #

I can somehow understand the parent. If you control the language, you control the discourse. This is like the famous "I'm appalled at the negativity here on HN" comment threads when doing product launches etc. Or using euphemisms to avoid calling spade a spade.[0] People are fed up with these tricks, hence these emotional reactions.

[0] https://news.ycombinator.com/item?id=26346688

531. benterix ◴[30 Aug 25 10:35 UTC] No.45073537{5}[source]▶

>>45063846 #

Well, yes and no - it gives them more plausible deniability ("oh, this particular piece just ended up in the training set by accident") if they get caught when compared to the previous ToS.

532. benterix ◴[30 Aug 25 10:39 UTC] No.45073554{5}[source]▶

>>45063974 #

> These seem like randomly chosen generic grievances, not examples of companies making promises in their privacy policy (or similar) and breaking them. Am I missing some connection?

My point is that whenever we send our data to a third party, we can assume it could be abused, either unintentionally (by a hack, mistake etc.) or intentionally, because these companies are corrupted to the core and have a very relaxed attitude to obeying the law in general as these random examples show.

533. benterix ◴[30 Aug 25 10:46 UTC] No.45073587{5}[source]▶

>>45063639 #

> The output is considered derivative and therefore it’s not illegal.

Well, this is what they claim. In practice, this is untrue on several levels. First, earlier OpenAI models were able to quote verbatim, and they were maimed later not to do that. Second, there were several lawsuits against OpenAI and not all of them ended. And finally, assuming that courts decide what they did was legal would mean everyone can legally download and use a copy of Libgen (part of "Books3") whereas the courts around the world are doing the opposite and are blocking access to Libgen country by country. So unless you set double standards, something is not right here. Even Meta employees torrenting Lingen knew that so let's not pretend we buy this rhetoric.

534. speckx ◴[30 Aug 25 12:55 UTC] No.45074231[source]▶

>>45062852 #

I cancelled my subscription as well because of the opt-in by default.

535. danaris ◴[30 Aug 25 13:04 UTC] No.45074293{5}[source]▶

>>45073380 #

Courts in various jurisdictions have found clickwrap agreements to be generally only valid for what one would expect to be common provisions within such agreements.

Essentially, because they are presented in a form that is so easy to bypass and so very common in our modern online life, provisions that give up too much to the service provider or would be too unusual or unexpected to find in such an agreement are unenforceable.

536. op00to ◴[30 Aug 25 14:47 UTC] No.45075107{4}[source]▶

>>45063143 #

537. the_other ◴[30 Aug 25 15:05 UTC] No.45075280{3}[source]▶

>>45064012 #

In the late 00s a few friends sent me “connect with me on Facebook” messages. I thought “this looks suspicious, I don’t want to, but it seems all my friends are communicsting here now…”. So I joined, and in 10years everything I worried could happen with FB did, and much much worse.

So, I’ll trust my gut more on this one.

538. the_other ◴[30 Aug 25 15:06 UTC] No.45075301{5}[source]▶

>>45065110 #

The opposite happens. You make the decision to reject or leave that service, and then you no longer have to think about that thing any more.

539. igor47 ◴[30 Aug 25 16:24 UTC] No.45075871{11}[source]▶

>>45069585 #

You should read "the cuckoos egg", written by a happy go lucky nerd in the 80s dawn of network systems. Already there were bad actors in the system and he fought an uphill battle to implement network security. You're already standing on the shoulders of giants like him who saved the net -- i don't believe it could survive without a robust permission structure.

540. ◴[30 Aug 25 18:55 UTC] No.45077094{11}[source]▶

>>45071301 #

541. CatWChainsaw ◴[30 Aug 25 20:12 UTC] No.45077638{7}[source]▶

>>45063089 #

Fascism is definitionally when government and companies team up to screw everyone else.

542. picafrost ◴[31 Aug 25 06:45 UTC] No.45080928{3}[source]▶

>>45072683 #

I didn't claim they are profitable. There are endless articles about how it is not and may never be. But compared to the burn of OpenAI, xAI, Meta, Google, etc, it is burning billions less and focusing on a niche with demonstrated use cases and customers.

↑