I_am_tiberius:
In my opinion, training models on user data without their real consent (real consent meaning, for example, that the user must sign a contract, so they are definitely aware) should be considered a serious criminal offense.
jsheard:
Why single out user data specifically? Most of the data Anthropic and co train on was just scooped up from wherever with zero consent, not even the courtesy of a buried TOS clause, and their users were always implicitly fine with that. Forgive me for not having much sympathy when the users end up reaping what they've sown.
__MatrixMan__:
Publishing something is considered by most to be sufficient consent for it to no longer be private.

I realize there's a whole legal quagmire here involving intellectual "property" and what counts as a "derivative work", but that's a separate (and dubiously useful) part of the law.

chamomeal:
That is definitely true in the normal case, but I feel like the scale of LLM usage turns it into a different problem.

If you can use all of the content of Stack Overflow to create a "derivative work" that replaces Stack Overflow, and causes it to lose tons of revenue, is it really a derivative work?

I'm pretty sure solution sites like Chegg don't include the actual questions for that reason. The solutions to the questions are derivative, but the questions aren't.

airstrike:
Replacing Stack Overflow has no bearing on the definition of "derivative".