If you have a Claude account, they're going to train on your data moving forward

1. ratg13 ◴[29 Aug 25 12:44 UTC] No.45063367[source]▶

I can understand training AIs on books, and even internet forums, but I can't help but think that training an AI on lots of dumb questions with probably an excessive amount of grammar and spelling errors will somehow make it smarter.

replies(4): >>45063503 #>>45063715 #>>45064070 #>>45068370 #

2. dahsameer ◴[29 Aug 25 12:56 UTC] No.45063503[source]▶

>>45063367 (TP) #

> and even internet forums

i would consider internet forums also includes a lot of dumb questions

replies(2): >>45063666 #>>45066596 #

3. ratg13 ◴[29 Aug 25 13:12 UTC] No.45063666[source]▶

>>45063503 #

Agree, but people generally take a small pause before saying stuff online.

In 'private', people are less ashamed of their ignorance, and also know they can say gibberish and the AI will figure it out.

4. nrclark ◴[29 Aug 25 13:15 UTC] No.45063715[source]▶

>>45063367 (TP) #

Depends on how you’re using the data. There’s a pretty strong correctness signal in the user behavior.

Did they rephrase the question? Probably the first answer was wrong. Did the session end? Good chance the answer was acceptable. Did they ask follow-ups? What kind? Etc.

replies(2): >>45064137 #>>45064409 #

5. mrweasel ◴[29 Aug 25 13:46 UTC] No.45064070[source]▶

>>45063367 (TP) #

They train AI on Reddit and Stack Overflow questions, I can't see it getting any worse.

6. dudefeliciano ◴[29 Aug 25 13:51 UTC] No.45064137[source]▶

>>45063715 #

> Did the session end? Good chance the answer was acceptable.

Or that the user just ragequit

7. vb-8448 ◴[29 Aug 25 14:13 UTC] No.45064409[source]▶

>>45063715 #

I'm used to doing the same task 4 or 5 times (different sessions, similar prompts), and most of the time the result is useless or completely wrong. Sometimes I go back and pick the first result, other time none of them, other time a mix of them. I'm wondering how can they extract value from this.

8. timeon ◴[29 Aug 25 17:00 UTC] No.45066596[source]▶

>>45063503 #

Like what?

9. victorbjorklund ◴[29 Aug 25 19:31 UTC] No.45068370[source]▶

>>45063367 (TP) #

Doubt they feed everything in. They probably pick out a small subset of conversations for the training round.