Fighting the New York Times' invasion of user privacy

(openai.com)

237 points meetpateltech | 4 comments | 12 Nov 25 14:08 UTC | HN request time: 0.199s | source

Show context

1vuio0pswjnm7 ◴[12 Nov 25 16:11 UTC] No.45901917[source]▶

>>45900370 (OP) #

"The New York Times is demanding that we turn over 20 million of your private ChatGPT conversations."

As might any plaintiff. NYT might be the first of many others and the lawsuits may not be limited to copyright claims

Why has OpenAI collected and stored 20 million conversations (including "deleted chats")

What is the purpose of OpenAI storing millions of private conversations

By contrast the purpose of NYT's request is both clear and limited

The documents requested are not being made public by the plaintiffs. The documents will presumably be redacted to protect any confidential information before being produced to the plaintiffs, the documents can only be used by the plaintiffs for the purpose of the litigation against OpenAI and, unlike OpenAI who has collected and stored these conversations for as long as OpenAI desires, the plaintiffs are prohibited from retaining copies of the documents after the litigation is concluded

The privacy issue here has been created by OpenAI for their own commercial benefit

It is not even clear what this benefit, if any, will be as OpenAI continues to search for a "business model"

Wanton data collection

replies(8): >>45902173 #>>45902539 #>>45903180 #>>45903873 #>>45904258 #>>45904285 #>>45905079 #>>45906333 #

silveraxe93 ◴[12 Nov 25 16:32 UTC] No.45902173[source]▶

>>45901917 #

No it's not. It's literally a court order mandating them to collect this data.

- [1] https://arstechnica.com/tech-policy/2025/08/openai-offers-20...

replies(2): >>45902784 #>>45903718 #

otterley ◴[12 Nov 25 18:19 UTC] No.45903718[source]▶

>>45902173 #

This article says nothing of the sort. The court order is to preserve existing logs they already have, not to disable logging, and hand all the logs over the plaintiffs. OpenAI's objections are mainly that 1/there are too many logs (so they're proposing a sample instead) and that 2/there's identifying data in the logs and so they are being "forced" to anonymize the logs at their expense (even though it's what they want as a condition of transferring the logs).

There is nothing in the article that mentions OpenAI being forced to create new logs they don't already have.

replies(1): >>45904727 #

1. ◴[12 Nov 25 19:18 UTC] No.45904817[source]▶

>>45904727 (TP) #

2. pclmulqdq ◴[12 Nov 25 19:29 UTC] No.45905022[source]▶

>>45904727 (TP) #

If OpenAI truly didn't keep conversation records for any length of time, they would not be subject to this kind of order. Lots of stateless services get these and are able to defeat them because they never store the user's data. The fact that they store them at all means that they are in scope for a preservation order. It also means that they are in scope for all manner of usage by OpenAI themselves even if a user requests deletion.

replies(1): >>45906397 #

3. dghlsakjg ◴[12 Nov 25 20:46 UTC] No.45906397[source]▶

>>45905022 #

It seems as if the court has forced OpenAI into collecting logs that they weren't otherwise collecting, or that they were deleting at user request.

So in this case not keeping logs as ordered by the court would be contempt of court.

replies(1): >>45907509 #

4. otterley ◴[12 Nov 25 22:08 UTC] No.45907509{3}[source]▶

>>45906397 #

Respectfully, it doesn’t matter the way it “seems,” it matters what is. They were collecting these logs, and as soon as they got the preservation order, they disabled deletion functionality and notified their customers of that.

There is a separate higher-tier private API customers can pay for that never had logging enabled, and the court did not force the company to add it.

↑