←back to thread

276 points Fendy | 1 comments | | HN request time: 0.202s | source
Show context
resters ◴[] No.45170203[source]
By hosting the vectors themselves, AWS can meta-optimize its cloud based on content characteristics. It may seem like not a major optimization, but at AWS scale it is billions of dollars per year. It also makes it easier for AWS to comply with censorship requirements.
replies(3): >>45170388 #>>45170758 #>>45173752 #
1. j45 ◴[] No.45173752[source]
Also, if it's not encrypted, I'm not sure if AWS or others "synthesize" customer data by a cursory scrubbing of so called client identifying information, and then try to optimize and model for those scenarios at scale.

I do feel more and more some information in the corpus of AI models was done this way. A client's name and private identifiable information might not be in the model, but some patterns of how to do things sure seem to come up from such sources.