
276 points Fendy | 2 comments
resters ◴[] No.45170203[source]
By hosting the vectors themselves, AWS can meta-optimize its cloud based on content characteristics. It may not seem like a major optimization, but at AWS scale it amounts to billions of dollars per year. It also makes it easier for AWS to comply with censorship requirements.
replies(3): >>45170388 #>>45170758 #>>45173752 #
barbazoo ◴[] No.45170388[source]
> It also makes it easier for AWS to comply with censorship requirements.

Does it? How? Why would the vector store in particular make it easier for them to censor content? Why not censor the documents in S3 directly, or the entries in the relational database? What is different about censoring those vs. a vector store?

replies(1): >>45170514 #
resters ◴[] No.45170514[source]
Once a vector has been generated (and someone has paid for it), it can be searched and relevant content identified without AWS incurring any additional cost to build a separate censorship-oriented index. AWS can also add extra bits to the vector that serve its internal goals (scalability, censorship, etc.).
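
To make the point concrete: once per-document embeddings exist, anyone with read access to them can screen content by similarity to a query vector, with no extra index or re-processing. A minimal sketch (all names and vectors here are hypothetical toy data, not any actual AWS API):

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def flag_similar(doc_vectors: dict, query_vector: np.ndarray,
                 threshold: float = 0.8) -> list:
    """Return IDs of stored vectors close to the query vector.

    This is a brute-force scan over embeddings the customer already
    paid to generate -- no separate purpose-built index is needed.
    """
    return [doc_id for doc_id, vec in doc_vectors.items()
            if cosine_similarity(vec, query_vector) >= threshold]

# Hypothetical stored customer embeddings (toy 3-d vectors).
docs = {
    "doc-a": np.array([1.0, 0.0, 0.0]),
    "doc-b": np.array([0.9, 0.1, 0.0]),
    "doc-c": np.array([0.0, 1.0, 0.0]),
}
query = np.array([1.0, 0.0, 0.0])
print(flag_similar(docs, query))  # ['doc-a', 'doc-b']
```

The same nearest-neighbor primitive that powers a customer's retrieval workload works unchanged for any other query the host cares to run against the stored vectors.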

Not to mention there is lock-in once you've gone to the trouble of running a specific embedding model over a bunch of content. Ideally we'd converge on backwards-compatible, open-source approaches, but cloud vendors want to offer "value" via "better" embedding models that are not open source.
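
The lock-in follows from the geometry: vectors from two different embedding models live in incompatible spaces (often not even the same dimensionality), so an index built with one model is useless with another. A toy sketch of why switching forces a full re-embed (the "models" below are deterministic random stand-ins, not real embedding models):

```python
import numpy as np

def model_a(text: str) -> np.ndarray:
    """Toy stand-in for embedding model A: 4-d vectors."""
    seed = sum(text.encode())  # deterministic per input
    return np.random.default_rng(seed).standard_normal(4)

def model_b(text: str) -> np.ndarray:
    """Toy stand-in for embedding model B: 8-d vectors."""
    seed = sum(text.encode()) + 1
    return np.random.default_rng(seed).standard_normal(8)

corpus = ["alpha", "beta", "gamma"]
index_a = {t: model_a(t) for t in corpus}  # index built with model A

# Querying model A's index with a model B vector fails outright:
# the vectors are not even the same length, let alone comparable.
try:
    np.dot(index_a["alpha"], model_b("alpha"))
except ValueError:
    print("incompatible vector spaces")

# The only remedy is re-embedding the entire corpus with the new
# model -- the switching cost that creates the lock-in.
index_b = {t: model_b(t) for t in corpus}
```

Even when two models happen to share a dimensionality, their spaces are differently oriented, so cross-model similarity scores are meaningless without re-embedding everything.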

replies(4): >>45170544 #>>45170605 #>>45170776 #>>45173764 #
1. barbazoo ◴[] No.45170544[source]
And that doesn't apply to any other database/search technology AWS offers?
replies(1): >>45170705 #
2. resters ◴[] No.45170705[source]
It applies to some of them, but not most, which is why Azure and GCP offer nearly identical core services.