Will Amazon S3 Vectors kill vector databases or save them?

(zilliz.com)

276 points Fendy | 1 comments | 08 Sep 25 15:35 UTC | HN request time: 0s | source

Show context

simonw ◴[08 Sep 25 16:27 UTC] No.45170289[source]▶

This is a good article and seems well balanced despite being written by someone with a product that directly competes with Amazon S3. I particularly appreciated their attempt to reverse-engineer how S3 Vectors work, including this detail:

> Filtering looks to be applied after coarse retrieval. That keeps the index unified and simple, but it struggles with complex conditions. In our tests, when we deleted 50% of data, TopK queries requesting 20 results returned only 15—classic signs of a post-filter pipeline.

Things like this are why I'd much prefer if Amazon provided detailed documentation of how their stuff works, rather than leaving it to the development community to poke around and derive those details independently.

replies(5): >>45171116 #>>45171985 #>>45172432 #>>45177278 #>>45180236 #

libraryofbabel ◴[08 Sep 25 19:04 UTC] No.45172432[source]▶

>>45170289 #

> Things like this are why I'd much prefer if Amazon provided detailed documentation of how their stuff works, rather than leaving it to the development community to poke around and derive those details independently.

Absolutely this. So much engineering time has been wasted on reverse-engineering internal details of things in AWS that could be easily documented. I once spent a couple days empirically determining how exactly cross-AZ least-outstanding-requests load balancing worked with AWS's ALB because the docs didn't tell me. Reverse-engineering can be fun (or at least I kinda enjoy it) but it's not a good use of our time and is one of those shadow costs of using the Cloud.

It's not like there's some secret sauce here in most of these implementation details (there aren't that many ways to design a load balancer). If there was, I'd understand not telling us. This is probably less an Apple-style culture of secrecy and more laziness and a belief that important details have been abstracted away from us users because "The Cloud" when in fact, these details do really matter for performance and other design decisions we have to make.

replies(8): >>45172495 #>>45172524 #>>45172841 #>>45175537 #>>45177495 #>>45178724 #>>45184264 #>>45185668 #

TheSoftwareGuy ◴[08 Sep 25 19:10 UTC] No.45172524[source]▶

>>45172432 #

>It's not like there's some secret sauce here in most of these implementation details. If there was, I'd understand not telling us. This is probably less an Apple-style culture of secrecy and more laziness and a belief that important details have been abstracted away from us users because "The Cloud" when in fact, these details do really matter for performance and other design decisions we have to make.

Having worked inside AWS I can tell you one big reason is the attitude/fear that anything we put in out public docs may end up getting relied on by customers. If customers rely on the implementation to work in a specific way, then changing that detail requires a LOT more work to prevent breaking customer's workloads. If it is even possible at that point.

replies(6): >>45172717 #>>45174090 #>>45175464 #>>45177301 #>>45178450 #>>45179528 #

wubrr ◴[08 Sep 25 21:14 UTC] No.45174090[source]▶

>>45172524 #

Right now, it is basically impossible to reliably build full applications with things like DynamoDB (among other AWS products), without relying on internal behaviour which isn't explicitly documented.

replies(3): >>45174812 #>>45176329 #>>45177577 #

cbsmith ◴[09 Sep 25 01:27 UTC] No.45176329[source]▶

>>45174090 #

I've built several DynamoDB apps, and while you might have some expectations of internal behaviour, you can build apps that are pretty resilient to change of the internal behaviour but rely heavily on the documented behaviour. I actually find the extent of the opacity a helpful guide on the limitations of the service.

replies(1): >>45177912 #

1. catlifeonmars ◴[09 Sep 25 05:51 UTC] No.45177912{3}[source]▶

>>45176329 #

Agree. TTL 48h SLA comes to mind.

↑