Turbopuffer: Fast search on object storage

(turbopuffer.com)

379 points Sirupsen | 2 comments | 09 Jul 24 14:48 UTC

eknkc ◴[09 Jul 24 21:29 UTC] No.40921379[source]▶

Is there a good general purpose solution where I can store a large read only database in s3 or something and do lookups directly on it?

DuckDB can open Parquet files over HTTP and query them, but I found that it triggers a lot of small requests, reading from many places in the files. I mean a lot.

I mostly need key/value lookups and could potentially store each key in a separate object in S3, but with a couple hundred million objects, it would be much more manageable to have a single file and maybe a cacheable index.
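To make the "single file plus a cacheable index" idea concrete, here is a minimal sketch, not a recommendation of any particular tool. It packs all values into one blob, keeps a small index of key to (offset, length), and serves each lookup with a single ranged read. Everything here is hypothetical: `build_blob`, `make_range_reader`, and `lookup` are illustrative names, and the in-memory slice stands in for an HTTP `Range: bytes=start-end` request against object storage.

```python
from typing import Callable, Dict, Optional, Tuple

# Hypothetical index layout: key -> (byte offset, byte length) inside the blob.
Index = Dict[str, Tuple[int, int]]


def build_blob(records: Dict[str, bytes]) -> Tuple[bytes, Index]:
    """Concatenate all values into one blob and record where each value lives."""
    blob = bytearray()
    index: Index = {}
    for key in sorted(records):
        value = records[key]
        index[key] = (len(blob), len(value))
        blob += value
    return bytes(blob), index


def make_range_reader(blob: bytes) -> Callable[[int, int], bytes]:
    """Assumption: stands in for a ranged GET against the single stored object."""
    def read_range(offset: int, length: int) -> bytes:
        return blob[offset:offset + length]
    return read_range


def lookup(key: str, index: Index,
           read_range: Callable[[int, int], bytes]) -> Optional[bytes]:
    """Resolve a key with one index probe and at most one ranged read."""
    entry = index.get(key)
    if entry is None:
        return None
    offset, length = entry
    return read_range(offset, length)


records = {"user:1": b'{"name": "ada"}', "user:2": b'{"name": "linus"}'}
blob, index = build_blob(records)
read_range = make_range_reader(blob)
value = lookup("user:2", index, read_range)
```

With a couple hundred million keys, a full in-memory index like this one would itself be large, so a real setup would probably need a sparse or paged index that is fetched and cached in pieces. That part is left out of the sketch.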

replies(5): >>40922137 #>>40922166 #>>40922842 #>>40923712 #>>40927099 #

1. cdchn ◴[10 Jul 24 04:29 UTC] No.40923712[source]▶

>>40921379 #

>Is there a good general purpose solution where I can store a large read only database in s3 or something and do lookups directly on it?

I think this is pretty much what AWS Athena is.

replies(1): >>40926053 #

2. tiew9Vii ◴[10 Jul 24 12:16 UTC] No.40926053[source]▶

>>40923712 (TP) #

Cloud Backed SQLite looks like it might be a good fit for this. It doesn’t support S3, though.

https://sqlite.org/cloudsqlite/doc/trunk/www/index.wiki
