←back to thread

77 points eatonphil | 1 comments | | HN request time: 0.242s | source
Show context
jauntywundrkind ◴[] No.40370671[source]
Nice to see methodology here. Ideally Lancedb lance v2 and nimble would also both be represented here. It feels like there's huge appetite to do better than Parquet; ideally work like this would help inform where we go next.

https://blog.lancedb.com/lance-v2/

https://github.com/facebookincubator/nimble

replies(1): >>40373163 #
apavlo ◴[] No.40373163[source]
Lance v2 looks interesting. I like their meta-data + container story. Lacking SOTA encoding schemes though.

There is also Vortex (https://github.com/fulcrum-so/vortex). That has modern encoding schemes that we want to use.

BtrBlocks (https://github.com/maxi-k/btrblocks) from the Germans is another Parquet alternative.

Nimble (formerly Alpha) is a complicated story. We worked with the Velox team for over a year to open-source and extend it. But plans got stymied by legal. This was in collaboration with Meta + CWI + Nvidia + Voltron. We decided to go a separate path because Nimble code has no spec/docs. Too tightly coupled with Velox/Folly.

Given that, we are working on a new file format. We hope to share our ideas/code later this year.

replies(2): >>40373277 #>>40374724 #
1. intelVISA ◴[] No.40374724[source]
Glad to see new ideas in this space, hopefully will be unleashing my own Parquet successor soon(tm).