←back to thread

245 points gatesn | 3 comments | | HN request time: 0.765s | source
Show context
the_mitsuhiko ◴[] No.41840459[source]
> One of the unique attributes of the (in-progress) Vortex file format is that it encodes the physical layout of the data within the file's footer. This allows the file format to be effectively self-describing and to evolve without breaking changes to the file format specification.

That is quite interesting. One challenge in general with parqet and arrow in the otel / observability ecosystem is that the shape of data is not quite known with spans. There are arbitrary attributes on them, and they can change. To the best of my knowledge no particularly great solution exists today for encoding this. I wonder to which degree this system could be "abused" for that.

replies(8): >>41840665 #>>41842038 #>>41842282 #>>41842347 #>>41843259 #>>41844697 #>>41846992 #>>41848634 #
1. cle ◴[] No.41842038[source]
Isn't this what the Arrow IPC File format does too? Is there something unique about this?
replies(1): >>41842276 #
2. _willmanning ◴[] No.41842276[source]
Compression! Vortex can easily be 10x smaller than the equivalent Arrow representation (and decompresses very quickly into Arrow)
replies(1): >>41843374 #
3. cle ◴[] No.41843374[source]
Nice!