←back to thread

Dolt is Git for data

(www.dolthub.com)
358 points timsehn | 1 comments | | HN request time: 0s | source
Show context
sytse ◴[] No.22734084[source]
Very cool! The world needs better version control for data.

How does this compare to something like Pachyderm?

How does it work under the covers? What is a splice and what does it mean when it overlaps? https://github.com/liquidata-inc/dolt/blob/84d9eded517167eb2...

Is it feasible to use Conflict-free Replicated Data Types (CRDT) for this?

replies(2): >>22734407 #>>22735544 #
aboodman ◴[] No.22735544[source]
Hi Sid, if you are curious about how it works internally, you can read some of the old docs from Noms here (Dolt uses a fork of Noms as its internal storage layer).

https://github.com/attic-labs/noms/blob/master/doc/intro.md

To answer your question, it is pretty easy to make Noms (or Dolt) into a CRDT by defining a merge function that is deterministic.

We experimented with this in Noms but the result wasn't that satisfying and we didn't take it any further:

https://github.com/attic-labs/noms/blob/master/doc/decent/ab...

replies(1): >>22744196 #
sytse ◴[] No.22744196[source]
Thanks for your answer!
replies(1): >>22753485 #
1. sytse ◴[] No.22753485[source]
The article was published with https://www.dolthub.com/blog/2020-03-30-dolt-use-cases/ and here is the short HN thread https://news.ycombinator.com/item?id=22751209