←back to thread

Dolt is Git for data

(www.dolthub.com)
358 points timsehn | 1 comments | | HN request time: 0s | source
Show context
peteforde ◴[] No.22734564[source]
Only 39 days since the last "GitHub for data" was announced: https://news.ycombinator.com/item?id=22375774

I'll say what I said in February: I started a company with the same premise 9 years ago, during the prime "big data" hype cycle. We burned through a lot of investor money only to realize that there was not a market opportunity to capture. That is, many people thought it was cool - we even did co-sponsored data contests with The Economist - but at the end of the day, we couldn't find anyone with an urgent problem that they were willing to pay to solve.

I wish these folks luck! Perhaps things have changed; we were part of a flock of 5 or 10 similar projects and I'm pretty sure the only one still around today is Kaggle.

https://www.youtube.com/watch?v=EWMjQhhxhQ4

replies(15): >>22734677 #>>22734738 #>>22734742 #>>22734839 #>>22735019 #>>22735030 #>>22735213 #>>22735358 #>>22735661 #>>22736049 #>>22736513 #>>22736785 #>>22737514 #>>22737860 #>>22738642 #
ken ◴[] No.22735030[source]
That's GitHub for data. It's a service, and they still haven't launched anything yet.

This is Git for data. It's a program, and it appears to be an open-source one you can download and use today.

replies(2): >>22735037 #>>22735068 #
enos_feedler ◴[] No.22735068[source]
There is actually an old git for data project too:

https://github.com/datproject/dat

It's ~5 years old and I really wanted it to be huge. Hoping this new project is a success. Especially since I notice I went to high school with one of the founders of Dolt (Hey Tim!)

replies(3): >>22735185 #>>22736907 #>>22737454 #
1. visarga ◴[] No.22735185{3}[source]
Can it remove a file from the repo history? It's a GDPR feature that makes git hard to use for data.