213 points shcheklein | 1 comments
dmpetrov ◴[] No.41890616[source]
Hi there! Maintainer and author here. Excited to see DVC on the front page!

Happy to answer any questions about DVC and our sister project DataChain (https://github.com/iterative/datachain), which does data versioning under slightly different assumptions: no file copying and built-in data transformations.

replies(3): >>41890932 #>>41896923 #>>41897005 #
ajoseps ◴[] No.41890932[source]
If the data files are all just text files, what are the differences between DVC and using plain git?
replies(3): >>41891059 #>>41891080 #>>41893500 #
1. agile-gift0262 ◴[] No.41893500[source]
It's not just to manage file versioning. You can define a pipeline with different stages, along with the dependencies and outputs of each stage, and DVC will figure out which stages need to run based on which dependencies have changed. Stages can also output metrics and plots, and DVC has utilities to expose, explore, and compare those.
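
To make the "figure out which stages need running" part concrete, here is a toy sketch of the general idea in Python. To be clear, this is not DVC's code or API: the function `stale_stages`, the stage names, and the file names are all made up for illustration, and I'm assuming the stages are already listed in dependency order. The idea is just that each stage records a hash of its dependencies after a successful run, and a stage is stale if any hash differs or if an upstream stage is going to rewrite one of its inputs.

```python
import hashlib


def digest(data: bytes) -> str:
    """Content hash used to detect whether a dependency changed."""
    return hashlib.sha256(data).hexdigest()


def stale_stages(stages, files, lock):
    """Return the names of stages that need to rerun.

    stages: dict of name -> {"deps": [...], "outs": [...]},
            assumed to be listed in dependency (topological) order
    files:  dict of path -> current content as bytes
    lock:   dict of name -> {path: digest} recorded at the last run
    """
    stale = []
    dirty_outs = set()  # outputs of stages already marked stale
    for name, stage in stages.items():
        recorded = lock.get(name, {})
        changed = any(
            dep in dirty_outs or recorded.get(dep) != digest(files[dep])
            for dep in stage["deps"]
        )
        if changed:
            stale.append(name)
            dirty_outs.update(stage["outs"])
    return stale


# Hypothetical three-stage pipeline (names invented for this example).
stages = {
    "prepare": {"deps": ["raw.csv"], "outs": ["clean.csv"]},
    "train": {"deps": ["clean.csv", "params.txt"], "outs": ["model.bin"]},
    "evaluate": {"deps": ["model.bin", "clean.csv"], "outs": ["metrics.json"]},
}
files = {
    "raw.csv": b"a,b\n1,2\n",
    "clean.csv": b"a,b\n1,2\n",
    "params.txt": b"lr=0.01",
    "model.bin": b"weights",
}
# Pretend every stage ran successfully against the current files.
lock = {
    name: {dep: digest(files[dep]) for dep in stage["deps"]}
    for name, stage in stages.items()
}

print(stale_stages(stages, files, lock))  # []
files["params.txt"] = b"lr=0.1"
print(stale_stages(stages, files, lock))  # ['train', 'evaluate']
```

Editing only `params.txt` leaves `prepare` alone but marks `train` and everything downstream of it as stale. In DVC itself, as far as I know, the stage definitions live in `dvc.yaml`, the recorded hashes live in `dvc.lock`, and `dvc repro` does the rerun, so you never write anything like the above by hand.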