Managing the Data Pipeline with Git + Luigi Last updated February 25, 2015 One of the common pains of managing data, especially for larger companies, is that a lot of data gets dirty (which you may or may not even notice!) and becomes scattered around everywhere. Many ad hoc scripts are running in different places, these scripts silently generate dirty data. Further, if and when a script results i
{{#tags}}- {{label}}
{{/tags}}