"[Discussion]" Should I be using DVC (Data Version Control) in my day-to-day work?

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • dvc

    🦉 ML Experiments and Data Management with Git

  • You mean https://dvc.org/

  • MLflow

    Open source platform for the machine learning lifecycle

  • Recently (just a few days back), we started looking into setting up our own wandb-like server using mlflow. It's neat. So, highly recommended to try if you want to track all your experiments.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • ml-git

    Discontinued ML-Git is a tool which provides a Distributed Version Control system to enable efficient dataset management. Like its name emphasizes, it is meant to be like git in mindset, concept and workflows. ML-Git enables the following operations. Manage a repository of different datasets, labels and models. Versioning immutable versions of models, labels and documents. Distribute these ML artifacts between members of a team or across organizations. Apply the right data governance and security models to t

  • What you did looks like how ml-git works: https://github.com/HPInc/ml-git

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts