Our great sponsors
-
flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
The most important thing is versioning and reproducibility, https://github.com/flyteorg/flyte is an option for data pipelines but is quite complicated. As long as the path from raw data to input to the model is traceable any solution is fine.
For running experiments, http://polyaxon.com/ is a really good free open-source package that has lots of nice integrations so you can quickly run experiments in k8s but it might be overkill in some cases.
I wrote a detailed survey on this. However, I'm biased since I have a project of my own: Ploomber.