example-get-started
ZnTrack
Metric | example-get-started | ZnTrack |
---|---|---|
Mentions | 2 | 2 |
Stars | 167 | 41 |
Growth | 0.0% | - |
Activity | 4.0 | 7.7 |
Last commit | 13 days ago | 3 days ago |
Language | Python | Python |
License | - | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
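As a rough reading aid, the single example above (9.0 corresponds to the top 10%) can be extended to other scores if we assume the scale is a linear percentile from 0 to 10. That linearity is an assumption made here for illustration, not something the page states, and the helper name `top_percent` is hypothetical.

```python
def top_percent(activity: float) -> float:
    """Estimate the 'top N%' bracket for an activity score.

    ASSUMPTION: the score is a linear percentile on a 0-10 scale,
    extrapolated from the one documented data point (9.0 -> top 10%).
    The real scoring formula is not given in the source.
    """
    if not 0.0 <= activity <= 10.0:
        raise ValueError("activity is expected to lie between 0 and 10")
    # Round to one decimal to hide floating-point noise such as 23.000000000000004.
    return round((10.0 - activity) * 10.0, 1)


if __name__ == "__main__":
    # Activity values taken from the comparison table on this page.
    for name, score in [("example-get-started", 4.0), ("ZnTrack", 7.7)]:
        print(f"{name}: activity {score} -> roughly top {top_percent(score)}% (under the linear assumption)")
```

Treat the result as an approximation only; the tracker describes activity as a relative, recency-weighted number, so the true ranking may differ.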
example-get-started
- VS Code extension to track ML experiments
Or open this project https://github.com/iterative/example-get-started in GitHub Codespaces as an example. It will run the extension in Codespaces automatically.
- Tuning Hyperparameters with Reproducible Experiments
We're going to be working with an existing NLP project. You can get the code we're working with in this repo. It already has DVC set up, but you can check out the Get Started docs if you want to know how the DVC pipeline was created.
ZnTrack
- What are some good examples of well-engineered pipelines
I expanded a bit on them with my own package https://zntrack.readthedocs.io/ - a general framework for building DVC pipelines through Python scripts (and more). This finally brings me to the project I'm actually working on, https://github.com/zincware/IPSuite, which brings all of this together for the specific use case of machine-learned interatomic potentials.
- HPC Rocket - A tool to run Slurm jobs from CI pipelines
This looks really interesting! I have a similar scenario but haven't looked into it yet. Have you looked at dvc.org? I'm planning on using it together with Slurm and what they call CML for my projects. In that context I also wrote a tool that makes DVC more Pythonic, https://github.com/zincware/ZnTrack, although I'm currently restructuring it a bit with backwards compatibility in mind.
What are some alternatives?
metaflow - :rocket: Build and manage real-life ML, AI, and data science projects with ease!
fiftyone - The open-source tool for building high-quality datasets and computer vision models
PyDrive2 - Google Drive API Python wrapper library. Maintained fork of PyDrive.
mlem - 🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞
cml_dvc_case
Activeloop Hub - Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets. Stream data real-time to PyTorch/TensorFlow. https://activeloop.ai [Moved to: https://github.com/activeloopai/deeplake]
dataset-registry - Dataset registry DVC project
messages - xcompute sub-module (CAE schema layer) including serialization utilities and bindings