dagster-sklearn
yellowbrick
dagster-sklearn | yellowbrick | |
---|---|---|
3 | 2 | |
40 | 4,198 | |
- | 0.3% | |
0.0 | 2.8 | |
about 1 year ago | 9 months ago | |
Python | Python | |
- | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
dagster-sklearn
-
Scheduling tools for ETL and ML flow
I would give dagster a look. It has a built-in native scheduler and is cross-platform. It is general purpose, so your team can grow with it and tackle broader set of use cases if needed. If you struggle to get started after reading their docs/tutorials, you can take a look at my personal repo. Ive gotten a few feedback that my example has been very useful in getting started. I know they revamped their docs recently, but havent looked at their tutorial again or looked to see if they provided an intermediate level full example yet, so I need to get back in there to see.
-
Dagster Tutorials/Presentations
Hey! I've recently started to use dagster and it's been great with its 0.11.x releases. I am still a newbie with it and maybe only use 20% of its features and abstractions. Here's my work-in-progress personal Github repo. Not sure if you'll learn much from it.
-
Is anyone trying to switch out of data science, and if so, what jobs are you applying for?
I have created a trivial, contrived scikit-learn example using dagster so that people have an idea of how it can be used.
yellowbrick
- [D] DL Practitioners, Do You Use Layer Visualization Tools s.a GradCam in Your Process?
-
Any interesting open projects to join? Or anyone want with some good ideas want to start one?
I have contributed to Yellowbrick in the past. https://github.com/DistrictDataLabs/yellowbrick/
What are some alternatives?
Dask - Parallel computing with task scheduling
kmodes - Python implementations of the k-modes and k-prototypes clustering algorithms, for clustering categorical data
dagster - An orchestration platform for the development, production, and observation of data assets.
Anaconda - Anaconda turns your Sublime Text 3 in a full featured Python development IDE including autocompletion, code linting, IDE features, autopep8 formating, McCabe complexity checker Vagrant and Docker support for Sublime Text 3 using Jedi, PyFlakes, pep8, MyPy, PyLint, pep257 and McCabe that will never freeze your Sublime Text 3
ploomber - The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
itermplot - An awesome iTerm2 backend for Matplotlib, so you can plot directly in your terminal.
best-of-ml-python - 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
seaborn-image - High-level API for attractive and descriptive image visualization in Python
dagster-example-pipeline - Template Dagster repo using poetry and a single Docker container; works well with CICD
fpdf2 - Simple PDF generation for Python
scikit-survival - Survival analysis built on top of scikit-learn
sports-betting - Collection of sports betting AI tools.