|7 days ago||5 days ago|
|GNU General Public License v3.0 or later||Apache License 2.0|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
[D] GPU buying recommendation
1 project | reddit.com/r/MachineLearning | 17 Jul 2021
If you just want to run tensorflow or pytorch for a Jupyter notebook, setting the environment shouldn't be difficult. I know that AWS has a marketplace of preconfigured images. However, you can go as advanced as setting up a cluster of gpu-equipped nodes to setup Horovod (https://github.com/horovod/horovod) to do distributed machine learning. Yes, there's a learning curve, but you cannot acquire this skillet any other way.
SKLean, TensorFlow, etc vs Spark ML?
1 project | reddit.com/r/apachespark | 12 Feb 2021
I'm the maintainer for an open source project called Horovod that allows you to distribute deep learning training (e.g., TensorFlow) on platforms like Spark.
Cluster machine learning
1 project | reddit.com/r/HPC | 11 Feb 2021
You'll want to use horovod to run keras in a distributed system. Then use Slurm to manage the cluster and run the job.
[D] Productionalizing machine learning pipelines for small teams
3 projects | reddit.com/r/MachineLearning | 8 Aug 2021
For running experiments, http://polyaxon.com/ is a really good free open-source package that has lots of nice integrations so you can quickly run experiments in k8s but it might be overkill in some cases.
Top 5 tools to get started with MLOps !
4 projects | reddit.com/r/MLOpsIndia | 16 Jul 2021
Polyaxon : https://polyaxon.com
Open source alternative to AWS Sagemaker, Google AI Platform, and Azure ML
1 project | reddit.com/r/CKsTechNews | 28 Mar 20211 project | news.ycombinator.com | 28 Mar 2021
What are some alternatives?
MLflow - Open source platform for the machine learning lifecycle
petastorm - Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
DeepDanbooru - AI based multi-label girl image classification system, implemented by using TensorFlow.
NudeNet - Neural Nets for Nudity Detection and Censoring
thinc - 🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
pytorch-summary - Model summary in PyTorch similar to `model.summary()` in Keras
onepanel - The open source, end-to-end computer vision platform. Label, build, train, tune, deploy and automate in a unified platform that runs on any cloud and on-premises.
AdamP - AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021)
NLNS - Neural Large Neighborhood Search for the Capacitated Vehicle Routing Problem
mpi-operator - Kubernetes Operator for Allreduce-style Distributed Training
ploomber - Write maintainable, production-ready pipelines using Jupyter or your favorite text editor. Develop locally, deploy to the cloud. ☁️
eaf-jupyter - Jupyter in Emacs