sagemaker-training-toolkit
data-science-ipython-notebooks
sagemaker-training-toolkit | data-science-ipython-notebooks | |
---|---|---|
1 | 1 | |
470 | 26,490 | |
2.8% | - | |
6.3 | 0.0 | |
about 1 month ago | about 2 months ago | |
Python | Python | |
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sagemaker-training-toolkit
-
Distributed training with Horovod/MPI
I'm using sagemaker-training-toolkit to attempt hyperparameter optimization and trying to take advantage of all the cores on each machine using their MPI options (which uses Horovod with MPI to my understanding). I'm pretty new to this space and can't find anything that describes in somewhat lay-terms how training works in this distributed model. With AllReduce, how often does the reduce happen? I'm trying to figure out if all training threads are training a shared model such that every thread is training on the "latest" version of the model.
data-science-ipython-notebooks
-
Beginner in Python for Data Science
data science ipython notebooks
What are some alternatives?
image-super-resolution - 🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.
manjaro-linux - Shell scripts for setting up Manjaro Linux for Python programming and deep learning
jina - ☁️ Build multimodal AI applications with cloud-native stack
BirdNET - Soundscape analysis with BirdNET.
Activeloop Hub - Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets. Stream data real-time to PyTorch/TensorFlow. https://activeloop.ai [Moved to: https://github.com/activeloopai/deeplake]
fugue - A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
torchlambda - Lightweight tool to deploy PyTorch models to AWS Lambda
kmodes - Python implementations of the k-modes and k-prototypes clustering algorithms, for clustering categorical data
spotty - Training deep learning models on AWS and GCP instances
sports-betting - Collection of sports betting AI tools.
sagemaker-tensorflow-training-toolkit - Toolkit for running TensorFlow training scripts on SageMaker. Dockerfiles used for building SageMaker TensorFlow Containers are at https://github.com/aws/deep-learning-containers.
PMapper - A tool for quickly evaluating IAM permissions in AWS.