Our great sponsors
|about 1 month ago||11 months ago|
|Apache License 2.0||Apache License 2.0|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
[P] We are building a curated list of open source tooling for data-centric AI workflows, looking for contributions.
12 projects | reddit.com/r/MachineLearning | 3 Mar 2023
The paid product came out of an open source tool: https://github.com/snorkel-team/snorkel
[Discussion] - "data sourcing will be more important than model building in the era of foundational model fine-tuning"
4 projects | reddit.com/r/MachineLearning | 3 Dec 2022
Can't use load_data from utils
2 projects | reddit.com/r/learnpython | 21 Feb 2022
Actually, I referenced it in my issue as well. There seems to be different utils.py file in different folders under the snorkel-tutorials repo but the utils file we get after importing snorkel has a different [file](https://github.com/snorkel-team/snorkel/blob/master/snorkel/utils/core.py) ,i.e. the utils file is different in the main snorkel repo
[D] A hand-picked selection of the best Python ML Libraries of 2021
2 projects | reddit.com/r/MachineLearning | 21 Dec 2021
[P] Composer: a new PyTorch library to train models ~2-4x faster with better algorithms
7 projects | reddit.com/r/MachineLearning | 16 Mar 2022
Pytorch lightning benchmarks against pytorch on every PR (benchmarks to make sure that it is mot slower.
[D] What Repetitive Tasks Related to Machine Learning do You Hate Doing?
4 projects | reddit.com/r/MachineLearning | 21 Feb 2022
There is already a ton of momentum around automating ML workflows. I would suggest you contribute to a preexisting project like, for instance, PyTorch Lightning or fast.ai.
[D] Are you using PyTorch or TensorFlow going into 2022?
6 projects | reddit.com/r/MachineLearning | 14 Dec 2021
Is the problem the sheer number of options, or the fact that they are all together in one place? Would it be better if they were organized into the different trainer entrypoints (fit, validate, ...)? If that is the case, there was an RFC proposing this which you might find interesting, feel free to drop by and comment on the issue: https://github.com/PyTorchLightning/pytorch-lightning/issues/10444
[D] Colab TPU low performance
2 projects | reddit.com/r/MachineLearning | 18 Nov 2021
I wanted to make a quick performance comparison between the GPU (Tesla K80) and TPU (v2-8) available in Google Colab with PyTorch. To do so quickly, I used an MNIST example from pytorch-lightning that trains a simple CNN.
[D] How to avoid CPU bottlenecking in PyTorch - training slowed by augmentations and data loading?
2 projects | reddit.com/r/MachineLearning | 10 Nov 2021
We've noticed GPU 0 on our 3 GPU system is sometimes idle (which would explain performance differences). However its unclear to us why that may be. Similar to this issue
[P] An introduction to PyKale https://github.com/pykale/pykale, a PyTorch library that provides a unified pipeline-based API for knowledge-aware multimodal learning and transfer learning on graphs, images, texts, and videos to accelerate interdisciplinary research. Welcome feedback/contribution!
2 projects | reddit.com/r/MachineLearning | 25 Apr 2021
If you want a good example for reference, take a look at Pytorch Lightning's readme (https://github.com/PyTorchLightning/pytorch-lightning) It answers the 3 questions of "what is this", "why should I care", and "how do i use it" almost instantly
2 projects | reddit.com/r/pytorch | 24 Apr 2021
[D] Advanced Takeaways from fast.ai book
2 projects | reddit.com/r/MachineLearning | 23 Mar 2021
Lower precision training can help and on pytorch lightning is just a simple flag you can set
[D] How to be more productive while doing Deep Learning experiments?
10 projects | reddit.com/r/MachineLearning | 25 Feb 2021
First of all, use high-level ML frameworks (AllenNLP, PyTorch-Lightning). No need to write boilerplate code and implement standard ML approaches from scratch. Here are some suggestions (thought more NLP-focused) that I feel improved my research coding experience a lot.
PyTorch Lightning Flash appears to be copying fastai (without any credit) [D]
2 projects | reddit.com/r/MachineLearning | 5 Feb 2021
According to the README it's patent pending, but I learned about that from this HN thread. Funny thing is I didn't even remember there was a snafu about patents, but looked it up because of some vague recollection of the PL founder getting into a tussle about some other trivial topic (apparently it was how well PyTorch works on TPUs).
What are some alternatives?
mmdetection - OpenMMLab Detection Toolbox and Benchmark
detectron2 - Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
pytorch-grad-cam - Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
sparktorch - Train and run Pytorch models on Apache Spark.
fastai - The fastai deep learning library
Keras - Deep Learning for humans
composer - Train neural networks up to 7x faster
skweak - skweak: A software toolkit for weak supervision applied to NLP tasks
Sacred - Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
metaflow - :rocket: Build and manage real-life data science projects with ease!
ffcv - FFCV: Fast Forward Computer Vision (and other ML workloads!)
skorch - A scikit-learn compatible neural network library that wraps PyTorch