snorkel
pytorch-lightning
DISCONTINUED
Our great sponsors
snorkel | pytorch-lightning | |
---|---|---|
5 | 19 | |
5,500 | 19,188 | |
0.5% | - | |
5.5 | 9.9 | |
about 1 month ago | 11 months ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
snorkel
-
[P] We are building a curated list of open source tooling for data-centric AI workflows, looking for contributions.
The paid product came out of an open source tool: https://github.com/snorkel-team/snorkel
- [Discussion] - "data sourcing will be more important than model building in the era of foundational model fine-tuning"
-
Can't use load_data from utils
Actually, I referenced it in my issue as well. There seems to be different utils.py file in different folders under the snorkel-tutorials repo but the utils file we get after importing snorkel has a different [file](https://github.com/snorkel-team/snorkel/blob/master/snorkel/utils/core.py) ,i.e. the utils file is different in the main snorkel repo
- [D] A hand-picked selection of the best Python ML Libraries of 2021
pytorch-lightning
-
[P] Composer: a new PyTorch library to train models ~2-4x faster with better algorithms
Pytorch lightning benchmarks against pytorch on every PR (benchmarks to make sure that it is mot slower.
-
[D] What Repetitive Tasks Related to Machine Learning do You Hate Doing?
There is already a ton of momentum around automating ML workflows. I would suggest you contribute to a preexisting project like, for instance, PyTorch Lightning or fast.ai.
-
[D] Are you using PyTorch or TensorFlow going into 2022?
Is the problem the sheer number of options, or the fact that they are all together in one place? Would it be better if they were organized into the different trainer entrypoints (fit, validate, ...)? If that is the case, there was an RFC proposing this which you might find interesting, feel free to drop by and comment on the issue: https://github.com/PyTorchLightning/pytorch-lightning/issues/10444
-
[D] Colab TPU low performance
I wanted to make a quick performance comparison between the GPU (Tesla K80) and TPU (v2-8) available in Google Colab with PyTorch. To do so quickly, I used an MNIST example from pytorch-lightning that trains a simple CNN.
-
[D] How to avoid CPU bottlenecking in PyTorch - training slowed by augmentations and data loading?
We've noticed GPU 0 on our 3 GPU system is sometimes idle (which would explain performance differences). However its unclear to us why that may be. Similar to this issue
-
[P] An introduction to PyKale https://github.com/pykale/pykale​, a PyTorch library that provides a unified pipeline-based API for knowledge-aware multimodal learning and transfer learning on graphs, images, texts, and videos to accelerate interdisciplinary research. Welcome feedback/contribution!
If you want a good example for reference, take a look at Pytorch Lightning's readme (https://github.com/PyTorchLightning/pytorch-lightning) It answers the 3 questions of "what is this", "why should I care", and "how do i use it" almost instantly
- Pytorch Template
-
[D] Advanced Takeaways from fast.ai book
Lower precision training can help and on pytorch lightning is just a simple flag you can set
-
[D] How to be more productive while doing Deep Learning experiments?
First of all, use high-level ML frameworks (AllenNLP, PyTorch-Lightning). No need to write boilerplate code and implement standard ML approaches from scratch. Here are some suggestions (thought more NLP-focused) that I feel improved my research coding experience a lot.
-
PyTorch Lightning Flash appears to be copying fastai (without any credit) [D]
According to the README it's patent pending, but I learned about that from this HN thread. Funny thing is I didn't even remember there was a snafu about patents, but looked it up because of some vague recollection of the PL founder getting into a tussle about some other trivial topic (apparently it was how well PyTorch works on TPUs).
What are some alternatives?
mmdetection - OpenMMLab Detection Toolbox and Benchmark
detectron2 - Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
pytorch-grad-cam - Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
sparktorch - Train and run Pytorch models on Apache Spark.
fastai - The fastai deep learning library
Keras - Deep Learning for humans
composer - Train neural networks up to 7x faster
skweak - skweak: A software toolkit for weak supervision applied to NLP tasks
Sacred - Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
metaflow - :rocket: Build and manage real-life data science projects with ease!
ffcv - FFCV: Fast Forward Computer Vision (and other ML workloads!)
skorch - A scikit-learn compatible neural network library that wraps PyTorch