multimodal
mmf
multimodal | mmf | |
---|---|---|
3 | 2 | |
1,301 | 5,417 | |
3.4% | 0.1% | |
8.0 | 5.5 | |
5 days ago | 2 months ago | |
Python | Python | |
BSD 3-clause "New" or "Revised" License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
multimodal
-
[D]Are there any good solutions for multimodal classification? Libraries, AutoML tool?
There was a PyTorch MultiModal repo released just recently: https://github.com/facebookresearch/multimodal
- [N] TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
- TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
mmf
-
Context in first comment
mmf, which is a multimodal pytorch framework by facebook research, was released around 2-3 years ago and is now poorly maintained.
-
[N] TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
How is this different from mmf? https://github.com/facebookresearch/mmf
What are some alternatives?
label-studio - Label Studio is a multi-type data labeling and annotation tool with standardized output format
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Multimodal-Toolkit - Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
smgeo - Geolocation Inference for Reddit
PyTorch-NLP - Basic Utilities for PyTorch Natural Language Processing (NLP)
asteroid - The PyTorch-based audio source separation toolkit for researchers
CapDec - CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
mayavoz - Pytorch based speech enhancement toolkit.
img2dataset - Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
const_layout - Official implementation of the MM'21 paper "Constrained Graphic Layout Generation via Latent Optimization" (LayoutGAN++, CLG-LO, and Layout evaluation)