mmf
multimodal
mmf | multimodal | |
---|---|---|
2 | 3 | |
5,417 | 1,301 | |
0.1% | 3.4% | |
5.5 | 8.0 | |
2 months ago | 5 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
mmf
-
Context in first comment
mmf, which is a multimodal pytorch framework by facebook research, was released around 2-3 years ago and is now poorly maintained.
-
[N] TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
How is this different from mmf? https://github.com/facebookresearch/mmf
multimodal
-
[D]Are there any good solutions for multimodal classification? Libraries, AutoML tool?
There was a PyTorch MultiModal repo released just recently: https://github.com/facebookresearch/multimodal
- [N] TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
- TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
What are some alternatives?
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
label-studio - Label Studio is a multi-type data labeling and annotation tool with standardized output format
smgeo - Geolocation Inference for Reddit
Multimodal-Toolkit - Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
asteroid - The PyTorch-based audio source separation toolkit for researchers
PyTorch-NLP - Basic Utilities for PyTorch Natural Language Processing (NLP)
CapDec - CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
mayavoz - Pytorch based speech enhancement toolkit.
img2dataset - Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
const_layout - Official implementation of the MM'21 paper "Constrained Graphic Layout Generation via Latent Optimization" (LayoutGAN++, CLG-LO, and Layout evaluation)