ImageNet21K
Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper (by Alibaba-MIIL)
fashion-200k
Fashion 200K dataset used in paper "Automatic Spatially-aware Fashion Concept Discovery." (by xthan)
ImageNet21K | fashion-200k | |
---|---|---|
1 | 1 | |
695 | 60 | |
2.9% | - | |
10.0 | 10.0 | |
over 1 year ago | about 2 years ago | |
Python | ||
MIT License | Apache License 2.0 |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ImageNet21K
Posts with mentions or reviews of ImageNet21K.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-12-22.
-
Improving Search Quality for Non-English Queries with Fine-tuned Multilingual CLIP Models
ViT-B/32, using the ImageNet-21k dataset
fashion-200k
Posts with mentions or reviews of fashion-200k.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-12-22.
-
Improving Search Quality for Non-English Queries with Fine-tuned Multilingual CLIP Models
The images are a subset of the xthan/fashion-200k dataset, and we have commissioned their human annotations via Toloka’s crowdsourcing platform. Annotations were made in two steps. First, Toloka passed the 12,000 images to annotators in their large international user community, who added descriptive captions.
What are some alternatives?
When comparing ImageNet21K and fashion-200k you can also consider the following projects:
vision_transformer
OFA - Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
LMOps - General technology for enabling AI capabilities w/ LLMs and MLLMs
Fashion12K_german_queries
mPLUG-Owl - mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
docarray - Represent, send, store and search multimodal data