CLIP
CLIP (Contrastive Language-Image Pretraining): predicts the most relevant text snippet for a given image.
You can give OpenAI's CLIP a shot: https://github.com/openai/CLIP. It's capable of doing zero-shot classification. Here is a neat example of CLIP usage: https://github.com/haltakov/natural-language-image-search.
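As a rough sketch of what zero-shot classification with CLIP can look like, the snippet below follows the usage pattern from the openai/CLIP README. The function names `classify_image` and `best_label`, the default model name "ViT-B/32", and the example prompts are illustrative assumptions, not something from the original comment. It assumes `torch`, `Pillow`, and the `clip` package from that repository are installed.

```python
def best_label(labels, probs):
    """Return the (label, probability) pair with the highest probability."""
    idx = max(range(len(labels)), key=lambda i: probs[i])
    return labels[idx], probs[idx]


def classify_image(image_path, labels, model_name="ViT-B/32"):
    """Score one image against free-text labels with CLIP (hypothetical helper)."""
    # Heavy dependencies are imported here so best_label stays usable without them.
    import torch
    import clip  # installed from https://github.com/openai/CLIP
    from PIL import Image

    device = "cuda" if torch.cuda.is_available() else "cpu"
    model, preprocess = clip.load(model_name, device=device)

    # Preprocess the image and tokenize the candidate labels.
    image = preprocess(Image.open(image_path)).unsqueeze(0).to(device)
    text = clip.tokenize(labels).to(device)

    with torch.no_grad():
        # logits_per_image has shape (1, len(labels)): image-to-text similarity.
        logits_per_image, _ = model(image, text)
        probs = logits_per_image.softmax(dim=-1)[0].tolist()

    return best_label(labels, probs)
```

Something like `classify_image("frame.jpg", ["a photo of a cat", "a photo of an empty room"])` would then return the better-matching prompt and its probability. The file name and prompts are placeholders, and the wording of the prompts can noticeably change the result.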
Alternatively, you can train your own classification model. I would start with a small-ish pre-trained ResNet and work from there. Here is a good tutorial on how to do that: https://github.com/fastai/fastbook/blob/master/05_pet_breeds.ipynb.
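A minimal sketch of that second route, assuming fastai and torchvision are installed and your images are sorted into one subfolder per class (for example `data/cat` and `data/not_cat`). The folder layout, the helper names `class_folders` and `train_classifier`, and the choice of `resnet18`, 224-pixel resizing, and a 20% validation split are all assumptions made for illustration; the linked tutorial uses a different dataset and labelling scheme.

```python
from pathlib import Path


def class_folders(data_dir):
    """List class names, taken from the subfolder names of data_dir."""
    root = Path(data_dir)
    if not root.is_dir():
        return []
    return sorted(p.name for p in root.iterdir() if p.is_dir())


def train_classifier(data_dir, epochs=3):
    """Fine-tune a pre-trained ResNet-18 on a folder-per-class dataset."""
    # Imported lazily so class_folders works even without fastai installed.
    from fastai.vision.all import ImageDataLoaders, Resize, vision_learner, error_rate
    from torchvision.models import resnet18

    # Hold out 20% of the images for validation; resize every image to 224 px.
    dls = ImageDataLoaders.from_folder(
        data_dir, valid_pct=0.2, seed=42, item_tfms=Resize(224)
    )

    # vision_learner is called cnn_learner in older fastai releases.
    learn = vision_learner(dls, resnet18, metrics=error_rate)
    learn.fine_tune(epochs)
    return learn
```

After training, `learn.predict("frame.jpg")` returns the predicted class along with the per-class probabilities, which is enough to decide whether the cat is in the frame.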