Improving Search Quality for Non-English Queries with Fine-tuned Multilingual CLIP Models

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

vision_transformer

7 9,318 5.5 Jupyter Notebook

We’re going to look at a model that Open AI has trained with a broad multilingual dataset: The xlm-roberta-base-ViT-B-32 CLIP model, which uses the ViT-B/32image encoder, and the XLM-RoBERTa multilingual language model. Both of these are pre-trained:

ImageNet21K

1 695 10.0 Python

Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper

ViT-B/32, using the ImageNet-21k dataset

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Fashion12K_german_queries

1 3 0.0 Python

We have collaborated with Toloka to curate a 12,000 item dataset of fashion images drawn from e-commerce websites, to which human annotators have added descriptive captions in German. Toloka has made the data available to the public on GitHub, but you can also download it from Jina directly in DocArray format by following the instructions in the next section.

fashion-200k

1 60 10.0

Fashion 200K dataset used in paper "Automatic Spatially-aware Fashion Concept Discovery."

The images are a subset of the xthan/fashion-200k dataset, and we have commissioned their human annotations via Toloka’s crowdsourcing platform. Annotations were made in two steps. First, Toloka passed the 12,000 images to annotators in their large international user community, who added descriptive captions.

docarray

32 2,768 8.6 Python

Represent, send, store and search multimodal data

The German Fashion12k dataset is available for free use by the Jina AI community. After logging into Jina AI Cloud, you can download it directly in DocArrayformat:

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

DocArray – Represent, send, and store multimodal data for ML

1 project | news.ycombinator.com | 27 Apr 2023
Some questions about multimodal data.

1 project | /r/learnprogramming | 22 Mar 2023
Trying to create an AI recommender system that’s also ad-free video streaming.

1 project | /r/opensource | 21 Mar 2023
do you know any systems that can handle multimodal data fusion and representation learning?

1 project | /r/opensource | 20 Mar 2023
Want to Search Inside Videos Like a Pro? CLIP-as-service Can Help

1 project | dev.to | 9 Feb 2023

Improving Search Quality for Non-English Queries with Fine-tuned Multilingual CLIP Models

This page summarizes the projects mentioned and recommended in the original post on dev.to
docarray Data structures multimodal cross-modal neural-search
Post date: 22 Dec 2022

vision_transformer

ImageNet21K

InfluxDB

Fashion12K_german_queries

fashion-200k

docarray

SaaSHub

Related posts

DocArray – Represent, send, and store multimodal data for ML

Some questions about multimodal data.

Trying to create an AI recommender system that’s also ad-free video streaming.

do you know any systems that can handle multimodal data fusion and representation learning?

Want to Search Inside Videos Like a Pro? CLIP-as-service Can Help

Improving Search Quality for Non-English Queries with Fine-tuned Multilingual CLIP Models

This page summarizes the projects mentioned and recommended in the original post on dev.to docarray Data structures multimodal cross-modal neural-search Post date: 22 Dec 2022

vision_transformer

ImageNet21K

InfluxDB

Fashion12K_german_queries

fashion-200k

docarray

SaaSHub

Related posts

DocArray – Represent, send, and store multimodal data for ML

Some questions about multimodal data.

Trying to create an AI recommender system that’s also ad-free video streaming.

do you know any systems that can handle multimodal data fusion and representation learning?

Want to Search Inside Videos Like a Pro? CLIP-as-service Can Help

This page summarizes the projects mentioned and recommended in the original post on dev.to
docarray Data structures multimodal cross-modal neural-search
Post date: 22 Dec 2022