-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
We introduce GLAMI-1M: the largest multilingual image-text classification dataset and benchmark. The dataset contains images of fashion products with item descriptions, each in 1 of 13 languages. Categorization into 191 classes has high-quality annotations: all 100k images in the test set and 75% of the 1M training set were human-labeled. The paper presents baselines for image-text classification showing that the dataset presents a challenging fine-grained classification problem: The best scoring EmbraceNet model using both visual and textual features achieves 69.7% accuracy. Experiments with a modified Imagen model show the dataset is also suitable for image generation conditioned on text. The dataset, source code and model checkpoints are published here: https://github.com/glami/glami-1m
Related posts
-
Glami-1M: A Multilingual Image-Text Fashion Dataset
-
[R] Roboflow 100: An open source object detection benchmark of 224,714 labeled images in novel domains to compare model performance
-
Introducing RF100: An open source object detection benchmark of 224,714 labeled images across 100 novel domains to compare model performance
-
We took YOLOv5 and YOLOv7, trained them on 100 datasets, and compared their accuracy! 🔥 The results may surprise you.
-
Voxel51 Is Hiring AI Researchers and Scientists — What the New Open Science Positions Mean