CatLIP: Clip Vision Accuracy with 2.7x Faster Pre-Training on Web-Scale Data

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

uform

8 885 9.2 Python

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

question: any good on-device size image embedding models?
tried https://github.com/unum-cloud/uform which i do like, especially they also support languages other than English. Any recommendations on other alternatives?

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Multimodal Embeddings for JavaScript, Swift, and Python

1 project | news.ycombinator.com | 25 Apr 2024
Show HN: UForm v2 Featuring Multimodal Matryoshka, Multimodal DPO, and ONNX

1 project | news.ycombinator.com | 28 Mar 2024
UForm v1: Multimodal Chat in 1.5B Parameters

1 project | news.ycombinator.com | 28 Dec 2023
Show HN: UForm v2 – tiny CLIP-like embeddings in 21 languages and Graphcore API

1 project | news.ycombinator.com | 18 Aug 2023
A Simple Version of Grok 1.5/ GPT-4 Vision from scratch, in one PyTorch file

1 project | news.ycombinator.com | 5 May 2024

CatLIP: Clip Vision Accuracy with 2.7x Faster Pre-Training on Web-Scale Data

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
huggingface-transformers Inference language-model language-vision multimodal
Post date: 25 Apr 2024

uform

InfluxDB

Related posts

Multimodal Embeddings for JavaScript, Swift, and Python

Show HN: UForm v2 Featuring Multimodal Matryoshka, Multimodal DPO, and ONNX

UForm v1: Multimodal Chat in 1.5B Parameters

Show HN: UForm v2 – tiny CLIP-like embeddings in 21 languages and Graphcore API

A Simple Version of Grok 1.5/ GPT-4 Vision from scratch, in one PyTorch file

CatLIP: Clip Vision Accuracy with 2.7x Faster Pre-Training on Web-Scale Data

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com huggingface-transformers Inference language-model language-vision multimodal Post date: 25 Apr 2024

uform

InfluxDB

Related posts

Multimodal Embeddings for JavaScript, Swift, and Python

Show HN: UForm v2 Featuring Multimodal Matryoshka, Multimodal DPO, and ONNX

UForm v1: Multimodal Chat in 1.5B Parameters

Show HN: UForm v2 – tiny CLIP-like embeddings in 21 languages and Graphcore API

A Simple Version of Grok 1.5/ GPT-4 Vision from scratch, in one PyTorch file

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
huggingface-transformers Inference language-model language-vision multimodal
Post date: 25 Apr 2024