-
uform
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
question: any good on-device size image embedding models?
tried https://github.com/unum-cloud/uform which i do like, especially they also support languages other than English. Any recommendations on other alternatives?
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
-
Multimodal Embeddings for JavaScript, Swift, and Python
-
Show HN: UForm v2 Featuring Multimodal Matryoshka, Multimodal DPO, and ONNX
-
UForm v1: Multimodal Chat in 1.5B Parameters
-
Show HN: UForm v2 – tiny CLIP-like embeddings in 21 languages and Graphcore API
-
A Simple Version of Grok 1.5/ GPT-4 Vision from scratch, in one PyTorch file