Adept Open Sources 8B Multimodal Modal

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

OpenAdapt

18 419 9.5 Python

AI-First Process Automation with Large [Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

Thank you to the amazing team at Adept.ai for making this available!
For anyone interested in contributing to a fully open source alternative, join us at https://github.com/OpenAdaptAI/OpenAdapt
Lots of interesting work to be done, including integrating with Fuyu-8B!

LLaVA

20 16,101 9.4 Python

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Fuyu is not open source. At best, it is source-available. It's also not the only one.
A few other multimodal models that you can run locally include IDEFICS[0][1], LLaVA[2], and CogVLM[3]. I believe all of these have better licenses than Fuyu.
[0]: https://huggingface.co/blog/idefics
[1]: https://huggingface.co/HuggingFaceM4/idefics-80b-instruct
[2]: https://github.com/haotian-liu/LLaVA
[3]: https://github.com/THUDM/CogVLM

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
CogVLM

16 4,968 9.0 Python

a state-of-the-art-level open visual language model | 多模态预训练模型

Fuyu is not open source. At best, it is source-available. It's also not the only one.
A few other multimodal models that you can run locally include IDEFICS[0][1], LLaVA[2], and CogVLM[3]. I believe all of these have better licenses than Fuyu.
[0]: https://huggingface.co/blog/idefics
[1]: https://huggingface.co/HuggingFaceM4/idefics-80b-instruct
[2]: https://github.com/haotian-liu/LLaVA
[3]: https://github.com/THUDM/CogVLM

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project