Adept Open Sources 8B Multimodal Model

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  • OpenAdapt

    AI-First Process Automation with Large [Language (LLMs) / Action (LAMs) / Multimodal (LMMs) / Visual Language (VLMs)] Models

  • Thank you to the amazing team at Adept.ai for making this available!

    For anyone interested in contributing to a fully open source alternative, join us at https://github.com/OpenAdaptAI/OpenAdapt

    Lots of interesting work to be done, including integrating with Fuyu-8B!

  • LLaVA

    [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

  • Fuyu is not open source. At best, it is source-available. It's also not the only one.

    A few other multimodal models that you can run locally include IDEFICS[0][1], LLaVA[2], and CogVLM[3]. I believe all of these have better licenses than Fuyu.

    [0]: https://huggingface.co/blog/idefics

    [1]: https://huggingface.co/HuggingFaceM4/idefics-80b-instruct

    [2]: https://github.com/haotian-liu/LLaVA

    [3]: https://github.com/THUDM/CogVLM

  • CogVLM

    A state-of-the-art open visual language model | Multimodal pre-trained model

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.
