Mistral AI Launches New 8x22B Moe Model

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

  • The easiest is to use vllm (https://github.com/vllm-project/vllm) to run it on a Couple of A100's, and you can benchmark this using this library (https://github.com/EleutherAI/lm-evaluation-harness)

  • lm-evaluation-harness

    A framework for few-shot evaluation of language models.

  • The easiest is to use vllm (https://github.com/vllm-project/vllm) to run it on a Couple of A100's, and you can benchmark this using this library (https://github.com/EleutherAI/lm-evaluation-harness)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • llamafile

    Distribute and run LLMs with a single file.

  • I think the llamafile[0] system works the best. Binary works on the command line or launches a mini webserver. Llamafile offers builds of Mixtral-8x7B-Instruct, so presumably they may package this one up as well (potentially a quantized format).

    You would have to confirm with someone deeper in the ecosystem, but I think you should be able to run this new model as is against a llamafile?

    [0] https://github.com/Mozilla-Ocho/llamafile

  • I built one using GPT-4[1]. It's not perfect but is working quite well and is now being used by hundreds of users, apart from me, to work on real, non-toy tasks. For example, I used it to build most of a production-ready AWS infrastructure (and accompanying deploy script) with the AWS CDK.

    I want to add Mistral support soon, probably via together.ai or a similar service.

    1 - https://github.com/plandex/plandex-ai

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • [D] github repositories for ai web search agents

    2 projects | /r/MachineLearning | 9 Dec 2023
  • Which LLM framework(s) do you use in production and why?

    5 projects | /r/LangChain | 5 Dec 2023
  • Run and create custom ChatGPT-like bots with OpenChat

    15 projects | news.ycombinator.com | 7 Jun 2023
  • AI leaderboards are no longer useful. It's time to switch to Pareto curves

    1 project | news.ycombinator.com | 30 Apr 2024
  • FLaNK AI Weekly for 29 April 2024

    44 projects | dev.to | 29 Apr 2024