Mistral AI Launches New 8x22B Moe Model

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

vllm

31 18,571 9.9 Python

A high-throughput and memory-efficient inference and serving engine for LLMs

The easiest is to use vllm (https://github.com/vllm-project/vllm) to run it on a Couple of A100's, and you can benchmark this using this library (https://github.com/EleutherAI/lm-evaluation-harness)

lm-evaluation-harness

34 5,070 9.9 Python

A framework for few-shot evaluation of language models.

The easiest is to use vllm (https://github.com/vllm-project/vllm) to run it on a Couple of A100's, and you can benchmark this using this library (https://github.com/EleutherAI/lm-evaluation-harness)

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
llamafile

34 14,839 9.6 C++

Distribute and run LLMs with a single file.

I think the llamafile[0] system works the best. Binary works on the command line or launches a mini webserver. Llamafile offers builds of Mixtral-8x7B-Instruct, so presumably they may package this one up as well (potentially a quantized format).
You would have to confirm with someone deeper in the ecosystem, but I think you should be able to run this new model as is against a llamafile?
[0] https://github.com/Mozilla-Ocho/llamafile

plandex-ai

1 - -

I built one using GPT-4[1]. It's not perfect but is working quite well and is now being used by hundreds of users, apart from me, to work on real, non-toy tasks. For example, I used it to build most of a production-ready AWS infrastructure (and accompanying deploy script) with the AWS CDK.
I want to add Mistral support soon, probably via together.ai or a similar service.
1 - https://github.com/plandex/plandex-ai

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

[D] github repositories for ai web search agents

2 projects | /r/MachineLearning | 9 Dec 2023
Which LLM framework(s) do you use in production and why?

5 projects | /r/LangChain | 5 Dec 2023
Run and create custom ChatGPT-like bots with OpenChat

15 projects | news.ycombinator.com | 7 Jun 2023
AI leaderboards are no longer useful. It's time to switch to Pareto curves

1 project | news.ycombinator.com | 30 Apr 2024
FLaNK AI Weekly for 29 April 2024

44 projects | dev.to | 29 Apr 2024

Mistral AI Launches New 8x22B Moe Model

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Transformer Gpt evaluation-framework llm language-model
Post date: 9 Apr 2024

vllm

lm-evaluation-harness

InfluxDB

llamafile

plandex-ai

Related posts

[D] github repositories for ai web search agents

Which LLM framework(s) do you use in production and why?

Run and create custom ChatGPT-like bots with OpenChat

AI leaderboards are no longer useful. It's time to switch to Pareto curves

FLaNK AI Weekly for 29 April 2024

Mistral AI Launches New 8x22B Moe Model

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Transformer Gpt evaluation-framework llm language-model Post date: 9 Apr 2024

vllm

lm-evaluation-harness

InfluxDB

llamafile

plandex-ai

Related posts

[D] github repositories for ai web search agents

Which LLM framework(s) do you use in production and why?

Run and create custom ChatGPT-like bots with OpenChat

AI leaderboards are no longer useful. It's time to switch to Pareto curves

FLaNK AI Weekly for 29 April 2024

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Transformer Gpt evaluation-framework llm language-model
Post date: 9 Apr 2024