petals vs DeepSpeed-MII

petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading (by bigscience-workshop)

Source Code

petals.dev

Suggest alternative

Edit details

DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed. (by microsoft)

Deep Learning Inference Pytorch

Source Code

Suggest alternative

Edit details

Scout Monitoring - Free Django app performance insights with Scout Monitoring

Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

www.scoutapm.com

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

petals		DeepSpeed-MII
	Project
99	Mentions	6
8,819	Stars	1,713
1.8%	Growth	3.6%
7.9	Activity	8.5
12 days ago	Latest Commit	8 days ago
Python	Language	Python
MIT License	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

petals

Posts with mentions or reviews of petals. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-02-26.

Chameleon: Meta's New Multi-Modal LLM
1 project | news.ycombinator.com | 21 May 2024

Things like [petals](https://github.com/bigscience-workshop/petals) exist, distributed computing over willing participants. Right now corporate cash is being rammed into the space so why not snap it up while you can, but the moment it dries up projects like petals will see more of the love they deserve.
I envision a future where crypto-style booms happen over tokens useful for purchasing priority computational time, which is earned by providing said computational time. This way researchers can daisy-chain their independent smaller rigs together into something with gargantuan capabilities.
Mistral Large
4 projects | news.ycombinator.com | 26 Feb 2024

So how long until we can do an open source Mistral Large?
We could make a start on Petals or some other open source distributed training network cluster possibly?
[0] https://petals.dev/
Distributed Inference and Fine-Tuning of Large Language Models over the Internet
2 projects | news.ycombinator.com | 2 Jan 2024

Can check out their project at https://github.com/bigscience-workshop/petals
Make no mistake—AI is owned by Big Tech
2 projects | /r/transhumanism | 7 Dec 2023
Would you donate computation and storage to help build an open source LLM?
1 project | /r/ArtificialInteligence | 4 Dec 2023
Run 70B LLM Inference on a Single 4GB GPU with This New Technique
3 projects | news.ycombinator.com | 3 Dec 2023

There is already an implementation along the same line using the torrent architecture.
https://petals.dev/
Run LLMs in bittorrent style
2 projects | /r/opensource | 20 Nov 2023

Check it out at Petals.dev. Chatbot
Is distributed computing dying, or just fading into the background?
1 project | news.ycombinator.com | 18 Nov 2023
Ask HN: Are there any projects currently exploring distributed AI training?
1 project | news.ycombinator.com | 2 Nov 2023

https://github.com/bigscience-workshop/petals
Mistral 7B,The complete Guide of the Best 7B model
3 projects | news.ycombinator.com | 31 Oct 2023

https://github.com/bigscience-workshop/petals
Inference only: https://lite.koboldai.net/

DeepSpeed-MII

Posts with mentions or reviews of DeepSpeed-MII. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-12-22.

Stable Diffusion plus DeepSpeed
1 project | /r/StableDiffusion | 12 Apr 2023
[D] When chatGPT stops being free: Run SOTA LLM in cloud
4 projects | /r/MachineLearning | 22 Dec 2022

Microsoft/DeepSpeed-MII for an up 40x reduction on inference cost on Azure, this thing also supports int8 and fp16 bloom out of the box, but it fails on Azure due to instance size.
Image Creation Time for each GPU.
4 projects | /r/StableDiffusion | 7 Nov 2022
Anyone tried DeepSpeed-MII with stablediffusion?
2 projects | /r/StableDiffusion | 22 Oct 2022

Haven't tried it yet but they have some example code here: https://github.com/microsoft/DeepSpeed-MII/blob/main/examples/local/txt2img-example.py
[P] Pure C/C++ port of OpenAI's Whisper
10 projects | /r/MachineLearning | 10 Oct 2022

What are some alternatives?

When comparing petals and DeepSpeed-MII you can also consider the following projects:

text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

whisper.cpp - Port of OpenAI's Whisper model in C/C++

llama - Inference code for Llama models

xformers - Hackable and optimized Transformers building blocks, supporting a composable construction.

alpaca-lora - Instruct-tune LLaMA on consumer hardware

AITemplate - AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

GLM-130B - GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

whisper-rs - Rust bindings to https://github.com/ggerganov/whisper.cpp

Auto-GPT - An experimental open-source attempt to make GPT-4 fully autonomous. [Moved to: https://github.com/Significant-Gravitas/Auto-GPT]

XNNPACK - High-efficiency floating-point neural network inference operators for mobile, server, and Web

Open-Assistant - OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

rocm-gfx803

petals vs text-generation-webui DeepSpeed-MII vs whisper.cpp petals vs llama DeepSpeed-MII vs xformers petals vs alpaca-lora DeepSpeed-MII vs AITemplate petals vs GLM-130B DeepSpeed-MII vs whisper-rs petals vs Auto-GPT DeepSpeed-MII vs XNNPACK petals vs Open-Assistant DeepSpeed-MII vs rocm-gfx803

Scout Monitoring - Free Django app performance insights with Scout Monitoring

Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

www.scoutapm.com

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

Compare petals vs DeepSpeed-MII and see what are their differences.

petals

DeepSpeed-MII

petals

DeepSpeed-MII

What are some alternatives?