Run 100B+ language models at home, BitTorrent‑style

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

petals

98 8,710 8.3 Python

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

A Petals dev here. We say up front that "Single-batch inference runs at ≈ 1 sec per step (token)".
In turn, "parallel inference" refers to the high-throughput scenario when you generate multiple sequences in parallel. This is useful when you process some large dataset with LLM (e.g. run inference with batch size of 200) or run a beam search with a large beam width. In this case, you can actually get the speed of hundreds of tokens per sec, see our benchmarks for parallel forward passes: https://github.com/bigscience-workshop/petals#benchmarks
If you have another wording in mind that is more up front, please let us know, we'd be happy to improve the project description. Petals is a non-commercial research project, and we don't want to oversell anything.

hivemind

40 1,842 5.4 Python

Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.

I'm not entirely how the approach they're using works [0], but I study federated learning and one of the highly-cited survey papers has several chapters (5 and 6 in particular) addressing potential attacks, failure modes, and bias [1].
0: https://github.com/learning-at-home/hivemind
1: https://arxiv.org/abs/1912.04977

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Would anyone be interested in contributing to some group projects?

4 projects | /r/learnmachinelearning | 24 Aug 2023
Hive mind:Train deep learning models on thousands of volunteers across the world

1 project | news.ycombinator.com | 20 Jun 2023
Could a model not be trained by a decentralized network? Like Seti @ home or kinda-sorta like bitcoin. Petals accomplishes this somewhat, but if raw computer power is the only barrier to open-source I'd be happy to try organizing decentalized computing efforts

2 projects | /r/LocalLLaMA | 17 Jun 2023
Orca (built on llama13b) looks like the new sheriff in town

2 projects | /r/LocalLLaMA | 6 Jun 2023
Do you think that AI research will slow down to a halt because of regulation?

1 project | /r/singularity | 21 May 2023

Run 100B+ language models at home, BitTorrent‑style

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Deep Learning Pytorch volunteer-computing mixture-of-experts distributed-training
Post date: 20 Mar 2023

petals

hivemind

InfluxDB

Related posts

Would anyone be interested in contributing to some group projects?

Hive mind:Train deep learning models on thousands of volunteers across the world

Could a model not be trained by a decentralized network? Like Seti @ home or kinda-sorta like bitcoin. Petals accomplishes this somewhat, but if raw computer power is the only barrier to open-source I'd be happy to try organizing decentalized computing efforts

Orca (built on llama13b) looks like the new sheriff in town

Do you think that AI research will slow down to a halt because of regulation?

Run 100B+ language models at home, BitTorrent‑style

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Deep Learning Pytorch volunteer-computing mixture-of-experts distributed-training Post date: 20 Mar 2023

petals

hivemind

InfluxDB

Related posts

Would anyone be interested in contributing to some group projects?

Hive mind:Train deep learning models on thousands of volunteers across the world

Could a model not be trained by a decentralized network? Like Seti @ home or kinda-sorta like bitcoin. Petals accomplishes this somewhat, but if raw computer power is the only barrier to open-source I'd be happy to try organizing decentalized computing efforts

Orca (built on llama13b) looks like the new sheriff in town

Do you think that AI research will slow down to a halt because of regulation?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Deep Learning Pytorch volunteer-computing mixture-of-experts distributed-training
Post date: 20 Mar 2023