st-moe-pytorch VS hivemind

Compare st-moe-pytorch vs hivemind and see what are their differences.

st-moe-pytorch

Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch (by lucidrains)
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
st-moe-pytorch hivemind
1 40
231 1,845
- 1.7%
7.8 5.4
3 months ago 5 days ago
Python Python
MIT License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

st-moe-pytorch

Posts with mentions or reviews of st-moe-pytorch. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-05.

hivemind

Posts with mentions or reviews of hivemind. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-07.

What are some alternatives?

When comparing st-moe-pytorch and hivemind you can also consider the following projects:

OpenMoE - A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

replika-research - Replika.ai Research Papers, Posters, Slides & Datasets

spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python

alpa - Training and serving large-scale neural networks with auto parallelization.

Super-SloMo - PyTorch implementation of Super SloMo by Jiang et al.

GLM-130B - GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

mesh-transformer-jax - Model parallel transformers in JAX and Haiku

HiveMind-core - Join the OVOS collective, utils for OpenVoiceOS mesh networking

FedML - FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.

petals - 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Open-Assistant - OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

mixture-of-experts - PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538