Local and Global loss

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

mixture-of-experts

2 835 5.3 Python

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

I have a requirement of training pipeline similar to Mixture of Experts (https://github.com/davidmrau/mixture-of-experts/blob/master/moe.py) but I want to train the Experts on a local loss for 1 epoch before predicting outputs from them (which would then be concatenated for the global loss of MoE). Can anyone suggest what’s the best way to set up this training pipeline?

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

[Rumor] Potential GPT-4 architecture description

2 projects | /r/LocalLLaMA | 20 Jun 2023
Would anyone be interested in contributing to some group projects?

4 projects | /r/learnmachinelearning | 24 Aug 2023
Hive mind:Train deep learning models on thousands of volunteers across the world

1 project | news.ycombinator.com | 20 Jun 2023
Could a model not be trained by a decentralized network? Like Seti @ home or kinda-sorta like bitcoin. Petals accomplishes this somewhat, but if raw computer power is the only barrier to open-source I'd be happy to try organizing decentalized computing efforts

2 projects | /r/LocalLLaMA | 17 Jun 2023
Orca (built on llama13b) looks like the new sheriff in town

2 projects | /r/LocalLLaMA | 6 Jun 2023

This page summarizes the projects mentioned and recommended in the original post on /r/pytorch
moe mixture-of-experts sparsely-gated-mixture-of-experts Pytorch re-implementation
Post date: 4 Mar 2021

mixture-of-experts

InfluxDB

Related posts

[Rumor] Potential GPT-4 architecture description

Would anyone be interested in contributing to some group projects?

Hive mind:Train deep learning models on thousands of volunteers across the world

Could a model not be trained by a decentralized network? Like Seti @ home or kinda-sorta like bitcoin. Petals accomplishes this somewhat, but if raw computer power is the only barrier to open-source I'd be happy to try organizing decentalized computing efforts

Orca (built on llama13b) looks like the new sheriff in town