[D] PyTorch Distributed Training Libraries: What are the current options?

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
Sevalla - Deploy and host your apps and databases, now with $50 credit!
Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
sevalla.com
featured
  1. accelerate

    🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

    There's also huggingface accelerate to look at. It seems to require less changes to the codebase than the others (except you're using Lightning anyway). https://github.com/huggingface/accelerate

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. horovod

    Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

    Check out Horovod - https://github.com/horovod/horovod

  4. stoke

    Discontinued A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-precision, and PyTorch extensions. (by fidelity)

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Discussion Thread

    1 project | /r/neoliberal | 10 Feb 2023
  • [D] What is the recommended approach to training NN on big data set?

    2 projects | /r/MachineLearning | 8 Dec 2022
  • Anyone know of any papers or models for segmenting satellite images of a city into things like roads, buildings, parks, etc?

    1 project | /r/MLQuestions | 25 Apr 2022
  • Embedding Python

    1 project | /r/fortran | 21 Feb 2022
  • [D] GPU buying recommendation

    1 project | /r/MachineLearning | 17 Jul 2021

Did you know that Python is
the 2nd most popular programming language
based on number of references?