Ongoing research training transformer models at scale (by NVIDIA)

Megatron-LM Alternatives

Similar projects and alternatives to Megatron-LM

  • DeepSpeed

    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

  • TensorRT

    NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.

  • ColossalAI

    Making large AI models cheaper, faster and more accessible

  • server

    The Triton Inference Server provides an optimized cloud and edge inferencing solution. (by triton-inference-server)

  • DeepLearningExamples

    State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

  • xla

    Enabling PyTorch on XLA Devices (e.g. Google TPU)

  • ChatGPT-Siri

    Shortcuts for Siri powered by the ChatGPT API (gpt-3.5-turbo and gpt-4 models). Supports continuous conversations, configuring the API key and system prompt, and saving chat records.

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number indicates a better Megatron-LM alternative or a more similar project.

Megatron-LM reviews and mentions

Posts with mentions or reviews of Megatron-LM. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-04-26.


Basic Megatron-LM repo stats
9 days ago

NVIDIA/Megatron-LM is an open-source project licensed under the GNU General Public License v3.0 or later, which is an OSI-approved license.

The primary programming language of Megatron-LM is Python.
