[D] Is a GPT-J successor in the works?

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

  • YaLM-100B

    Pretrained language model with 100B parameters

  • There have been a few open-source GPT-3-style large language models since GPT-J: ~175B: BLOOM from Hugging Face, ~100B: YaLM from Yandex, and ~20B: GPT-NeoX. None of them match GPT-3's performance, but since they're open source (for commercial use too), they're worth checking out. I'm not sure whether Stability has plans to train a GPT-3-size model, though.

  • gpt-neox

    An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.


