Is GPT actually using the encoder NOT the decoder part of the transformer?

This page summarizes the projects mentioned and recommended in the original post on /r/MLQuestions.

  • minGPT

    A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

  • In the original paper they mention they are only using the decoder part of the model. However, their description and implementations seem to be using the encoder part of the transformer, not the decoder. For example, this implementation of the original transformer encoder layer matches the one in the GPT implementation.

  • transformer-pytorch

    Transformer: PyTorch Implementation of "Attention Is All You Need"

  • In the original paper they mention they are only using the decoder part of the model. However, their description and implementations seem to be using the encoder part of the transformer, not the decoder. For example, this implementation of the original transformer encoder layer matches the one in the GPT implementation.
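
One likely source of the confusion in the post: a GPT-style block drops the encoder-decoder (cross) attention sublayer, because there is no encoder output to attend to. What remains is self-attention plus a feed-forward layer, which looks structurally like an encoder layer; the difference is the causal mask on self-attention. The sketch below illustrates only that mask. It is a simplified, dependency-free illustration in plain Python and is not taken from minGPT or transformer-pytorch: the `self_attention` function, the identity projections (Q = K = V = x), and the toy inputs are all assumptions made for the example.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x, causal):
    """Single-head self-attention with identity projections (Q = K = V = x).

    x is a list of token vectors. With causal=True, position i may only
    attend to positions 0..i (decoder-style masking). With causal=False,
    every position attends to the whole sequence (encoder-style).
    """
    d = len(x[0])
    out = []
    for i, q in enumerate(x):
        visible = x[: i + 1] if causal else x
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in visible]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, visible)) for j in range(d)])
    return out


# Two toy sequences that differ only in the LAST token.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
altered = tokens[:2] + [[-3.0, 2.0]]

enc_a = self_attention(tokens, causal=False)
enc_b = self_attention(altered, causal=False)
dec_a = self_attention(tokens, causal=True)
dec_b = self_attention(altered, causal=True)

# Encoder-style: changing a later token changes the output at position 0.
encoder_style_leaks = enc_a[0] != enc_b[0]
# Decoder-style: earlier positions never see the later token.
decoder_style_leaks = dec_a[0] != dec_b[0]

print("encoder-style sees future tokens:", encoder_style_leaks)
print("decoder-style sees future tokens:", decoder_style_leaks)
```

Under these assumptions, the same layer code behaves as an "encoder" or a "decoder" depending only on the mask, which would explain why the two implementations look alike when compared layer by layer.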

NOTE: The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.


Related posts

  • [P] Implementation of Transformer with detailed and easy description comments

    1 project | /r/MachineLearning | 2 Mar 2021
  • Lack of activation in transformer feedforward layer?

    2 projects | /r/learnmachinelearning | 20 May 2021
  • Simple Implementation of OpenAI Clip (Tutorial)

    1 project | news.ycombinator.com | 21 Feb 2024
  • ElevenLabs Launches Voice Translation Tool to Break Down Language Barriers

    2 projects | news.ycombinator.com | 10 Oct 2023
  • Open Source Libraries

    25 projects | /r/AudioAI | 2 Oct 2023