How to Build an AI Text Generator: Text Generation with a GPT-2 Model

This page summarizes the projects mentioned and recommended in the original post on dev.to

  • transformers

    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

  • To fine-tune a pre-trained model, we could use the run_language_modeling.py script. All we need are two text files: one containing the text pieces for training, and another containing the text pieces for evaluation.

  • Towards-A-Deep-and-Unified-Understanding-of-Deep-Neural-Models-in-NLP

    Code implementation of paper Towards A Deep and Unified Understanding of Deep Neural Models in NLP

  • In Towards a Deep and Unified Understanding of Deep Neural Models in NLP, the authors propose a way to answer this. They also provide code that we could use to analyze the GPT-2 model.

  • bertviz

    BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

  • Thanks to jessevig’s BertViz tool, we can peek at how GPT-2 works by visualizing the attention values.
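
The fine-tuning setup quoted above needs two plain-text files, one for training and one for evaluation. The following is a minimal sketch of how those files might be prepared and how the script could then be invoked. The helper names (split_corpus, write_pieces, build_command), the 10% evaluation split, and the file names are assumptions made for illustration and are not part of the original post. The command-line flags follow the legacy example script in the transformers repository and may differ between versions.

```python
import random
import shlex
from pathlib import Path


def split_corpus(pieces, eval_fraction=0.1, seed=0):
    """Shuffle text pieces and split them into (train, eval) lists.

    Hypothetical helper: the split ratio and seed are illustrative choices.
    """
    if not 0 < eval_fraction < 1:
        raise ValueError("eval_fraction must be between 0 and 1")
    shuffled = list(pieces)
    random.Random(seed).shuffle(shuffled)
    # Always keep at least one piece for evaluation.
    n_eval = max(1, int(len(shuffled) * eval_fraction))
    return shuffled[n_eval:], shuffled[:n_eval]


def write_pieces(path, pieces):
    """Write one text piece per line to a UTF-8 text file."""
    Path(path).write_text("\n".join(pieces) + "\n", encoding="utf-8")


def build_command(train_file, eval_file, output_dir="output"):
    """Build a shell command for run_language_modeling.py.

    Assumption: flag names match the legacy example script; check the
    version of the script you actually have before running this.
    """
    args = [
        "python", "run_language_modeling.py",
        f"--output_dir={output_dir}",
        "--model_type=gpt2",
        "--model_name_or_path=gpt2",
        "--do_train",
        f"--train_data_file={train_file}",
        "--do_eval",
        f"--eval_data_file={eval_file}",
    ]
    return " ".join(shlex.quote(a) for a in args)


if __name__ == "__main__":
    corpus = [f"Example text piece number {i}." for i in range(20)]
    train, evaluation = split_corpus(corpus)
    write_pieces("train.txt", train)
    write_pieces("eval.txt", evaluation)
    print(build_command("train.txt", "eval.txt"))
```

Running the module writes train.txt and eval.txt to the current directory and prints a command that can be pasted into a shell where the script is available.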

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

