Minimal PyTorch re-implementation of GPT

minGPT

35 18,932 0.0 Python

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
iris

8 756 1.9 Python

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%. (by eloialonso)

This is actually a pretty neat, self-contained implementation that can be extended very easily beyond stereotypical natural language models, for example to create world models for video games [1] or robot models that learn to imitate from large, chaotic human demonstration data [2] (disclaimer: I'm an author on the second one). Basically, GPT (or minGPT) models are EXCELLENT sequence modelers, almost to the point where you can throw any sensible sequence data at them and hope to get interesting results, as long as you don't overfit.
Even though I have only been working on machine learning for around six years, it's crazy to see how quickly the landscape has changed recently, including diffusion models and transformers. It's not too much to say that we might expect more major breakthroughs by the end of this decade, and end up in a place we can't even imagine right now!
[1] https://github.com/eloialonso/iris
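
To make the "throw any sensible sequence data at it" point concrete, here is a minimal sketch of how non-text data might be prepared for a GPT-style model: continuous readings are binned into integer tokens, then sliced into context windows whose targets are the same window shifted by one step. The helper names (`discretize`, `next_token_pairs`), the bin count, and the sample trace are all hypothetical and chosen for illustration; this is not taken from minGPT, iris, or bet.

```python
def discretize(values, low, high, n_bins):
    """Map each float in [low, high] to an integer token in [0, n_bins - 1]."""
    width = (high - low) / n_bins
    tokens = []
    for v in values:
        clipped = min(max(v, low), high)
        # The top edge would land in bin n_bins, so clamp it to the last bin.
        tokens.append(min(int((clipped - low) / width), n_bins - 1))
    return tokens


def next_token_pairs(tokens, block_size):
    """Slide a window over the tokens; each target is the input shifted by one."""
    pairs = []
    for start in range(len(tokens) - block_size):
        chunk = tokens[start:start + block_size + 1]
        pairs.append((chunk[:-1], chunk[1:]))
    return pairs


# Hypothetical sensor trace standing in for arbitrary sequence data.
trace = [0.0, 0.25, 0.5, 0.75, 1.0, 0.75, 0.5]
tokens = discretize(trace, low=0.0, high=1.0, n_bins=4)
pairs = next_token_pairs(tokens, block_size=3)
```

Each `(input, target)` pair could then be fed to any next-token predictor. How well this works depends heavily on the binning and on having enough data to avoid the overfitting mentioned above.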

bet

3 92 2.1 Python

Code and website for Behavior Transformers: Cloning k modes with one stone.
machine-learning-articles

5 3,108 4.1

🧠💬 Articles I wrote about machine learning, archived from MachineCurve.com.

For anyone else who was new to the phrase "isotropic model":
https://github.com/christianversloot/machine-learning-articl...

returnn-experiments

2 152 6.4 Python

experiments with RETURNN

This works for an architecture that has already been well tuned and studied, like an LSTM or a Transformer.
Once you do research on the model and start testing things out, it often tends to become a kwarg monster in many frameworks.
Having everything (relevant) in one file (even in the config file itself, with the hyperparameters) allows you to copy the file for every experiment and modify it in place. This avoids the kwargs mess. But then the config files are very complex and can become messy in other ways (esp. for research projects). Example: https://github.com/rwth-i6/returnn-experiments/blob/master/2...
Such an approach is much more flexible and does not mess with the baseline code. As you say, it's more like an evolutionary, DNA-like approach, where you then tend to do crossovers with other evolved, well-performing configs, etc.
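
The copy-and-modify workflow described above could look roughly like the sketch below. It is a simplified illustration under stated assumptions, not RETURNN's actual config format: the dictionary layout and the `derive` and `crossover` helpers are hypothetical names invented for this example.

```python
import copy

# Hypothetical baseline "config file": hyperparameters and the model layout
# live together in one plain dictionary instead of being spread over kwargs.
baseline = {
    "learning_rate": 1e-3,
    "batch_size": 32,
    "network": {
        "encoder": {"kind": "lstm", "dim": 512, "layers": 4},
        "output": {"kind": "softmax"},
    },
}


def derive(parent, changes):
    """Copy a whole config and edit only the copy; the parent stays untouched.

    `changes` maps a tuple of nested keys to the new value.
    """
    child = copy.deepcopy(parent)
    for path, value in changes.items():
        node = child
        for key in path[:-1]:
            node = node.setdefault(key, {})
        node[path[-1]] = value
    return child


def crossover(config_a, config_b, keys_from_b):
    """Start from config_a and swap in selected top-level entries of config_b."""
    child = copy.deepcopy(config_a)
    for key in keys_from_b:
        child[key] = copy.deepcopy(config_b[key])
    return child


exp_a = derive(baseline, {("network", "encoder", "kind"): "transformer"})
exp_b = derive(baseline, {("learning_rate",): 3e-4})
exp_c = crossover(exp_a, exp_b, ["learning_rate"])
```

Because every experiment owns a full copy, the baseline never changes underneath older runs. The cost, as noted above, is that the copies drift apart and can get messy over time.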

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Shimmy 1.0: Gymnasium & PettingZoo bindings for popular external RL environments

1 project | /r/farama | 25 Apr 2023
Shimmy 1.0: Gymnasium & PettingZoo bindings for popular external RL environments

10 projects | /r/reinforcementlearning | 25 Apr 2023
[D] MuJoCo vs PyBullet? (esp. for custom environment)

2 projects | /r/reinforcementlearning | 7 Dec 2021
How to use MuJoCo from Python3

1 project | /r/reinforcementlearning | 18 Oct 2021
Show HN: Auto Wiki v2 – Turn your codebase into a Wiki now with diagrams

1 project | news.ycombinator.com | 23 Apr 2024

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Machine Learning Deep Learning Transformers neural-networks reinforcement-learning
Post date: 6 Sep 2022
