>why did no one come up with this before
So it turns out someone did. Specifically, Google did. This exact same idea has been in flaxformer since at least November 2021.
https://github.com/google/flaxformer/blame/ee62754ebe5a5eeb1...
To save people a click, the docstring says:
> """Softmax function with an additional virtual logit equal to zero.
> For compatibility with some previously trained models.
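For anyone who wants to see what "an additional virtual logit equal to zero" amounts to, here is a minimal pure-Python sketch. It is not the flaxformer code; the function name `softmax_with_zero_logit` is made up for illustration. The only idea taken from the docstring is that a logit of 0 joins the normalization, which adds exp(0) = 1 to the denominator, and its probability is then discarded.

```python
import math


def softmax_with_zero_logit(logits):
    """Softmax over `logits` plus one virtual logit fixed at 0.

    Hypothetical helper, not the flaxformer implementation. The virtual
    entry's probability is dropped, so the result can sum to less than 1.
    """
    # Shift by the max over the real logits and the virtual 0 for stability.
    m = max(max(logits), 0.0)
    exps = [math.exp(x - m) for x in logits]
    # The virtual logit contributes exp(0 - m) to the denominator.
    denom = sum(exps) + math.exp(-m)
    return [e / denom for e in exps]


probs = softmax_with_zero_logit([0.0, 0.0])
print(probs, sum(probs))
```

With two logits of 0 each real entry gets 1/3 and the remaining 1/3 goes to the virtual logit. When every real logit is very negative, the outputs all go toward zero instead of being forced to sum to 1.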
https://github.com/karpathy/nanoGPT/blob/f08abb45bd2285627d1...
At training time, probabilities for the next token are computed for each position, so if we feed in a sequence of n tokens, we basically get n training examples, one for each position, but at inference time, we only compute the next token since we’ve already output the preceding ones.
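A rough sketch of the "n training examples from one sequence" point, in plain Python. The helper `next_token_examples` is hypothetical and is not taken from nanoGPT; it assumes the usual setup where the targets are the inputs shifted by one token, so a chunk of n + 1 tokens gives n input positions.

```python
def next_token_examples(chunk):
    """Split a token chunk into (context, target) pairs, one per position.

    Hypothetical helper for illustration. `chunk` holds n + 1 tokens:
    the first n are inputs and the last n are the shifted targets.
    """
    inputs, targets = chunk[:-1], chunk[1:]
    # Position i sees inputs[0..i] and is trained to predict targets[i].
    return [(inputs[: i + 1], targets[i]) for i in range(len(inputs))]


examples = next_token_examples([10, 20, 30, 40])
for context, target in examples:
    print(context, "->", target)
```

Training scores all of these pairs in one forward pass. At inference only the prediction at the last position is needed, because the earlier tokens have already been emitted.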