Our great sponsors
-
tensor2tensor
Discontinued Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
It's an interesting question. The original and official code used Post-LN. But then, after uploading the preprint, they changed it to Pre-LN via this PR in Aug 2017: https://github.com/tensorflow/tensor2tensor/commit/f5c9b17e617ea9179b7d84d36b1e8162cb369f25
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
- Understand how transformers work by demystifying all the math behind them
- Why the Original Transformer LLM Figure Is Wrong, and Other Interesting Tidbits
- What Are Transformer Models and How Do They Work?
- [P] Why I quit my lucrative job at Google to start Vectara? (neural search as a service for developers everywhere).
- Alias-Free GAN