Our great sponsors
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
For those who are interested, I have a Flax implementation of MADGRAD in flaxOptimizers (here). The optimizer solid and a refreshing departure from Adam-derived optimizers. One big caveat, however, is that you will need to tune your hyperparameters as they are likely to be orders of magnitude different from Adam's value.
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
- Gemma.cpp: lightweight, standalone C++ inference engine for Gemma models
- I would say to Unity developers, don't use Godot.
- Applying All Recent Innovations To Train a Code Model
- How To Fine-Tune LLaMA, OpenLLaMA, And XGen, With JAX On A GPU Or A TPU
- Open-sourced LLMs are adept at mimicking ChatGPT’s style but not its factuality. There exists a substantial capabilities gap, which requires better base LM.