-
BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
I found this resource [0] handy for getting a grasp on all the different terms people use (zero/one-shot, tree of thoughts, RAG, etc.). It's not super detailed, but it was enough for me (a professional developer) to get started on some side projects with Mistral.
[0] https://www.promptingguide.ai/
It is also fairly hard to quantify (I thought about some naive approaches to doing that during our work on BIG-bench [1], but I couldn't come up with anything robust enough), so I don't think we will even be able to say we are past this peak until much later.
[1] https://github.com/google/BIG-bench/issues/801