Markov Chains Are the Original Language Models

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • markov-basics

    Basic demonstration of Markov chains

  • It's true that Markov chains are very limited in their capabilities. But one thing I love about them is that they are one of the simplest and intuitive ways to write code that *learns* from input data.

    If you're never written something the *learns*, try it out! Here's a very primitive one I wrote recently to explain the basic idea and explains it along the way.

    https://github.com/unoti/markov-basics/blob/main/markov-basi...

  • llama

    Inference code for Llama models

  • Predicting subsequent text is pretty much exactly what they do. Lots of very cool engineering that’s a real feat, but at its core it’s argmax(P(token|token,corpus)):

    https://github.com/facebookresearch/llama/blob/main/llama/ge...

    The engineering feats are up there with anything, but it’s a next token predictor.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts