Twitter (re)Releases Recommendation Algorithm on GitHub

This page summarizes the projects mentioned and recommended in the original post on /r/programming.

  • the-algorithm

    For example, in the commit deleting the author_is_elon feature, I don't see any config files being deleted. It may very well be that the author_is_elon feature was never used for serving production traffic because a config value caused it to be ignored. Maybe they need predicates like this in order to capture metrics. So if someone asks "are we showing more tweets from Democrats than Republicans?", they might need to define author_is_democrat and author_is_republican predicates to measure whether there is a discrepancy, controlling for various other factors. The mere existence of those features does not indicate anything nefarious.
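To make the comment's point concrete, here is a minimal sketch of predicates that feed metrics without influencing ranking. It is not taken from Twitter's code: the `author_party` field, the `CONFIG` dictionary, and both helper functions are hypothetical names chosen for illustration; only the predicate names come from the comment above.

```python
from collections import Counter
from typing import Callable, Dict, Iterable

Tweet = Dict[str, str]
Predicate = Callable[[Tweet], bool]

# Predicate names follow the comment; the "author_party" field is an assumption.
PREDICATES: Dict[str, Predicate] = {
    "author_is_democrat": lambda t: t["author_party"] == "democrat",
    "author_is_republican": lambda t: t["author_party"] == "republican",
}

# Hypothetical config: the list of predicates allowed to influence ranking.
# It is empty here, so the predicates exist in code but only feed metrics.
CONFIG = {"ranking_features": []}


def impression_counts(served: Iterable[Tweet]) -> Counter:
    """Count how many served tweets match each predicate (metrics only)."""
    counts: Counter = Counter()
    for tweet in served:
        for name, predicate in PREDICATES.items():
            if predicate(tweet):
                counts[name] += 1
    return counts


def ranking_features(tweet: Tweet) -> Dict[str, bool]:
    """Return only the predicate values that the config enables for ranking."""
    return {name: PREDICATES[name](tweet) for name in CONFIG["ranking_features"]}
```

Under these assumptions, `impression_counts` can answer the "are we showing more of one group?" question, while `ranking_features` returns an empty dictionary for every tweet, so the presence of a predicate in the code says nothing by itself about whether it affects what is served.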

  • the-algorithm-ml

    Source code for Twitter's Recommendation Algorithm

    I'm kind of surprised at how small https://github.com/twitter/the-algorithm-ml/blob/main/projects/home/recap/config/local_prod.yaml is. I would have expected tens of thousands of lines of config, but as you point out, some of that is spread out across different files.

  • Async Ruby

    An awesome asynchronous event-driven reactor for Ruby. (by socketry)

  • Finagle

    A fault tolerant, protocol-agnostic RPC system

    I don't really see how "enterprise Scala" has anything to do with this. Scala is meant to be parallelized; that's its whole thing, with Akka, actors, and Twitter's Finagle (https://twitter.github.io/finagle/).

  • benchmarks

    Some benchmarks of different languages

NOTE: The number of mentions on this list counts mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.
