[D] 7 years since Norm Matloff's blog post "STATISTICS: LOSING GROUND TO CS, LOSING IMAGE AMONG STUDENTS". How has the statistics vs CS situation evolved?

This page summarizes the projects mentioned and recommended in the original post on /r/statistics

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • codebraid

    Live code in Pandoc Markdown

  • Fortunately, literate programming is a thing, so there are still tools for that. If you want PDF/HTML output facilities, it looks like CodeBraid is the way to go. It uses the Pandoc framework, so you can do all kinds of neat things with it.

  • bert

    TensorFlow code and pre-trained models for BERT

  • IMO the biggest strength (there are many) that machine learning has over stats is "pretraining", which is basically training a model on one task, then using it in other tasks. Google spends $10K - 100K training BERT on an external knowledge base (usually gigabytes of text data), then freely puts it up for download. You can then "fine tune" BERT on your own dataset, which is more accurate and much cheaper/faster and less data intensive than it would be otherwise.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts