Will Rust-based data frame library Polars dethrone Pandas? We evaluate on 1M+ Stack Overflow questions

This page summarizes the projects mentioned and recommended in the original post on /r/rust

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • polars

    Dataframes powered by a multithreaded, vectorized query engine, written in Rust

  • Concretely I'm still waiting to be able to use the fix to this issue I reported 20 days ago :). I like your crate that's why I'm bringing up this issue as it is frustrating to see the python version having the fix while I need to use workarounds till the next version is released.

  • db-benchmark

    reproducible benchmark of database-like ops

  • We didn't conduct our own benchmarks for this post, but in this comparison from ~1 year ago, Polars emerged as the fastest https://h2oai.github.io/db-benchmark/

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • crates.io

    The Rust package registry

  • However you cannot publish to crates.io, if any of your dependencies point to github. I don't think its too bad, because you as a rust use can always point to our master, until we issue a new release next month.

  • plotnine

    A Grammar of Graphics for Python

  • The best one I've found is plotnine, which is just a reimplementation of the ggplot API.

  • jj

    A Git-compatible VCS that is both simple and powerful

  • That said, one alternative I've seen that is compatible with Git is JJ which looks interesting. And Pijul may have a chance.

  • explorer

    Series (one-dimensional) and dataframes (two-dimensional) for fast and elegant data exploration in Elixir

  • I've got to make a shameless plug for Explorer, which is a dataframe library for Elixir that builds on `polars`. It's not quite just bindings, as the idea is to have a functional, dplyr-esque API with pluggable backends (e.g. ExplorerSQL, ExplorerBallista). The main/default backend uses Elixir NIFs via Rustler to call polars.

  • VFSForGit

    Virtual File System for Git: Enable Git at Enterprise Scale

  • Well, MS on the other hand created a fork/tool adding VFS support to Git: https://github.com/microsoft/VFSForGit and it seemed to have worked out for them. It is sort of a hack, but just shows that Git has had enough momentum to justify this hack, instead of going with some better suited alternative tools.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • roast

    🦋 Raku test suite

  • For the Raku language, a data analytics module can help us be more useful to data scientist / programmers. Polars is a better option than Pandas. Why?

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts