Replacing Pandas with Polars. A Practical Guide

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  1. normconf2022

  2. Judoscale

    Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.

  3. Pandas

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

    > The big thing pandas has going for it is that it's already been through this field testing. All the bugs have been ironed out by the hundreds of thousands of users.

    At this very moment, the pandas GitHub repo has 1563 open issues labeled with a bug tag [0]. So much for "all the bugs have been ironed out".

    [0] https://github.com/pandas-dev/pandas/issues?q=is%3Aopen+is%3...

  4. fugue

    A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

    Fugue is an interesting library in this space, though I haven’t tried it.

    https://github.com/fugue-project/fugue


  5. polars

    Dataframes powered by a multithreaded, vectorized query engine, written in Rust

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.


Related posts

  • Data Visualisation Basics

    3 projects | dev.to | 6 Sep 2024
  • 7 Python Excel Libraries: In-Depth Review for Developers

    3 projects | dev.to | 18 Jul 2024
  • Stuff I Learned during Hanukkah of Data 2023

    5 projects | dev.to | 18 Dec 2023
  • Read files from s3 using Pandas/s3fs or AWS Data Wrangler?

    3 projects | /r/dataengineering | 6 Dec 2023
  • The Distributed Tensor Algebra Compiler (2022)

    4 projects | news.ycombinator.com | 15 Jun 2023