Polars vs ndarray performance

This page summarizes the projects mentioned and recommended in the original post on /r/rust

  • linfa

    A Rust machine learning framework.

  • I've been playing with data analytics and ML in Rust for the last couple of weeks. A typical ML job requires transforming some data to feed the model and then training the model. For ML I've been using linfa (https://github.com/rust-ml/linfa), which is surprisingly nice. I've been experimenting with ndarray and polars for data transformation (linfa uses ndarray), mainly from a UX standpoint. I'm pretty surprised by how well polars, which sits on top of arrow2, performs (https://h2oai.github.io/db-benchmark/), and it's definitely a great candidate for OLAP tasks. But I couldn't find any comparison between ndarray and polars. Has anyone had meaningful experience with the two, or can anyone point me to a benchmark comparison?

  • db-benchmark

    reproducible benchmark of database-like ops

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • Linfa: A Rust machine learning framework

    1 project | news.ycombinator.com | 1 Aug 2021
  • AII4DEVS #10: Diverse knowledge is the key to grow the next generation of ML practitioners into AI engineers.

    1 project | dev.to | 4 Jul 2021
  • Linfa release 0.3.0 - Elasticnet, approximated DBSCAN and Gaussian Naive Bayes + improvements

    3 projects | /r/rust | 20 Jan 2021
  • Burn: Deep Learning Framework built using Rust

    1 project | news.ycombinator.com | 24 Apr 2024
  • Burn Deep Learning Framework Release 0.12.0 Improved API and PyTorch Integration

    1 project | news.ycombinator.com | 31 Jan 2024