Holy Uck Fast Analysis with DuckDB and Pyarrow

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • pqrs

    Command line tool for inspecting Parquet files

    Shameless plug, my project exposes the common Parquet operations using a Rust CLI tool using the Rust API for Apache Arrow and can be used without any Java/Hadoop/Spark dependencies. Also available a static binary.

    https://github.com/manojkarthick/pqrs

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • DuckDB

    DuckDB is an analytical in-process SQL database management system

    We have a Relational API in addition to SQL! Here are some examples for the Python Relational API client:

    https://github.com/duckdb/duckdb/blob/master/examples/python...

    Plus, if you are working in Python, you can use DuckDB as the engine underneath Ibis, Fugue, Siuba, or anything that works with SQLAlchemy (using the DuckDB-engine driver)! In R, you can use dplyr or dbplyr.

    DuckDB's file format is one way to persist data (it uses a single file), but you can also write out to Parquet, or write out to Apache Arrow and then parquet (in a partitioned format I believe).

    Disclaimer - I write docs for DuckDB!

  • raku-Dan

    Top level raku Data ANalysis Module

    Hi @ritchie46 - I have just written [raku Dan](https://github.com/p6steve/raku-Dan) as a way to scratch the "data analytics" itch in a new way -- my next step is to write Dan::Polars as a polars binding via (eg.) raku NativeCall. Can you point me to a good recipe for success? [email protected]

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • How moving from Pandas to Polars made me write better code without writing better code

    2 projects | dev.to | 5 Mar 2024
  • GlareDB: An open source SQL database to query and analyze distributed data

    4 projects | /r/dataengineering | 8 Jun 2023
  • Building Databases over a Weekend

    5 projects | news.ycombinator.com | 20 Nov 2024
  • Toasty, an Async ORM for Rust

    6 projects | news.ycombinator.com | 23 Oct 2024
  • Rama on Clojure's terms, and the magic of continuation-passing style

    6 projects | news.ycombinator.com | 14 Oct 2024

Did you konow that Rust is
the 5th most popular programming language
based on number of metions?