db-benchmark
reproducible benchmark of database-like ops (by duckdblabs)
quivr
Python library for working with Arrow data in tabular form (by B612-Asteroid-Institute)
db-benchmark | quivr | |
---|---|---|
12 | 2 | |
124 | 21 | |
6.5% | - | |
8.0 | 9.1 | |
4 months ago | 6 months ago | |
R | Python | |
Mozilla Public License 2.0 | BSD 3-clause "New" or "Revised" License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
db-benchmark
Posts with mentions or reviews of db-benchmark.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2024-01-08.
- Database-Like Ops Benchmark
-
Polars
DuckDB maintains a benchmark of open source database-like tools, including Polars and Pandas
https://duckdblabs.github.io/db-benchmark/
- Planning a New Benchmarking for Comparing Filter2Groupby for 3,000 Files (100,000 Rows/Files)
- Pandas vs. Julia – cheat sheet and comparison
-
Polars supports SQL statement in Python Plus CLI Verion (Polars.exe 24.4MB)
DuckDB is also a SQL/Python app, refer to this benchmark, seem it run very fast https://duckdblabs.github.io/db-benchmark/
- The Return of the H2o.ai Database-Like Ops Benchmark
- I discovered that the fastest way to create a Pandas DataFrame from a CSV file is to actually use Polars
quivr
Posts with mentions or reviews of quivr.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2024-01-08.
-
Polars
Polars is cool, but man, I really have come to think that dataframes are disastrous for software. The mess of internal state and confusion of writing functions that take “df” and manipulate it - its all so hard to clean up once you’re deep in the mess.
Quivr (https://github.com/spenczar/quivr) is an alternative approach that has been working for me. Maybe types are good!
- Quivr: Beyond Dataframes for Python Data Engineering
What are some alternatives?
When comparing db-benchmark and quivr you can also consider the following projects:
Tidier.jl - Meta-package for data analysis in Julia, modeled after the R tidyverse.
DataFramesMeta.jl - Metaprogramming tools for DataFrames
db-benchmark - reproducible benchmark of database-like ops