|14 days ago||about 1 month ago|
|Mozilla Public License 2.0||GNU General Public License v3.0 or later|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Does Rust's performance advantage over python extend to numpy/pandas?
1 project | reddit.com/r/bioinformatics | 4 Jan 2022
It has some impressive benchmarks https://h2oai.github.io/db-benchmark/
Is Data Science 90% boring and 10% mega-interesting?
1 project | reddit.com/r/datascience | 28 Dec 2021
Programming languages for data roles Big Tech [OC]
1 project | reddit.com/r/learnmachinelearning | 22 Dec 2021
It’s not just speed but also memory usage and parallelization potential. Look how many times pandas is out of memory https://h2oai.github.io/db-benchmark/
The polars dataframe library now also exposes bindings to NodeJS
2 projects | reddit.com/r/node | 22 Dec 2021
Polars is a blazingly fast DataFrame library. It's written in Rust and until now exposed bindings only in Python. It's one of the best-performing solutions in H2oAi's db-benchmark.
Polars, lightning-fast DataFrame library
1 project | reddit.com/r/Python | 17 Dec 2021
Polars: Lightning-fast DataFrame library for Rust and Python
13 projects | news.ycombinator.com | 16 Dec 2021
Hmmm... in the linked benchmarks, DataFrames.jl (Julia library) appears to be fairly competitive.
13 projects | news.ycombinator.com | 16 Dec 2021
Rust and what it needs to gain space in computation-oriented applications
7 projects | reddit.com/r/rust | 24 Nov 2021
You should check out polars, datafusion, influxdb iox and databend, all written in native Rust and powered by the Apache Arrow format. Polars in particular is pretty damn fast and has bindings for Python.
Database-Like Ops Benchmark
1 project | news.ycombinator.com | 20 Nov 2021
A better dtypes for pandas dataframes pulled from Postgres
1 project | reddit.com/r/datascience | 14 Nov 2021
Here is a good comparison: https://h2oai.github.io/db-benchmark/
Made a Programing language using python
4 projects | reddit.com/r/Python | 29 Nov 2021
There's also lark, which is used by a plethora of projects (I haven't used it, but I heard about PreQL on a podcast where they talk for a bit about what it's like to develop a new language in lark)
A primer on programming languages for data science
2 projects | reddit.com/r/datascience | 17 Oct 2021
Just want to mention preql exists as an option - https://github.com/erezsh/Preql
Ask HN: SQL tooling: REPL-likes, Intellisense, etc.
1 project | news.ycombinator.com | 11 Jul 2021
8 projects | news.ycombinator.com | 10 Jul 2021
I share the author's point of view, which led me to start a new relational programming language that compiles to SQL. If that sounds interesting, you can find it here: https://github.com/erezsh/Preql
Preql: A relational language that compiles to SQL
1 project | reddit.com/r/SQL | 25 Mar 2021
Hi everyone, I'm happy to introduce Preql.
2 projects | reddit.com/r/SQL | 24 Mar 2021
If you are a SQL veteran who isn't afraid of recursive queries, check out the tree example. I think it really highlights how really complicated code can become much more manageable: https://github.com/erezsh/Preql/blob/master/examples/tree.pql
2 projects | reddit.com/r/SQL | 24 Mar 2021
Show HN: Preql – a database query language that compiles to SQL
1 project | news.ycombinator.com | 13 Mar 2021
Preql: a new relational programming language that compiles to SQL
1 project | news.ycombinator.com | 7 Jan 2021
Preql – a new relational programming language, that compiles to SQL
1 project | news.ycombinator.com | 7 Jan 2021
What are some alternatives?
arrow-datafusion - Apache Arrow DataFusion and Ballista query engines
polars - Fast multi-threaded DataFrame library in Rust | Python | Node.js
PyPika - PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.
sktime - A unified framework for machine learning with time series
DataFramesMeta.jl - Metaprogramming tools for DataFrames
Apache Arrow - Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
csvs-to-sqlite - Convert CSV files into a SQLite database
julia - The Julia Programming Language
databend - An elastic and reliable Serverless Data Warehouse, offers Blazing Fast Query and combines Elasticity, Simplicity, Low cost of the Cloud, built to make the Data Cloud easy
arrow2 - Unofficial transmute-free Rust library to work with the Arrow format
prosto - Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
rel8 - Hey! Hey! Can u rel8?