polars
databend
 | polars | databend
---|---|---
Mentions | 144 | 32
Stars | 26,043 | 7,184
Growth | 6.1% | 2.5%
Activity | 10.0 | 10.0
Last commit | 5 days ago | 3 days ago
Language | Rust | Rust
License | MIT License | GNU General Public License v3.0 or later
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
polars
-
Why Python's Integer Division Floors (2010)
This is because 0.1 is in actuality the floating point value 0.1000000000000000055511151231257827021181583404541015625, and thus 1 divided by it is ever so slightly smaller than 10. Nevertheless, fpround(1 / fpround(1 / 10)) = 10 exactly.
I found out about this recently because in Polars I defined a // b for floats to be (a / b).floor(), which does return 10 for this computation. Since Python's correctly-rounded division is rather expensive, I chose to stick to this (more context: https://github.com/pola-rs/polars/issues/14596#issuecomment-...).
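The difference described above can be reproduced in plain Python. This is only a sketch of the two definitions of float floor division, not Polars code: `math.floor(a / b)` stands in for the `(a / b).floor()` definition quoted in the post, and the variable names are illustrative.

```python
import math
from decimal import Decimal

# The double nearest to 0.1 is slightly larger than one tenth.
exact_tenth = Decimal(0.1)

# Python's // floors the exact quotient, which is just below 10.
python_floor_div = 1 // 0.1

# Dividing first rounds the quotient to 10.0, so flooring afterwards keeps 10.
floor_of_rounded = math.floor(1 / 0.1)

print(exact_tenth)
print(python_floor_div)
print(floor_of_rounded)
```

The second form costs one division and one floor, which matches the performance trade-off the post mentions.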
-
Polars
https://github.com/pola-rs/polars/releases/tag/py-0.19.0
-
Stuff I Learned during Hanukkah of Data 2023
That turned out to be related to pola-rs/polars#11912, and this linked comment provided a deceptively simple solution - use PARSE_DECLTYPES when creating the connection:
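A minimal sketch of that option, assuming the connection in question is made with Python's standard `sqlite3` module. The `orders` table, its `ordered` column, and the helper name are hypothetical; the adapter and converter are registered explicitly so the example does not depend on `sqlite3`'s built-in defaults.

```python
import datetime
import sqlite3

# Store dates as ISO strings and turn columns declared as DATE back into dates.
sqlite3.register_adapter(datetime.date, lambda d: d.isoformat())
sqlite3.register_converter(
    "date", lambda raw: datetime.date.fromisoformat(raw.decode())
)


def first_order_day(detect_types):
    """Insert one date into a throwaway table and read it back."""
    conn = sqlite3.connect(":memory:", detect_types=detect_types)
    conn.execute("CREATE TABLE orders (ordered DATE)")  # hypothetical table
    conn.execute("INSERT INTO orders VALUES (?)", (datetime.date(2023, 12, 7),))
    value = conn.execute("SELECT ordered FROM orders").fetchone()[0]
    conn.close()
    return value


plain = first_order_day(0)                        # no type detection: a str
typed = first_order_day(sqlite3.PARSE_DECLTYPES)  # declared type drives conversion

print(type(plain).__name__, type(typed).__name__)
```

With `PARSE_DECLTYPES` set, the value comes back as a `datetime.date` instead of a string, so a consumer reading from the connection sees real date values.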
- Polars 0.20 Released
- Segunda linguagem (Second language)
- Polars: Dataframes powered by a multithreaded query engine, written in Rust
- Summing columns in remote Parquet files using DuckDB
- Polars 0.34 is released. (A query engine focussing on DataFrame front ends)
databend
-
Solutions to manage runaway Snowflake costs?
Databend vs. Snowflake: https://github.com/datafuselabs/databend/issues/13059
-
I Accidentally Saved My Company Half a Million Dollars
Indeed, under a pay-as-you-go model, if there's a lack of precise control over the warehouse, such as a 10-minute suspension, it could lead to significant waste. This is because most queries might only take a few seconds, and the rest of the time is wasted. If you find Snowflake expensive, consider Databend. It's an open-source, cost-efficient alternative to Snowflake, and it maintains a consistent product experience with Snowflake.
Open-source: https://github.com/datafuselabs/databend
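The waste argument above can be put into rough numbers. This is a back-of-the-envelope sketch under a simplified, assumed billing model: a warehouse is billed for the query itself plus the idle time until it auto-suspends, and queries arrive far enough apart that each one pays the full idle period. The function name and the 5-second query are hypothetical.

```python
def idle_fraction(query_seconds, suspend_after_seconds):
    """Share of billed running time spent idle before auto-suspend."""
    billed = query_seconds + suspend_after_seconds
    return suspend_after_seconds / billed


# Hypothetical workload: an isolated 5-second query.
ten_minute = idle_fraction(5, 600)  # 10-minute suspension
one_minute = idle_fraction(5, 60)   # 1-minute suspension

print(f"{ten_minute:.1%} idle with a 10-minute suspension")
print(f"{one_minute:.1%} idle with a 1-minute suspension")
```

Under these assumptions almost all of the billed time for a short query is idle time, which is why the suspension setting dominates the cost.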
- Databend - The Open Source Alternative to Snowflake Worth Considering
-
Anyone have experience with Databend (local or cloud)?
They're advertising as an open source direct competitor with Snowflake, with the ability to store data in Parquet files. GitHub repo (5.6k stars) here.
-
An interesting SQL function in Databend: AI_TO_SQL
Databend has recently introduced an SQL function that generates SQL statements from natural language. This feature can significantly reduce the time required for writing and debugging SQL statements.
- Faster than Rust and C++: the PERFECT hash table
-
Parsing SQL with Rust
Hi, we used to use sqlparser in [Databend](https://github.com/datafuselabs/databend), but in the end we decided to write our own SQL parser using nom-rule.
-
Databend v1.0
Link to GitHub: https://github.com/datafuselabs/databend
- Databend 1.0 Release | Blog | Databend
- Open source Snowflake alternative in Rust
What are some alternatives?
vaex - Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second
datafusion - Apache DataFusion SQL Query Engine
modin - Modin: Scale your Pandas workflows by changing a single line of code
db-benchmark - reproducible benchmark of database-like ops
duckdb-rs - Ergonomic bindings to duckdb for Rust
DataFrames.jl - In-memory tabular data in Julia
datafuse - An elastic and reliable Cloud Warehouse, offers Blazing Fast Query and combines Elasticity, Simplicity, Low cost of the Cloud, built to make the Data Cloud easy [Moved to: https://github.com/datafuselabs/databend]
datatable - A Python package for manipulating 2-dimensional tabular data structures
barrel - A database schema migration builder for Rust
Apache Arrow - Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
ClickHouse - ClickHouse® is a free analytics DBMS for big data