chdb
glaredb
chdb | glaredb | |
---|---|---|
18 | 6 | |
1,726 | 527 | |
4.8% | 5.3% | |
9.5 | 9.8 | |
1 day ago | 6 days ago | |
C++ | Rust | |
Apache License 2.0 | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
chdb
- FLaNK Stack Weekly 06 Nov 2023
-
DB Pilot: Query Postgres, files, S3 and more β all at once, from your laptop
Hey HN, creator of DB Pilot here.
I first announced DB Pilot on HN back in April: https://news.ycombinator.com/item?id=35761979.
Since then a lot has improved: More databases are supported, most of the product can now be used for free, and most importantly:
The app now comes with an analytics workspace powered by an embedded ClickHouse instance, running locally on your machine. This allows you to query local files, files on S3, PostgreSQL, SQLite & more - and all of those at once.
Embedding ClickHouse was possible thanks to chDB (https://github.com/chdb-io/chdb). A recent discussion on HN about it: https://news.ycombinator.com/item?id=37985005
- ChDB: Embedded OLAP SQL Engine Powered by ClickHouse
-
DuckDB 0.9.0
I recommend using ClickHouse instead of DuckDB.
It has been around since 2016, and it covers and extends the feature set of DuckDB with a huge margin. Worth noting that it never has breaking changes in its table format MergeTree.
I'm tracking the progress of DuckDB and see that it is modeled after ClickHouse, but does not approach it in terms of feature completeness, stability, or performance.
The closest to DuckDB option is to use its self-contained version, clickhouse-local: https://clickhouse.com/blog/extracting-converting-querying-l... or an embedded version, chdb: https://github.com/chdb-io/chdb
-
Is ClickHouse Moving Away from Open Source?
Different beasts, but if by any chance you love ClickHouse already and just want to run OLAP queries in-process, there's chdb: https://github.com/chdb-io/chdb
- ChDB: An Embedded OLAP SQL Engine Powered by ClickHouse
-
PRQL, Pipelined Relational Query Language
> Can you embed it in Python as a library?
https://github.com/chdb-io/chdb
pip install chdb
-
Using SQL inside Python pipelines with Duckdb, Glaredb (and others?)
New kid on the block that I prefer over DuckDB is CHDB (https://github.com/chdb-io/chdb). Embedded ClickHouse, so once you out grow your laptop you can simply move to an actual OLAP that's Open-source.
- ClickHouse-local and chdb performance issue on clickbench Q.23 Q28
glaredb
- GlareDB: An analytics DBMS for distributed data
- GlareDB β Your Data Pipeline, Simplified
-
Using SQL inside Python pipelines with Duckdb, Glaredb (and others?)
Glaredb: https://github.com/GlareDB/glaredb - just heard about this last week. We played around with hooking directly into snowflake, so that was cool, but I haven't heard of anyone else using it.
- GlareDB: An open source SQL database to query and analyze distributed data
What are some alternatives?
risingwave - SQL stream processing, analytics, and management. PostgreSQL simplicity, unrivaled performance, and seamless elasticity. π 10x more productive. π 10x more cost-efficient.
techslamneggs - The code for my May 3, 2023 workshop at Greenville's Tech Slam 'N Eggs!
openvino_notebooks - π Jupyter notebook tutorials for OpenVINOβ’
risinglight - An educational OLAP database system.
duckdb-wasm - WebAssembly version of DuckDB
Meerschaum - Create and manage data pipes with Meerschaum.
chdb-cli - Simple CLI / REPL for chdb made in Python
tensorbase - TensorBase is a new big data warehousing with modern efforts.
sqlite_blaster_python - A library for creating huge Sqlite indexes at breakneck speeds
datafuse - An elastic and reliable Cloud Warehouse, offers Blazing Fast Query and combines Elasticity, Simplicity, Low cost of the Cloud, built to make the Data Cloud easy [Moved to: https://github.com/datafuselabs/databend]
pyprql - Python extensions for PRQL
roapi - Create full-fledged APIs for slowly moving datasets without writing a single line of code.