glaredb
chdb
glaredb | chdb | |
---|---|---|
6 | 18 | |
530 | 1,726 | |
5.8% | 4.8% | |
9.8 | 9.5 | |
5 days ago | 6 days ago | |
Rust | C++ | |
GNU Affero General Public License v3.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
glaredb
- GlareDB: An analytics DBMS for distributed data
- GlareDB – Your Data Pipeline, Simplified
-
Using SQL inside Python pipelines with Duckdb, Glaredb (and others?)
Glaredb: https://github.com/GlareDB/glaredb - just heard about this last week. We played around with hooking directly into snowflake, so that was cool, but I haven't heard of anyone else using it.
- GlareDB: An open source SQL database to query and analyze distributed data
chdb
- FLaNK Stack Weekly 06 Nov 2023
-
DB Pilot: Query Postgres, files, S3 and more – all at once, from your laptop
Hey HN, creator of DB Pilot here.
I first announced DB Pilot on HN back in April: https://news.ycombinator.com/item?id=35761979.
Since then a lot has improved: More databases are supported, most of the product can now be used for free, and most importantly:
The app now comes with an analytics workspace powered by an embedded ClickHouse instance, running locally on your machine. This allows you to query local files, files on S3, PostgreSQL, SQLite & more - and all of those at once.
Embedding ClickHouse was possible thanks to chDB (https://github.com/chdb-io/chdb). A recent discussion on HN about it: https://news.ycombinator.com/item?id=37985005
- ChDB: Embedded OLAP SQL Engine Powered by ClickHouse
-
DuckDB 0.9.0
I recommend using ClickHouse instead of DuckDB.
It has been around since 2016, and it covers and extends the feature set of DuckDB with a huge margin. Worth noting that it never has breaking changes in its table format MergeTree.
I'm tracking the progress of DuckDB and see that it is modeled after ClickHouse, but does not approach it in terms of feature completeness, stability, or performance.
The closest to DuckDB option is to use its self-contained version, clickhouse-local: https://clickhouse.com/blog/extracting-converting-querying-l... or an embedded version, chdb: https://github.com/chdb-io/chdb
-
Is ClickHouse Moving Away from Open Source?
Different beasts, but if by any chance you love ClickHouse already and just want to run OLAP queries in-process, there's chdb: https://github.com/chdb-io/chdb
- ChDB: An Embedded OLAP SQL Engine Powered by ClickHouse
-
PRQL, Pipelined Relational Query Language
> Can you embed it in Python as a library?
https://github.com/chdb-io/chdb
pip install chdb
-
Using SQL inside Python pipelines with Duckdb, Glaredb (and others?)
New kid on the block that I prefer over DuckDB is CHDB (https://github.com/chdb-io/chdb). Embedded ClickHouse, so once you out grow your laptop you can simply move to an actual OLAP that's Open-source.
- ClickHouse-local and chdb performance issue on clickbench Q.23 Q28
What are some alternatives?
techslamneggs - The code for my May 3, 2023 workshop at Greenville's Tech Slam 'N Eggs!
risingwave - SQL stream processing, analytics, and management. We decouple storage and compute to offer speedy bootstrapping, dynamic scaling, time-travel queries, and efficient joins.
risinglight - An educational OLAP database system.
openvino_notebooks - 📚 Jupyter notebook tutorials for OpenVINO™
Meerschaum - Create and manage data pipes with Meerschaum.
duckdb-wasm - WebAssembly version of DuckDB
tensorbase - TensorBase is a new big data warehousing with modern efforts.
chdb-cli - Simple CLI / REPL for chdb made in Python
datafuse - An elastic and reliable Cloud Warehouse, offers Blazing Fast Query and combines Elasticity, Simplicity, Low cost of the Cloud, built to make the Data Cloud easy [Moved to: https://github.com/datafuselabs/databend]
sqlite_blaster_python - A library for creating huge Sqlite indexes at breakneck speeds
roapi - Create full-fledged APIs for slowly moving datasets without writing a single line of code.
pyprql - Python extensions for PRQL