talent-plan
duckdb
Metric | talent-plan | duckdb |
---|---|---|
Mentions | 23 | 52 |
Stars | 9,813 | 16,576 |
Growth | 1.4% | 10.7% |
Activity | 0.0 | 10.0 |
Last commit | 7 months ago | 7 days ago |
Language | Rust | C++ |
License | - | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
talent-plan
-
Need insights to build a distributed key value store from scratch.
Please check out this course: https://github.com/pingcap/talent-plan . It covers how to implement SQL and a key-value store. It's an awesome course.
-
Ask HN: Leetcode for Back End and Server Development
- https://hackattic.com/ : Interesting programming Problems.
- https://sadservers.com/ : Learn Linux by solving problems.
- https://fly.io/dist-sys/ : Distributed Systems Problems.
- https://github.com/pingcap/talent-plan/ : System Programming / Distributed System Challenge.
- https://protohackers.com/ : Server Programming Challenges.
- https://codecrafters.io/ : Implement server tech / software from scratch.
- https://hyperskill.org/ : Lots of project-based tutorials.
- new Rust Networking book
-
Gossip Glomers: Fly.io Distributed Systems Challenges
Awesome stuff!
Just to share along the way: open-source training courses on distributed databases and distributed systems by PingCAP.
https://github.com/pingcap/talent-plan
- Learning and Building a KV Database in rust.
-
How does serde_json work?
You should try this lab by PingCAP (write a serde implementation for the Redis serialization protocol); you can find answers or seek help on GitHub if you really can't figure it out.
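For readers unfamiliar with the wire format that lab targets, here is a minimal sketch of a RESP (version 2) parser using only the Rust standard library. It is not the lab's serde-based solution; the `Resp` enum and the `read_line` and `parse` helpers are hypothetical names chosen for illustration, and treating a null array as an empty array is a simplifying assumption.

```rust
/// One RESP (REdis Serialization Protocol, version 2) value.
/// Hypothetical type for illustration; not taken from the talent-plan lab.
#[derive(Debug, PartialEq)]
enum Resp {
    Simple(String),        // "+OK\r\n"
    Error(String),         // "-ERR message\r\n"
    Integer(i64),          // ":42\r\n"
    Bulk(Option<Vec<u8>>), // "$3\r\nGET\r\n"; None is the null bulk string "$-1\r\n"
    Array(Vec<Resp>),      // "*2\r\n" followed by two nested values
}

/// Split off one CRLF-terminated line; returns (line, rest).
fn read_line(input: &[u8]) -> Option<(&[u8], &[u8])> {
    let pos = input.windows(2).position(|w| w == b"\r\n")?;
    Some((&input[..pos], &input[pos + 2..]))
}

/// Parse one value from the front of `input`; returns (value, unparsed rest).
/// Returns None on malformed or incomplete input.
fn parse(input: &[u8]) -> Option<(Resp, &[u8])> {
    // The first byte is the type tag; the remainder of the line is its header.
    let (&tag, body) = input.split_first()?;
    let (line, rest) = read_line(body)?;
    let text = std::str::from_utf8(line).ok()?;
    match tag {
        b'+' => Some((Resp::Simple(text.to_string()), rest)),
        b'-' => Some((Resp::Error(text.to_string()), rest)),
        b':' => Some((Resp::Integer(text.parse::<i64>().ok()?), rest)),
        b'$' => {
            let len = text.parse::<i64>().ok()?;
            if len < 0 {
                return Some((Resp::Bulk(None), rest));
            }
            let len = len as usize;
            // The payload must be followed by a closing CRLF.
            if rest.len() < len + 2 || &rest[len..len + 2] != b"\r\n" {
                return None;
            }
            Some((Resp::Bulk(Some(rest[..len].to_vec())), &rest[len + 2..]))
        }
        b'*' => {
            let count = text.parse::<i64>().ok()?;
            let mut items = Vec::new();
            let mut rest = rest;
            // Assumption: a negative count (null array) is treated as empty.
            for _ in 0..count.max(0) {
                let (item, next) = parse(rest)?;
                items.push(item);
                rest = next;
            }
            Some((Resp::Array(items), rest))
        }
        _ => None,
    }
}
```

With this sketch, the input `*2\r\n$3\r\nGET\r\n:42\r\n` parses to an array holding the bulk string `GET` and the integer 42. A serde `Deserializer` for RESP would walk the same grammar but drive a visitor instead of building an enum.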
-
Hey Rustaceans! Got a question? Ask here! (20/2022)!
I have been working through PingCAP's talent plan recently, where you need to implement a custom Deserializer for the REdis Serialization Protocol in building block 3. What confuses me most is the implementation of `EnumAccess` and `VariantAccess`. One of my attempts looks like the following:
-
Love
Looking at your project, I would recommend following this tutorial, which has you write a key-value store in Rust. They don't provide a solution and make you come up with most things yourself, but they give you some direction and structure for learning. You seem to have come up with most of this yourself already, but it might still be useful for you.
-
Newbie here. Just finished reading the book. What now?
If you are interested in network programming and distributed systems, check out https://github.com/pingcap/talent-plan
duckdb
- 🪄 DuckDB sql hack : get things SORTED w/ constraint CHECK
- DuckDB: Move to push-based execution model (2021)
-
DuckDB performance improvements with the latest release
I'm not sure if the fix is reassuring or not: https://github.com/duckdb/duckdb/pull/9411/files
-
Building a Distributed Data Warehouse Without Data Lakes
It's an interesting question!
The problem is that the data is spread everywhere - no choice about that. So with that in mind, how do you query that data? Today, the idea is that you HAVE to put it into a central location. With tools like Bacalhau[1] and DuckDB [2], you no longer have to - a single query can be sharded amongst all your data - EFFECTIVELY giving you a lot of what you want from a data lake.
It's not a replacement, but if you can do a few of these items WITHOUT moving the data, you will be able to see really significant cost and time savings.
[1] https://github.com/bacalhau-project/bacalhau
[2] https://github.com/duckdb/duckdb
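The idea of sharding one query across data that stays in place can be sketched as a scatter-gather aggregation: each location computes a small partial result over its own rows, and only those partials travel to a coordinator. The sketch below is plain Rust for illustration only; it does not use the Bacalhau or DuckDB APIs, and `Partial`, `local_aggregate`, `merge`, and `average` are hypothetical names.

```rust
/// Partial aggregate computed where one shard of the data lives.
/// Hypothetical type for illustration; not a Bacalhau or DuckDB structure.
#[derive(Debug, Clone, Copy, PartialEq)]
struct Partial {
    count: u64,
    sum: f64,
}

/// Runs next to each shard: roughly "SELECT COUNT(x), SUM(x)" over local rows only.
fn local_aggregate(rows: &[f64]) -> Partial {
    Partial {
        count: rows.len() as u64,
        sum: rows.iter().sum(),
    }
}

/// Runs on the coordinator: combines the small partial results.
fn merge(partials: &[Partial]) -> Partial {
    partials.iter().fold(Partial { count: 0, sum: 0.0 }, |acc, p| Partial {
        count: acc.count + p.count,
        sum: acc.sum + p.sum,
    })
}

/// Global average from the merged partial, or None if there were no rows.
fn average(total: Partial) -> Option<f64> {
    if total.count == 0 {
        None
    } else {
        Some(total.sum / total.count as f64)
    }
}
```

Only two numbers per shard cross the network here, rather than the rows themselves, which is where the cost and time savings mentioned above would come from. Aggregates that cannot be merged this way (an exact median, for example) would need a different strategy.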
- DuckDB 0.9.0
-
Push or Pull, is this a question?
[4] Switch to Push-Based Execution Model by Mytherin · Pull Request #2393 · duckdb/duckdb (github.com)
-
Show HN: Hydra 1.0 – open-source column-oriented Postgres
It depends on your query, obviously.
In general, I did very deep benchmarking of pg, clickhouse and duckdb, and I sure didn't make stupid mistakes like this: https://news.ycombinator.com/item?id=36990831
My dataset has 50B rows and 2 TB of data. I think columnar DBs are very overhyped, and I chose pg because:
- pg performance is acceptable, maybe 2-3x slower than clickhouse and duckdb on some queries, if pg is configured correctly and run on compressed storage
- clickhouse and duckdb start falling apart very fast because they specialize in a very narrow type of query: https://github.com/ClickHouse/ClickHouse/issues/47520 https://github.com/ClickHouse/ClickHouse/issues/47521 https://github.com/duckdb/duckdb/discussions/6696
-
🦆 Effortless Data Quality w/duckdb on GitHub ♾️
This action installs duckdb with the version provided in input.
-
Using SQL inside Python pipelines with Duckdb, Glaredb (and others?)
Duckdb: https://github.com/duckdb/duckdb - seems pretty popular, been keeping an eye on this for close to a year now.
-
CSV or Parquet File Format
The Parquet-Go library is very complex, and I have not yet managed to use it successfully, so I asked whether DuckDB can provide an API: https://github.com/duckdb/duckdb/issues/7776
What are some alternatives?
dnsguide - A guide to writing a DNS Server from scratch in Rust
ClickHouse - ClickHouse® is a free analytics DBMS for big data
toydb - Distributed SQL database in Rust, written as a learning project
sqlite-worker - A simple, and persistent, SQLite database for Web and Workers.
rust-plus-golang - Rust + Go — Call Rust code from Go using FFI
datasette - An open source multi-tool for exploring and publishing data
mini-redis - Incomplete Redis client and server implementation using Tokio - for learning purposes only
octosql - OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.
duckdb-rs - Ergonomic bindings to duckdb for Rust
metabase-clickhouse-driver - ClickHouse database driver for the Metabase business intelligence front-end
incubator-horaedb - HoraeDB is a high-performance, distributed, cloud native time-series database.
datafusion - Apache DataFusion SQL Query Engine