tikv vs arrow-datafusion

Project	tikv	arrow-datafusion
Mentions	21	55
Stars	14,476	4,924
Growth	1.7%	4.9%
Activity	9.7	9.9
Latest Commit	1 day ago	about 16 hours ago
Language	Rust	Rust
License	Apache License 2.0	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
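
As a rough illustration of the metrics described above, the Python sketch below computes month-over-month star growth and turns an activity score into a "top X%" figure. Both function names are hypothetical, the star counts are made-up illustrative numbers, and the linear 0-10 activity scale is an assumption extrapolated from the single example in the text (9.0 means top 10%); the site's real formula is not documented here.

```python
def month_over_month_growth(previous_stars: int, current_stars: int) -> float:
    """Return star growth over one month as a percentage."""
    if previous_stars <= 0:
        raise ValueError("previous_stars must be positive")
    return (current_stars - previous_stars) / previous_stars * 100.0


def top_percent_from_activity(activity: float) -> float:
    """Map an activity score to a 'top X%' figure.

    Assumption: the score is a linear percentile on a 0-10 scale, which is
    only extrapolated from the one example in the text (9.0 -> top 10%).
    """
    if not 0.0 <= activity <= 10.0:
        raise ValueError("activity must be between 0 and 10")
    return round((10.0 - activity) * 10.0, 1)


# Illustrative numbers only, not real project data.
growth = round(month_over_month_growth(1000, 1017), 1)
top_percent = top_percent_from_activity(9.0)
print(growth, top_percent)
```

Under these assumptions, a project going from 1,000 to 1,017 stars in a month shows 1.7% growth, and an activity of 9.0 maps to the top 10%.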

tikv

Posts with mentions or reviews of tikv. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-25.

just wanted to ask is there an in memory database that uses s3 or gcp cloud storage as permanent storage
1 project | /r/Database | 4 Jul 2023

I know that very similar functionality is available in TiDB Serverless (https://tidbcloud.com). TiDB is a distributed relational database that uses TiKV, a key/value engine, as its storage engine, so you can use SQL to access your K/V records. There is ongoing work in TiKV to support S3 directly as the storage backend (https://github.com/tikv/tikv/issues/6506).
Implementing a distributed key-value store on top of implementing Raft in Go
5 projects | news.ycombinator.com | 25 May 2023
Production grade databases in Rust
14 projects | /r/rust | 21 Apr 2023
Can anyone recommend tikv nosql database
1 project | /r/developers | 13 Jan 2023
Go devs that learned Rust, what are your thoughts on it?
7 projects | /r/golang | 8 Jan 2023
Apache Pegasus – A distributed key-value storage system
4 projects | news.ycombinator.com | 6 Oct 2022

TiKV is basically a layer on top of rocksdb https://github.com/tikv/tikv/blob/956610725039835557e7516828...
TiKV is a highly scalable, low latency, and easy to use key-value database
1 project | news.ycombinator.com | 16 Sep 2022
Surrealdb – FOSS document-graph database, for the realtime web in Rust
6 projects | news.ycombinator.com | 16 Sep 2022

> Many,many smart people…
If you look inside the code, you can see that the stated features are a result of the underlying engine (TiKV [0], also in C and Rust, from PingCAP). SurrealDB is standing on the shoulders of giants at present: TiKV, FoundationDB and RocksDB. The feature set they mention mostly comes from TiKV at present.
[0] https://tikv.org/
Cloud database for tomorrow's applications (written in Rust)
7 projects | news.ycombinator.com | 22 Aug 2022

Hi Diggsey, great question. We are currently focussed on functionality and stability, and then will draw our attention to performance. Coming this week we have a RocksDB storage implementation. We've only just launched our initial beta version, and we know there is a lot of improvement and work to be done (some of these performance issues we know about already and are on our Github issues list).
With regards to the consistency/isolation model, SurrealDB sits on top of a number of key-value stores. By using the distributed highly-available TiKV storage backend, https://tikv.org, (and we have a FoundationDB integration in the works), the database is designed to be highly-scalable and highly-available. The same guarantees (albeit just single-node, so no high-availability or scalability) will be available with the RocksDB implementation coming this week. By sitting on top of these key-value stores, SurrealDB ensures that all transactions are ACID compliant. We don't want to go for speed (for instance by writing to /dev/null) over anything, but want SurrealDB to be a reliable and performant backend for any application. Obviously we have a way to go to catch up with PostgreSQL (launched in 1996), but we will strive to get there!
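
To make the idea of transactions layered on a key-value store more concrete, here is a minimal Python sketch. It is not SurrealDB's or TiKV's implementation: `ToyKVStore` and `ToyTransaction` are hypothetical names, the store is a single in-memory dict, and the sketch illustrates only atomicity (all buffered writes apply or none do), not isolation, durability, or distribution.

```python
class ToyKVStore:
    """In-memory key-value store with all-or-nothing transactions (toy example)."""

    def __init__(self):
        self._data = {}

    def get(self, key):
        return self._data.get(key)

    def transaction(self):
        return ToyTransaction(self)


class ToyTransaction:
    """Buffers writes and applies them only if the block finishes without error."""

    def __init__(self, store):
        self._store = store
        self._writes = {}

    def __enter__(self):
        return self

    def put(self, key, value):
        self._writes[key] = value

    def get(self, key):
        # Read our own uncommitted writes first, then fall back to the store.
        if key in self._writes:
            return self._writes[key]
        return self._store.get(key)

    def __exit__(self, exc_type, exc, tb):
        if exc_type is None:
            self._store._data.update(self._writes)  # commit everything at once
        self._writes = {}  # on error the buffered writes are simply discarded
        return False  # never swallow the exception


store = ToyKVStore()

# First transaction succeeds, so both writes become visible.
with store.transaction() as tx:
    tx.put("alice", 90)
    tx.put("bob", 110)

# Second transaction fails midway, so none of its writes are applied.
try:
    with store.transaction() as tx:
        tx.put("alice", 0)
        raise RuntimeError("simulated failure")
except RuntimeError:
    pass

print(store.get("alice"), store.get("bob"))
```

After the failed second transaction, the store still holds the values written by the first one. A real backend would additionally need concurrency control and persistence to earn the full ACID label.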
CeresDB: A high-performance, distributed, schema-less and time-series database
3 projects | news.ycombinator.com | 15 Jun 2022

If you are looking for a production-ready distributed store written in Rust, check out TiKV (https://github.com/tikv/tikv), which is also mentioned in the acknowledgment section of the project's README.
There's also a full-featured distributed RDBMS called TiDB built on top of TiKV.

arrow-datafusion

Posts with mentions or reviews of arrow-datafusion. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-25.

Velox: Meta's Unified Execution Engine [pdf]
2 projects | news.ycombinator.com | 25 Mar 2024

Python's Substrait seems like the biggest/most-used competitor-ish out there. I'd love some compare & contrast; my sense is that Substrait has a smaller ambition, and more wants to be a language for talking about execution rather than a full on execution engine. https://github.com/substrait-io/substrait
We can also see from the DataFusion discussion that they too see themselves as a bit of a Velox competitor. https://github.com/apache/arrow-datafusion/discussions/6441
What I Talk About When I Talk About Query Optimizer (Part 1): IR Design
7 projects | news.ycombinator.com | 29 Jan 2024

Agree, substrait is a really cool project! Related: if you like substrait you might want to check out datafusion too. The project is a query execution engine built on top of Apache Arrow (with SQL parser, query planner & optimizer, execution engine, extensible user defined functions, among others) and it implements a substrait provider and consumer: https://github.com/apache/arrow-datafusion/tree/main/datafus...
DuckDB performance improvements with the latest release
8 projects | news.ycombinator.com | 6 Nov 2023

The draft contains some preliminary benchmark results, comparing it to DuckDB.
https://github.com/apache/arrow-datafusion/issues/6782
Apache Arrow DataFusion
1 project | news.ycombinator.com | 1 Oct 2023
GlareDB: An open source SQL database to query and analyze distributed data
4 projects | /r/dataengineering | 8 Jun 2023

Apache Arrow is a pretty common in-memory format these days. DataFusion is an open query engine built in Rust, started by Andy Grove.
DuckDB 0.8.0
5 projects | news.ycombinator.com | 17 May 2023

DuckDB is a great piece of software if you are
If you are looking for a query engine implemented in a safe language (Rust) I definitely suggest checking out DataFusion. It is comparable to DuckDB in performance, has all the standard built in SQL functionality, and is extensible in pretty much all areas (query language, data formats, catalogs, user defined functions, etc)
https://arrow.apache.org/datafusion/
Disclaimer: I am a maintainer of DataFusion.
Data Engineering with Rust
5 projects | /r/rust | 9 May 2023

https://github.com/jorgecarleitao/arrow2
https://github.com/apache/arrow-datafusion
https://github.com/apache/arrow-ballista
https://github.com/pola-rs/polars
https://github.com/duckdb/duckdb
Polars: Computing a new column from multiple columns - there must be a better way
1 project | /r/rust | 4 May 2023
Bridging Async and Sync Rust Code - A lesson learned while working with Tokio
3 projects | /r/rust | 10 Mar 2023

The problem comes when you want to do this inside an async context, since we can't block an async task. https://users.rust-lang.org/t/sync-function-invoking-async/43364/6 You might need to do it in another runtime or thread. This is not recommended, but sometimes it is unavoidable when implementing a third-party trait. https://github.com/apache/arrow-datafusion/issues/3777 However, I believe this isn't a problem particular to Tokio, or to any specific runtime.
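
The "another runtime/thread" workaround can be sketched in Python's asyncio, where the same problem exists. This is an analogue and not Tokio code: `fetch_value` and `sync_api` are hypothetical names, and the sketch assumes the synchronous function has no choice but to return a plain value (for example because a third-party interface demands it).

```python
import asyncio
from concurrent.futures import ThreadPoolExecutor


async def fetch_value() -> int:
    """Stand-in for real async work."""
    await asyncio.sleep(0.01)
    return 42


def sync_api() -> int:
    """Synchronous function that needs the result of async code."""
    try:
        asyncio.get_running_loop()
    except RuntimeError:
        # No event loop is running in this thread, so we can start one here.
        return asyncio.run(fetch_value())
    # A loop is already running in this thread and asyncio.run() would raise.
    # Run the coroutine on a separate thread with its own event loop instead.
    with ThreadPoolExecutor(max_workers=1) as pool:
        return pool.submit(asyncio.run, fetch_value()).result()


async def main() -> int:
    # Calling the sync function from inside a running event loop.
    return sync_api()


result_outside = sync_api()          # called from plain sync code
result_inside = asyncio.run(main())  # called from within an async context
print(result_outside, result_inside)
```

Note that `.result()` blocks the calling thread, so the caller's event loop makes no progress while it waits. That stall is the reason the approach is discouraged and best kept for cases where the synchronous signature cannot be changed.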
Using Rust to write a Data Pipeline. Thoughts. Musings.
5 projects | /r/rust | 14 Jan 2023

What are some alternatives?

When comparing tikv and arrow-datafusion you can also consider the following projects:

redis-rs - Redis library for rust

polars - Dataframes powered by a multithreaded, vectorized query engine, written in Rust

rust-etcd - An etcd client library for Rust.

ClickHouse - ClickHouse® is a free analytics DBMS for big data

rust-rocksdb - rust wrapper for rocksdb

db-benchmark - reproducible benchmark of database-like ops

cassandra-rs - Cassandra (CQL) driver for Rust, using the DataStax C/C++ driver under the covers.

databend - Data, Analytics & AI. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com

rust-postgres - Native PostgreSQL driver for the Rust programming language

nushell - A new type of shell

diesel - A safe, extensible ORM and Query Builder for Rust

duckdb - DuckDB is an in-process SQL OLAP Database Management System
