arroyo vs tensorbase

arroyo

Distributed stream processing engine in Rust (by ArroyoSystems)

arroyo.dev

tensorbase

TensorBase is a new big data warehouse built with modern efforts. (by tensorbase)

Rust Bigdata Database Analytics Modern Infrastructure Data data-infrastructure High Performance Engineering rust-lang warehouse data-warehouse

tensorbase.io

arroyo	Project	tensorbase
13	Mentions	1
3,326	Stars	1,429
3.2%	Growth	0.4%
9.6	Activity	0.0
6 days ago	Latest Commit	about 2 years ago
Rust	Language	Rust
Apache License 2.0	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
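
As a rough worked example of the Growth figure, the sketch below assumes that "month over month growth" means the percentage change in star count relative to the previous month's total. The page does not spell out its exact formula, and both helper names are hypothetical. Under that assumption, the numbers in the table imply how many stars each project had a month earlier.

```python
def month_over_month_growth(previous_stars, current_stars):
    """Percentage change in stars relative to the previous month (assumed definition)."""
    return (current_stars - previous_stars) / previous_stars * 100


def implied_previous_stars(current_stars, growth_percent):
    """Invert the assumed formula to estimate last month's star count."""
    return current_stars / (1 + growth_percent / 100)


# Star counts and growth percentages taken from the comparison table above.
arroyo_previous = implied_previous_stars(3326, 3.2)
tensorbase_previous = implied_previous_stars(1429, 0.4)

print(round(arroyo_previous))      # estimated arroyo stars one month earlier
print(round(tensorbase_previous))  # estimated tensorbase stars one month earlier
```

If the assumption holds, arroyo gained roughly 100 stars over the month and tensorbase gained about 6. Because the published percentages are rounded, these are estimates only.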

arroyo

Posts with mentions or reviews of arroyo. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-18.

FLaNK AI Weekly 18 March 2024
39 projects | dev.to | 18 Mar 2024
Arroyo 0.8 released - streaming SQL engine
1 project | /r/dataengineering | 1 Dec 2023
Query Engines: Push vs. Pull
4 projects | news.ycombinator.com | 1 Aug 2023

Interesting - I looked into your code a bit. I found your window aggregation library [1]. You may be interested in looking into the Rust implementation of some of the research work I've been a part of [2].
In Flink, I believe the reason they need to implement their own backpressure system is that they multiplex TCP connections. That is, they have multiple logical streams flowing through a single TCP connection. If that's the case, you need to do some work to 1) detect which logical stream is the one that's blocking, and 2) avoid blocking the whole connection, because other logical streams may still be able to use it.
Thinking it through, I think what Flink's approach buys is not necessarily better performance, but a more manageable number of connections. That is, imagine you have a process P1 with operators A, B and C. And then P2 has D, E, F. Now imagine that this is a shuffle, where A, B and C are fully connected to D, E and F. In my old system, you would have 9 TCP connections. In Flink, you will have 1.
[1] https://github.com/ArroyoSystems/arroyo/blob/master/arroyo-w...
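
To make the connection counting in the comment above concrete, here is a minimal sketch. It models only the arithmetic the commenter describes: one TCP connection per operator pair versus one multiplexed connection per process pair. It is not Flink's or Arroyo's actual networking code, and the function name and the dictionary layout are hypothetical.

```python
from itertools import product


def shuffle_connections(upstream, downstream, multiplex):
    """Count TCP connections for a fully connected shuffle.

    upstream / downstream map a process name to the operators it hosts.
    """
    if multiplex:
        # One shared connection per (upstream process, downstream process) pair;
        # all logical operator-to-operator streams travel over it.
        return len(list(product(upstream, downstream)))
    # One dedicated connection per (upstream operator, downstream operator) pair.
    return sum(
        len(up_ops) * len(down_ops)
        for up_ops in upstream.values()
        for down_ops in downstream.values()
    )


# The example from the comment: P1 hosts A, B, C and P2 hosts D, E, F.
senders = {"P1": ["A", "B", "C"]}
receivers = {"P2": ["D", "E", "F"]}

dedicated = shuffle_connections(senders, receivers, multiplex=False)
shared = shuffle_connections(senders, receivers, multiplex=True)
print(dedicated, shared)
```

The dedicated count grows with the product of operator counts, while the multiplexed count grows only with the number of process pairs. The cost is that the shared connection then needs per-stream backpressure handling.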
Arroyo
1 project | /r/devopspro | 17 Jun 2023
Show HN: Arroyo – Write SQL on streaming data
3 projects | news.ycombinator.com | 6 Jun 2023
Release v0.3.0 · ArroyoSystems/arroyo - Stream Processing Engine
1 project | /r/rust | 2 Jun 2023
Arroyo 0.2 released - Rust stream processing engine, now on Kubernetes
1 project | /r/rust | 2 May 2023
Distributed stream processing engine written in Rust
1 project | news.ycombinator.com | 13 Apr 2023
ArroyoSystems/arroyo: Arroyo is a distributed stream processing engine written in Rust
1 project | /r/devopsish | 11 Apr 2023
Arroyo, a new open-source SQL stream processing engine written in Rust
1 project | /r/programming | 5 Apr 2023

tensorbase

Posts with mentions or reviews of tensorbase. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-07-18.

ToyDB: Distributed SQL Database in Rust
8 projects | news.ycombinator.com | 18 Jul 2021

+ The result of TB's architectural performance: the untuned write throughput of TB is ~2x faster than that of CH in the Rust driver bench, or ~70% faster when using CH's own `clickHouse-client` command. Use [this parallel script](https://github.com/tensorbase/tools/blob/main/import_csv_to_...) to try it yourself!
3. Thanks to Arrow DataFusion, TensorBase supports a good part of TPC-H. [Untuned TPC-H Q1 result here](https://github.com/tensorbase/benchmarks/blob/main/tpch.md).
4. In simple (no-groupby) aggregation, TensorBase is several times faster than ClickHouse. [Benchmark here](https://github.com/tensorbase/benchmarks/blob/main/quick.md).
5. For complex groupby aggregations, we recently helped boost the speed of the TB engine to the same level as ClickHouse (not released yet, but coming soon).
6. TB will soon support the MySQL wire protocol, distributed queries, adaptive columnar storage optimization... Watch the [issues here](https://github.com/tensorbase/tensorbase/issues).
Finally, it is really great to build an AP database in Rust. You are welcome to join!
Disclaimer: I am the author of TensorBase.

What are some alternatives?

When comparing arroyo and tensorbase you can also consider the following projects:

bytewax - Python Stream Processing

awesome-bigdata - A curated list of awesome big data frameworks, resources and other awesomeness.

risingwave - SQL stream processing, analytics, and management. We decouple storage and compute to offer speedy bootstrapping, dynamic scaling, time-travel queries, and efficient joins.

tools

Benthos - Fancy stream processing made operationally mundane

benchmarks

cli - Railway CLI

gitplay - Learn how a software project (using git) evolved over time from its commit log. It's like YouTube for a git project. Desktop app built with Rust and SolidJS.

feldera - Feldera Continuous Analytics Platform

toydb - Distributed SQL database in Rust, written as a learning project

timely-dataflow - A modular implementation of timely dataflow in Rust

naphtha - Universal database connection layer for your application in Rust. Implements the most common functions insert, update and remove for database connections. Change the database without having to adjust your code. Specific models can be stored in different databases. Query models by property. Migrations in pure Rust and available during runtime.
