| | conduit | timely-dataflow |
|---|---|---|
| Mentions | 7 | 11 |
| Stars | 348 | 3,157 |
| Growth | 3.2% | 0.9% |
| Activity | 9.5 | 7.0 |
| Latest commit | about 4 hours ago | 11 days ago |
| Language | Go | Rust |
| License | Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
conduit
-
Pulling CDC data from Postgres
I'd like to mention Conduit + its Postgres connector. The Postgres connector comes built-in, so a single Conduit binary is all that's needed to get started. It relies on the WAL, but the connector creates the replication slot itself (if needed).
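For context on what that looks like in practice: Conduit pipelines are typically described in a YAML config file. The sketch below is illustrative only — the plugin names and setting keys are assumptions on my part, so check the Conduit docs for the exact schema:

```yaml
# Illustrative sketch of a Conduit pipeline config; exact keys and
# setting names may differ between Conduit versions.
version: "2.2"
pipelines:
  - id: pg-to-file
    status: running
    connectors:
      - id: pg-source
        type: source
        plugin: builtin:postgres     # built-in connector, no extra install
        settings:
          url: postgres://user:pass@localhost:5432/mydb
          tables: orders
          cdcMode: logrepl           # CDC via WAL logical replication
      - id: file-dest
        type: destination
        plugin: builtin:file
        settings:
          path: ./out.jsonl
```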
-
How to connect already setup kafka cluster to mongodb?
GitHub - ConduitIO/Conduit: Data Integration for Production Data Stores. Conduit is meant to be a bit more general-purpose than Kafka Connect and is an easy drop-in replacement. We're working hard to make that even easier. We are still in the early stages of this project and are trying to build more and more connectors. You can find out more about our connector roadmap on the GitHub repo. Our connector philosophy is to be real-time first, and double down on change data capture (CDC) capabilities, all with permissive licensing.
-
What services you guys used for CDC (Change Data capture) for Sql as well as no sql databases ?
If you're looking for a tool with a UI, and one whose functionality you can easily extend with your own custom data connectors, you might also want to take a look at Conduit, another open-source tool we've developed to make building and running real-time data infrastructure more straightforward and less time-consuming.
-
Alternative Kafka Integration Framework to Kafka Connect?
You might want to check out: https://github.com/conduitio/conduit
-
Where is the modern data stack for software engineers?
This is why we are working on a project called Conduit at Meroxa. We hope to change the experience software engineers have with data.
- Conduit: Data Integration for Production Data Stores
- Conduit: Data Integration Tool for Production Data Stores written in Go
timely-dataflow
-
Readyset: A MySQL and Postgres wire-compatible caching layer
They have a bit about their technical foundation here[0].
Given that Readyset was co-founded by Jon Gjengset, who authored the paper on Noria[1] (though he has apparently since departed the company), I would assume that Readyset is the continuation of that research.
So it shares some roots with Materialize: both trace their conceptual ancestry to Naiad[2], and Materialize evolved out of timely-dataflow[3].
[0]: https://docs.readyset.io/concepts/streaming-dataflow
[1]: https://jon.thesquareplanet.com/papers/osdi18-noria.pdf
[2]: https://dl.acm.org/doi/10.1145/2517349.2522738
[3]: https://github.com/TimelyDataflow/timely-dataflow
-
Mandala: experiment data management as a built-in (Python) language feature
And systems like timely dataflow, https://github.com/TimelyDataflow/timely-dataflow
-
Arroyo: A distributed stream processing engine written in Rust
Project looks cool! Glad you open sourced it. It could use some comments in the code base to help contributors ;). I also like the datafusion usage, that is awesome. BTW I work on github.com/bytewax/bytewax, which is based on https://github.com/TimelyDataflow/timely-dataflow, another Rust dataflow computation engine.
-
Rust MPI -- Will there ever be a fully oxidized implementation?
Just found this https://github.com/TimelyDataflow/timely-dataflow and my heart skipped a beat.
-
Streaming processing in Python using Timely Dataflow with Bytewax
Bytewax is a Python-native binding to the Timely Dataflow library (written in Rust) for building highly scalable streaming (and batch) processing pipelines.
-
Alternative Kafka Integration Framework to Kafka Connect?
I am working on Bytewax, which is a Python stream processing framework built on Timely Dataflow. It is not exactly a Kafka integration framework, because it is more of a general stream-processing framework, but it might be interesting for you. We are focused on enabling people to more easily debug, containerize, parallelize, and customize their pipelines, and less on providing a declarative integration framework. It is still early days for us, and we are looking for feedback and ideas from the community.
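For readers unfamiliar with the model these frameworks expose: a dataflow pipeline is a chain of operators through which events stream, each consuming from the previous step and emitting downstream. The sketch below illustrates the idea in plain Python with hypothetical helper names — it is not the actual Bytewax API:

```python
# Conceptual sketch of a dataflow pipeline: each operator consumes an
# iterator of events and yields transformed events downstream.
# These helpers are illustrative, not the actual Bytewax API.

def source(events):
    """Ingest raw events (in a real framework: a Kafka or file input)."""
    yield from events

def parse(stream):
    """Map step: split lines into lowercase words."""
    for line in stream:
        for word in line.split():
            yield word.lower()

def count(stream):
    """Stateful step: running word counts, emitted at end of stream."""
    counts = {}
    for word in stream:
        counts[word] = counts.get(word, 0) + 1
    return counts

lines = ["To be or not to be"]
result = count(parse(source(lines)))
print(result)  # {'to': 2, 'be': 2, 'or': 1, 'not': 1}
```

In a real engine the stateful `count` step would emit incremental updates per window rather than once at end-of-stream, and each operator could run sharded across workers.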
-
[AskJS] JavaScript for data processing
We used to use a library called Pond.js, https://github.com/esnet/pond, but the reliance on Immutable.JS caused some performance pitfalls, so we wrote a system from scratch that deals with data in a batched streaming fashion. A lot of the concepts were borrowed from a Rust library called timely-dataflow, https://github.com/TimelyDataflow/timely-dataflow.
-
Dataflow: An Efficient Data Processing Library for Machine Learning
Though the name "Dataflow" might be an unfortunate name conflict with another Rust project: https://github.com/TimelyDataflow/timely-dataflow
-
Ask HN: Is there a way to subscribe to an SQL query for changes?
> In the simplest case, I'm talking about regular SQL non-materialized views which are essentially inlined.
I see that now -- makes sense!
> Wish we had some better database primitives to assemble rather than building everything on Postgres - its not ideal for a lot of things.
I'm curious to hear more about this! We agree that better primitives are required, and that's why Materialize is written in Rust using TimelyDataflow[1] and DifferentialDataflow[2] (both developed by Materialize co-founder Frank McSherry). The only relationship between Materialize and Postgres is that we are wire-compatible with Postgres; we neither share any code with Postgres nor depend on it.
[1] https://github.com/TimelyDataflow/timely-dataflow
[2] https://github.com/TimelyDataflow/differential-dataflow
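To make the "incremental" idea behind Differential Dataflow concrete: data is modeled as a stream of (record, diff) updates, and derived views are patched per update instead of being recomputed from scratch. A toy Python sketch of that pattern (not the actual Rust API):

```python
# Toy sketch of differential-dataflow-style incremental computation.
# A collection is a stream of (record, diff) updates, where diff is
# +1 for an insert and -1 for a delete; the derived view (a count per
# key) is updated in O(1) per change rather than recomputed.

from collections import defaultdict

class IncrementalCount:
    def __init__(self):
        self.counts = defaultdict(int)

    def apply(self, record, diff):
        """Apply one update; diff=+1 for insert, diff=-1 for delete."""
        self.counts[record] += diff
        if self.counts[record] == 0:
            del self.counts[record]   # drop keys whose count returns to zero
        return dict(self.counts)

view = IncrementalCount()
view.apply("page_a", +1)          # insert
view.apply("page_a", +1)          # insert
state = view.apply("page_a", -1)  # delete one occurrence
print(state)  # {'page_a': 1}
```

This is the rough shape of how a subscribed query could stay current: the engine pushes diffs through the dataflow and the subscriber sees only the changes to the result.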
-
7 Real-Time Data Streaming Tools You Should Consider On Your Next Project
Under the hood, Materialize uses Timely Dataflow (TDF) as its stream-processing engine, which lets Materialize take advantage of a distributed, data-parallel compute engine. The great thing about TDF is that it has been in open-source development since 2014 and has been battle-tested in production at large Fortune 1000-scale companies.
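The "distributed data-parallel" part works by routing records to workers by key, so stateful operators can be sharded and run in parallel. A minimal sketch of that exchange step (illustrative, not Timely's actual API):

```python
# Sketch of the "exchange" pattern a data-parallel engine like timely
# dataflow uses: records are routed to a worker by hashing their key,
# so every record with the same key lands on the same worker.

def route(records, n_workers, key):
    """Partition records into per-worker lists by key hash."""
    partitions = [[] for _ in range(n_workers)]
    for rec in records:
        worker = hash(key(rec)) % n_workers
        partitions[worker].append(rec)
    return partitions

events = [("user1", 5), ("user2", 3), ("user1", 7)]
parts = route(events, n_workers=4, key=lambda r: r[0])
# Both "user1" events land in the same partition, so a per-user
# aggregate can be computed locally on that worker with no cross-talk.
```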
What are some alternatives?
turbine-go - Turbine Library for Go
noria - Fast web applications through dynamic, partially-stateful dataflow
dozer - Dozer is a real-time data movement tool that leverages CDC from various sources and moves data into various sinks.
differential-datalog - DDlog is a programming language for incremental computation. It is well suited for writing programs that continuously update their output in response to input changes. A DDlog programmer does not write incremental algorithms; instead they specify the desired input-output mapping in a declarative manner.
sqlpipe - SQLpipe makes it easy to move the result of one query from one database to another.
materialize - The data warehouse for operational workloads.
bytewax - Python Stream Processing
Benthos - Fancy stream processing made operationally mundane
realtime - Broadcast, Presence, and Postgres Changes via WebSockets
deprecated-core - 🔮 Instill Core contains components for supporting Instill VDP and Instill Model
differential-dataflow - An implementation of differential dataflow using timely dataflow on Rust.