Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 12 data-stream Open-Source Projects
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
conduit
Conduit streams data between data stores. Kafka Connect replacement. No JVM required. (by ConduitIO)
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
scramjet
Public tracker for Scramjet Cloud Platform, a platform that bring data from many environments together.
Project mention: Knative switchboard series, part 1. Setup Knative Eventing with Kafka from scratch, scale based on events volume, and monitor | dev.to | 2024-01-04Knative dashboards together with Kafka's dashboards it sheds light on almost any aspect of what's going on in the system.
Branchless or not in this case, it still touches memory in not so good pattern. I found that a significant speedup of a classic BS could be achieved by switching to linear SIMD search when the remaining range has a width of 3-4 SIMD lines (or maybe even a little more). The bounds of that range are likely already touched and in cache, then prefetching helps. It gives 30-50% gain on 1K items array of integers, 10-25% on 1M items, depending on data distribution. Here is an example in C#: https://github.com/Spreads/Spreads/blob/main/src/Spreads.Cor...
I'd like to mention Conduit + its Postgres connector. The Pg connector comes built-in, so all that is needed is a single Conduit binary to get started. It relies on WAL, but the connector creates the replication slot itself (if needed).
Project mention: What are your favorite tools or components in the Kafka ecosystem? | /r/apachekafka | 2023-05-31For example, CLIs, UIs, monitoring tools / integrations, cluster administration, stream processing libraries (Flink, Kafka Streams, smaller / newer libs), etc? Anything in the ML / AI space (e.g. a quick Google search came up with https://github.com/ertis-research/kafka-ml).
EthanYuan/open-transaction-pool: A CKB Open Transaction solution based on memory pool.
data-stream related posts
- Beautiful branchless binary search
- Spreads: NEW Data - star count:384.0
- Spreads: NEW Data - star count:384.0
- Spreads: NEW Data - star count:384.0
- Spreads: NEW Data - star count:384.0
- Spreads: NEW Data - star count:384.0
- Spreads: NEW Data - star count:384.0
-
A note from our sponsor - InfluxDB
www.influxdata.com | 27 Apr 2024
Index
What are some of the best open-source data-stream projects? This list will help you:
Project | Stars | |
---|---|---|
1 | awesome-bigdata | 12,792 |
2 | strimzi-kafka-operator | 4,456 |
3 | go-streams | 1,753 |
4 | boomfilters | 1,574 |
5 | openscap | 1,268 |
6 | Spreads | 417 |
7 | conduit | 345 |
8 | scramjet | 254 |
9 | openHistorian | 168 |
10 | kafka-ml | 144 |
11 | CohesionKit | 7 |
12 | open-transaction-pool | 3 |
Sponsored