python-fake-data-producer-for-apache-kafka
ClickHouse
python-fake-data-producer-for-apache-kafka | ClickHouse | |
---|---|---|
41 | 251 | |
85 | 42,570 | |
- | 1.3% | |
2.7 | 10.0 | |
over 1 year ago | 5 days ago | |
Python | C++ | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
python-fake-data-producer-for-apache-kafka
-
Postgres on a Budget: 3 Free Solutions You Should Know
Aiven offers a range of open-source managed data infrastructures, including PostgreSQL, Apache Kafka, Elasticsearch, Grafana, InfluxDB, MySQL, Redis and more.
-
SonicScan - A Music Fingerprinting and Identification App
For the database I used Valkey on Aiven for free.
-
How to Connect Your Next.js React Application to Redis
For this demo I'm using Aiven, they offer a 1GB free account, which is more than enough for a POC or prototype.
-
Run Postgres For Free: Top 3 Options
Unlike Neon, Aiven provides many open-source managed data infrastructures, such as PostgreSQL Apache Cassandra, Apache Kafka, Apache Kafka Connect, Apache Kafka MirrorMaker 2, Elasticsearch, Grafana, InfluxDB, M3, M3 Aggregator, MySQL, and Redis. It’s a jack of all trades regarding its support for open-source technology.
-
Building Scalable Data Pipelines with Python – A Complete Guide.
A PostgreSQL database (created using Aiven and connected using DBeaver).
-
Top 10 Affordable Options To Host Your PostgreSQL Database
Aiven
-
Top 8 Managed Postgres Providers
Aiven provides managed cloud service for PostgreSQL, making sure databases run smoothly, safely, and can grow easily on different cloud platforms.
-
How to Deploy Infisical to Manage Application Secrets on Koyeb
An Aiven account to provision the Redis database.
-
Helping PostgreSQL® professionals with AI-assisted performance recommendations
I have the luxury of working for Aiven which provides professionals an integrated platform for all their data needs. In the last three years I witnessed the growth of the platform and its evolution with the clear objective to make it better usable at scale. Tooling like integrations, Terraform providers and the Console facilitate the work that platform administrators have to perform on daily basis.
-
ElephantSQL Is Shutting Down
I had good experience with Aiven in the past, we needed something located in the EU: https://aiven.io/
ClickHouse
- Strategies for Fast Lexers
-
From Go to Rust: Supercharging Our ClickHouse UDFs with Alloy
At Agnostic, we build open-source infrastructure for collaborative blockchain data platforms. One of our flagship tools is clickhouse-evm, a suite of high-performance User Defined Functions (UDFs) that brings native Ethereum decoding and querying capabilities directly into ClickHouse.
-
🧠 From Hive and Elastic to ClickHouse: What Surprised Me
Over the past few weeks, I’ve been diving into ClickHouse — and it’s been full of surprises.
- Show HN: Hacker News historic upvote and score data
-
Cross-Compiling Haskell under NixOS with Docker
I attended the AWS Summit 2025 in Singapore. I enjoyed the event. There were booths from various companies which I found interesting, such as GitLab and ClickHouse. More importantly, I got to meet very interesting people.
-
ClickHouse raises $350M Series C
https://github.com/ClickHouse/clickhouse
Disclosure: I started at Citus & ended up at ClickHouse
-
How to Build a Streaming Deduplication Pipeline with Kafka, GlassFlow, and ClickHouse
ClickHouse: A fast columnar database. It will be our final destination for clean data. And, for simplicity in this tutorial, we'll cleverly use it as our "memory" or state store to remember which events we've already seen recently.
- Waiting for Postgres 18: Accelerating Disk Reads with Asynchronous I/O
-
Why You Shouldn’t Invest In Vector Databases?
In fact, even in the absence of these commercial databases, users can effortlessly install PostgreSQL and leverage its built-in pgvector functionality for vector search. PostgreSQL stands as the benchmark in the realm of open-source databases, offering comprehensive support across various domains of database management. It excels in transaction processing (e.g., CockroachDB), online analytics (e.g., DuckDB), stream processing (e.g., RisingWave), time series analysis (e.g., Timescale), spatial analysis (e.g., PostGIS), and more. For non-professional users seeking to explore vector databases, they can readily download the open-source PostgreSQL or utilize managed services like Supabase and Neon to establish their own basic AI applications. Other than PostgreSQL, several open-source databases, including OpenSearch, ClickHouse, and Cassandra, have implemented their own vector search functionality. You do not need to adopt a new vector database if you have already used these systems.
-
Reproducing Hacker News writing style fingerprinting
https://gh-api.clickhouse.tech/play?user=play#U0VMRUNUICogRl...
I subscribe to this issue to keep up with updates:
https://github.com/ClickHouse/ClickHouse/issues/29693#issuec...
And ofc, for those that don't know, the official API https://github.com/HackerNews/API
What are some alternatives?
strimzi-kafka-cli - Command Line Interface for the Strimzi Kafka Operator
loki - Like Prometheus, but for logs.
demo-scene - Scripts and samples to support Confluent Demos, Talks, and Blogs. Not all of the examples in this repository are kept up to date. For automated tutorials and QA'd code, see https://github.com/confluentinc/tutorials/
RocksDB - A library that provides an embeddable, persistent key-value store for fast storage.
fake-data-producer-for-apache-kafka-docker - Fake Data Producer for Aiven for Apache Kafka® in a Docker Image
DuckDB - DuckDB is an analytical in-process SQL database management system