My company has been using a lot of PySpark, but our data isn't that large (<1 TB per source per day), so Spark can be overkill sometimes, and I've been looking for a lightweight replacement. I think I've found one that fits all our needs, called Meerschaum, but I don't see many other DEs talking about it.
Hi there, Meerschaum author here! Please see this template repository and the plugins page if you'd like to learn more. The Meerschaum Compose workflow is similar to Meltano's but more lightweight and time-series-focused. Please don't let the number of stars discourage you from trying it out!
Related posts
- I’m struggling with how to ask for help with my task.
- For those of you who were self taught, what was your path into data engineering
- Wanted to share my open source incremental ETL framework: Meerschaum
- Python ETL - Jupyter/Pandas/Postgresql(DW) - Project Structure and Scripting
- Tools that allow you to use scripts to build/maintain data pipeline