Top 4 Python etl-framework Projects
- dataall: A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
- flowrunner: Flowrunner is a lightweight package to organize and represent Data Engineering/Science workflows.
An awesome read!
Something related that I found out about from HN a few months back is another engine called quokka. What makes it particularly interesting and relevant is how quokka schedules distributed queries to outperform Spark: https://github.com/marsupialtail/quokka/blob/master/blog/why...
Python etl-framework related posts
- FLaNK Weekly 31 December 2023
- Quokka – Distributed Polars on Ray
- Why your dataframe library needs to understand vector embeddings
- Python Package to build ETL flows/dags
- Launch HN: Patterns (YC S21) – A much faster way to build and deploy data apps
- Datajob: Build and deploy a serverless data pipeline on AWS with no effort.
Index
What are some of the best open-source etl-framework projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | quokka | 1,081 |
2 | dataall | 210 |
3 | patterns-devkit | 106 |
4 | flowrunner | 8 |