Top 5 Python data-observability Projects
-
soda-core
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
-
Project mention: Show HN: PipeRider – open-source Data Impact Analysis for dbt changes | news.ycombinator.com | 2023-09-06
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
dbt-data-reliability
dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
-
swiple
Swiple enables you to easily observe, understand, validate and improve the quality of your data
-
soda-spark
Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Python data-observability related posts
Index
What are some of the best open-source data-observability projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | soda-core | 1,745 |
2 | piperider | 467 |
3 | dbt-data-reliability | 338 |
4 | swiple | 76 |
5 | soda-spark | 60 |