Top 4 Python data-testing Projects
- soda-core
  Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
  Project mention: Show HN: PipeRider – open-source Data Impact Analysis for dbt changes | news.ycombinator.com | 2023-09-06
- soda-spark
  Soda Spark is a PySpark library that helps you test your data in Spark DataFrames.
NOTE:
The open-source projects on this list are ordered by number of GitHub stars.
The number of mentions indicates repo mentions in the last 12 months or
since we started tracking (Dec 2020).
The latest post mention was on 2023-09-06.
Python data-testing related posts
- Data profiling tools / approaches?
- Data QC? Great Expectations?
- Show HN: Soda Core is now GA – Test data like you would test your code
- Data Quality - Great Expectations for Data Engineers
- dbt vs R/Python for transformation
- SodaCL - preview of a new "data reliability as code" language
- How do you test your pipelines?
Index
What are some of the best open-source data-testing projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | soda-core | 1,724 |
2 | piperider | 467 |
3 | soda-spark | 60 |
4 | data_check | 4 |