Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems. Learn more →
Python data-quality-monitoring Projects
-
soda-core
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
-
Judoscale
Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
-
swiple
Swiple enables you to easily observe, understand, validate and improve the quality of your data
NOTE:
The open source projects on this list are ordered by number of github stars.
The number of mentions indicates repo mentiontions in the last 12 Months or
since we started tracking (Dec 2020).
Python data-quality-monitoring discussion
Python data-quality-monitoring related posts
-
Data profiling tools / approaches?
-
Data QC? Great Expectations?
-
Show HN: Soda Core is now GA – Test data like you would test your code
-
Data Quality - Great Expectations for Data Engineers
-
dbt vs R/Python for transformation
-
SodaCL - preview of a new "data reliability as code" language
-
Being constantly shut down by more senior team members when I mention adding some QA in our work
-
A note from our sponsor - InfluxDB
influxdata.com | 23 Apr 2025