Python data-quality-monitoring

Open-source Python projects categorized as data-quality-monitoring

Python data-quality-monitoring Projects

data-quality-monitoring
  1. soda-core

    :zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io

  2. Judoscale

    Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.

    Judoscale logo
  3. swiple

    Swiple enables you to easily observe, understand, validate and improve the quality of your data

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python data-quality-monitoring discussion

Log in or Post with

Python data-quality-monitoring related posts

  • Data profiling tools / approaches?

    1 project | /r/dataengineering | 15 Feb 2023
  • Data QC? Great Expectations?

    1 project | /r/dataengineering | 30 Jan 2023
  • Show HN: Soda Core is now GA – Test data like you would test your code

    1 project | news.ycombinator.com | 28 Jun 2022
  • Data Quality - Great Expectations for Data Engineers

    2 projects | /r/dataengineering | 18 Mar 2022
  • dbt vs R/Python for transformation

    2 projects | /r/dataengineering | 25 Feb 2022
  • SodaCL - preview of a new "data reliability as code" language

    1 project | /r/dataengineering | 13 Feb 2022
  • Being constantly shut down by more senior team members when I mention adding some QA in our work

    1 project | /r/dataengineering | 10 Jan 2022
  • A note from our sponsor - InfluxDB
    influxdata.com | 23 Apr 2025
    Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems. Learn more →

Index

# Project Stars
1 soda-core 2,069
2 swiple 82

Sponsored
Save 47% on cloud hosting with autoscaling that just works
Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
judoscale.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?