Top 18 Python Data Validation Projects
-
cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Project mention: Ask HN: Not a webdev, why are these sites so good? | news.ycombinator.com | 2024-06-18 https://cleanlab.ai/
-
deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling you to thoroughly test your data and models from research to production.
-
soda-core
Data quality testing for the modern data stack (SQL, Spark, and Pandas). https://www.soda.io
-
Encord Active
Open source active learning toolkit to find failure modes in your computer vision models, prioritize data to label next, and drive data curation to improve model performance.
-
python-codicefiscale
Italian fiscal code (Codice Fiscale) encoding, decoding, and validation.
-
Validoopsie
❄️ https://github.com/akmalsoliev/Validoopsie
-
snowflake-provisioning
Snowflake database, schema, and warehouse provisioning with access roles; generation and provisioning of functional roles; and a tool for Snowflake source export, cloning, and data tie-out.
Python Data Validation related posts
-
Detect, Defend, Prevail: Payments Fraud Detection using ML & Deepchecks
-
Deepchecks: Open-source ML testing and validation library
-
Deepchecks' New Open Source is on Product Hunt, and Needs Your Help
-
Do you think we need an open-source web scraping monitoring tool?
-
[D] Is accurately estimating image quality even possible?
-
Python: Data validation
-
Deepchecks
Index
What are some of the best open-source Data Validation projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | cleanlab | 10,241 |
| 2 | jsonschema | 4,723 |
| 3 | deepchecks | 3,742 |
| 4 | pandera | 3,698 |
| 5 | Cerberus | 3,198 |
| 6 | schema | 2,904 |
| 7 | Schematics | 2,582 |
| 8 | soda-core | 2,043 |
| 9 | voluptuous | 1,831 |
| 10 | cleanvision | 1,058 |
| 11 | colander | 456 |
| 12 | Encord Active | 448 |
| 13 | valideer | 263 |
| 14 | python-codicefiscale | 77 |
| 15 | Validoopsie | 58 |
| 16 | snowflake-provisioning | 42 |
| 17 | laravel-validation | 12 |
| 18 | data_check | 4 |