SaaSHub helps you find the best software and product alternatives Learn more →
Top 14 Python Data Validation Projects
-
Project mention: Forced to move away from Django template because of nested forms ? | /r/django | 2022-07-07
Forms are hard. We use python jsonschema to write our form schemas and validation and use react json schema form for the front end. It's a long time in the making and we still have to write widgets and extensions to get everything we need. Good luck.
-
Project mention: Do you think we need an open-source web scraping monitoring tool? | /r/webscraping | 2023-05-06
I wrote scrapy-test as a proof of concept for validating live pages for scrapy spiders if you're looking for some reference but if you're not using scrapy I'd recommend just adding validation tests using data validation tools like cerberus which is super underrated. I cover popular data validation techniques on this short blog I wrote if you want to learn more.
-
Sonar
Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
-
Validation using Schema
-
deepchecks
Deepchecks - Tests for Continuous Validation of ML Models & Data. Deepchecks is a Python package for comprehensively validating your machine learning models and data with minimal effort.
Project mention: [D] DL Practitioners, Do You Use Layer Visualization Tools s.a GradCam in Your Process? | /r/MachineLearning | 2022-10-28 -
-
I use Pandera, so I just need to define the expected input/output schemas (i.e. column names, types, and constraints on them), and Pandera automatically generates fake data for the unit tests, and validates the result: https://github.com/unionai-oss/pandera
-
-
InfluxDB
Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.
-
Project mention: [D] Is accurately estimating image quality even possible? | /r/MachineLearning | 2023-04-22
Github: https://github.com/cleanlab/cleanvision Blogpost: https://cleanlab.ai/blog/cleanvision/
-
objectiv-analytics
Open-source product analytics infrastructure for data teams that want full control. Built for high quality data collection and ready to use for advanced analytics & ML.
Project mention: Get tools to test, validate and debug your tracking instrumentation → Set up error-free user behavior tracking → No more missing/faulty data downstream. | /r/u_objectiv_io | 2022-09-19 -
-
-
Encord Active
Open source active learning toolkit to find failure modes in your computer vision models, prioritize data to label next, and drive data curation to improve model performance.
Project mention: Show HN: Open-source toolkit for ML model evaluation and active learning | news.ycombinator.com | 2023-05-09 -
python-codicefiscale
:it: :credit_card: italian fiscal codes encoding, decoding and validation - codifica, decodifica e validazione del Codice Fiscale italiano.
-
Project mention: Is there a technique/package that allows me to do Laravel-style validation of POST data in Django? | /r/django | 2022-10-19
My question: Does anything resembling Laravel's form validation exist for Django? I found django-laravel-validator, laravel-validation, and python_payload_validation, but none of them look suitable for real-world projects.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Data Validation related posts
- Do you think we need an open-source web scraping monitoring tool?
- [D] Is accurately estimating image quality even possible?
- Python: Data validation
- Deepchecks
- deepchecks: Test Suites for Validating ML Models & Data. Deepchecks is a Python package for comprehensively validating your machine learning models and data with minimal effort.
- Forced to move away from Django template because of nested forms ?
- Can you suggest something more to grow in scraping?
-
A note from our sponsor - #<SponsorshipServiceOld:0x00007f0920d7d810>
www.saashub.com | 6 Jun 2023
Index
What are some of the best open-source Data Validation projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | jsonschema | 4,150 |
2 | Cerberus | 2,980 |
3 | schema | 2,751 |
4 | deepchecks | 2,694 |
5 | Schematics | 2,558 |
6 | pandera | 2,351 |
7 | voluptuous | 1,769 |
8 | cleanvision | 609 |
9 | objectiv-analytics | 448 |
10 | colander | 437 |
11 | valideer | 262 |
12 | Encord Active | 204 |
13 | python-codicefiscale | 61 |
14 | laravel-validation | 10 |