Python Data Validation

Open-source Python projects categorized as Data Validation

Top 14 Python Data Validation Projects

  • jsonschema

    An implementation of the JSON Schema specification for Python

    Project mention: Forced to move away from Django template because of nested forms ? | /r/django | 2022-07-07

    Forms are hard. We use python jsonschema to write our form schemas and validation and use react json schema form for the front end. It's a long time in the making and we still have to write widgets and extensions to get everything we need. Good luck.

  • Cerberus

    Lightweight, extensible data validation library for Python (by pyeve)

    Project mention: Do you think we need an open-source web scraping monitoring tool? | /r/webscraping | 2023-05-06

    I wrote scrapy-test as a proof of concept for validating live pages for scrapy spiders if you're looking for some reference but if you're not using scrapy I'd recommend just adding validation tests using data validation tools like cerberus which is super underrated. I cover popular data validation techniques on this short blog I wrote if you want to learn more.

  • schema

    Schema validation just got Pythonic

    Project mention: Python: Data validation | dev.to | 2023-01-20

    Validation using Schema

  • deepchecks

    Deepchecks - Tests for Continuous Validation of ML Models & Data. Deepchecks is a Python package for comprehensively validating your machine learning models and data with minimal effort.

    Project mention: [D] DL Practitioners, Do You Use Layer Visualization Tools s.a GradCam in Your Process? | /r/MachineLearning | 2022-10-28

  • Schematics

    Python Data Structures for Humans™.

  • pandera

    A light-weight, flexible, and expressive statistical data testing library

    Project mention: Unit testing functions that input/output dataframes? | /r/datascience | 2023-03-05

    I use Pandera, so I just need to define the expected input/output schemas (i.e. column names, types, and constraints on them), and Pandera automatically generates fake data for the unit tests, and validates the result: https://github.com/unionai-oss/pandera

  • voluptuous

    CONTRIBUTIONS ONLY: Voluptuous, despite the name, is a Python data validation library.

  • cleanvision

    Automatically find issues in image datasets and practice data-centric computer vision.

    Project mention: [D] Is accurately estimating image quality even possible? | /r/MachineLearning | 2023-04-22

    Github: https://github.com/cleanlab/cleanvision Blogpost: https://cleanlab.ai/blog/cleanvision/

  • objectiv-analytics

    Open-source product analytics infrastructure for data teams that want full control. Built for high quality data collection and ready to use for advanced analytics & ML.

    Project mention: Get tools to test, validate and debug your tracking instrumentation → Set up error-free user behavior tracking → No more missing/faulty data downstream. | /r/u_objectiv_io | 2022-09-19

  • colander

    A serialization/deserialization/validation library for strings, mappings and lists.

  • valideer

    Lightweight data validation and adaptation Python library.

  • Encord Active

    Open source active learning toolkit to find failure modes in your computer vision models, prioritize data to label next, and drive data curation to improve model performance.

    Project mention: Show HN: Open-source toolkit for ML model evaluation and active learning | news.ycombinator.com | 2023-05-09

  • python-codicefiscale

    Encoding, decoding and validation of Italian fiscal codes (Codice Fiscale).

  • laravel-validation

    Laravel-style (PHP) validation for Python

    Project mention: Is there a technique/package that allows me to do Laravel-style validation of POST data in Django? | /r/django | 2022-10-19

    My question: Does anything resembling Laravel's form validation exist for Django? I found django-laravel-validator, laravel-validation, and python_payload_validation, but none of them look suitable for real-world projects.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2023-05-09.
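
Several of the projects listed above (jsonschema, Cerberus, schema, pandera) are built around the same idea: declare the expected fields, types, and constraints once, then check incoming data against that declaration. The snippet below is a minimal standard-library sketch of that pattern only. It does not use or reproduce the API of any listed project, and the names `SCHEMA`, `Rule`, and `validate_record` are hypothetical, chosen for illustration.

```python
from typing import Any, Callable, Dict, Tuple

# Hypothetical rule format: (expected type, constraint check on the value).
Rule = Tuple[type, Callable[[Any], bool]]

# Hypothetical schema: field name -> rule. Not taken from any listed library.
SCHEMA: Dict[str, Rule] = {
    "name": (str, lambda v: len(v) > 0),
    "age": (int, lambda v: 0 <= v <= 150),
    "email": (str, lambda v: "@" in v),
}


def validate_record(record: Dict[str, Any], schema: Dict[str, Rule]) -> Dict[str, str]:
    """Return a mapping of field name -> error message; empty means valid."""
    errors: Dict[str, str] = {}
    for field, (expected_type, check) in schema.items():
        if field not in record:
            errors[field] = "required field is missing"
            continue
        value = record[field]
        if not isinstance(value, expected_type):
            # Report the type mismatch and skip the constraint check.
            errors[field] = (
                f"expected {expected_type.__name__}, got {type(value).__name__}"
            )
        elif not check(value):
            errors[field] = "constraint not satisfied"
    # Flag fields that the schema does not declare.
    for field in record.keys() - schema.keys():
        errors[field] = "unknown field"
    return errors


if __name__ == "__main__":
    print(validate_record({"name": "Ada", "age": 36, "email": "ada@example.com"}, SCHEMA))
    print(validate_record({"name": "", "age": "36", "extra": 1}, SCHEMA))
```

Returning every error at once, rather than raising on the first one, is a design choice in this sketch. The dedicated libraries go much further (nested documents, coercion, custom rules, dataframe checks), so for real projects one of them is likely a better fit than hand-rolled code like this.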

Index

What are some of the best open-source Data Validation projects in Python? This list will help you:

 #  Project               Stars
 1  jsonschema            4,150
 2  Cerberus              2,980
 3  schema                2,751
 4  deepchecks            2,694
 5  Schematics            2,558
 6  pandera               2,351
 7  voluptuous            1,769
 8  cleanvision             609
 9  objectiv-analytics      448
10  colander                437
11  valideer                262
12  Encord Active           204
13  python-codicefiscale     61
14  laravel-validation       10