Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 23 Data Validation Open-Source Projects
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
ajv
The fastest JSON schema Validator. Supports JSON Schema draft-04/06/07/2019-09/2020-12 and JSON Type Definition (RFC8927)
-
cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
-
deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
soda-core
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
-
Encord Active
Open source active learning toolkit to find failure modes in your computer vision models, prioritize data to label next, and drive data curation to improve model performance.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Formik and Yup empower you to build robust and user-friendly forms in React. By leveraging their capabilities, you can streamline form management, reduce boilerplate code, and ensure a smooth user experience with clear and effective validation. Refer to the official documentation of Formik https://formik.org/ and Yup https://github.com/jquense/yup for in-depth exploration and advanced use cases.
Project mention: Framework Interoperable Component Libraries Using Lit Web Components. | dev.to | 2023-10-08I've been very passionate about a project called react-jsonschema-form (github, editor). I personally hate writing forms, and love the idea of serializable components, schema, validation all in one. I've always wanted an alternative to this project that offered an alternative to react, and possibly the ability to render a schema form to static HTML (like ssg).
Project mention: Popular Libraries For Building Type-safe Web Application APIs | dev.to | 2024-04-07Ajv’s documentation is available here.
Project mention: [Research] Detecting Annotation Errors in Semantic Segmentation Data | /r/MachineLearning | 2023-11-05We have feely open-sourced our new method for improving segmentation data, published a paper on the research behind it, and released a 5-min code tutorial. You can also read more in the blog if you'd like.
Project mention: Popular Libraries For Building Type-safe Web Application APIs | dev.to | 2024-04-07You can check out Superstruct documentation here.
Project mention: Detect, Defend, Prevail: Payments Fraud Detection using ML & Deepchecks | dev.to | 2024-01-13Also if you have any confusion related to it. You can directly go to their discussion section in github :
Project mention: Show HN: Config-file-validator – CLI tool to validate all your config files | news.ycombinator.com | 2023-09-29I was expecting this to validate the configuration files are also valid for their use cases, not just valid JSON, TOML, etc.
If you're looking for that and Python is your jam, the library cerberus[0] is very good at it.
[0]: https://github.com/pyeve/cerberus
Luckily, there is a large pool of community wisdom around and outside of Rails which may help us a lot here. Instead of inventing our own wheel for now we will use one invented before us by others. Pretty much sure you have seen this magic used outside of Hogwarts before: https://dry-rb.org/gems/dry-validation.
(1) You might want to check out https://github.com/t-kalinowski/Rapp by my colleague Tomasz
(2) I think part of that is in scope for strict (https://github.com/hadley/strict). You might also be well served by adopting some more data validation tooling, e.g. pointblank (https://rstudio.github.io/pointblank/).
Project mention: Launch HN: Encord (YC W21) – Unit testing for computer vision models | news.ycombinator.com | 2024-01-31We base our pricing on your user and consumption scale and would be happy to discuss this with you directly. Please feel free to explore the OS version of Active at https://github.com/encord-team/encord-active. Note that some features, such as natural language search using GPU accelerated APIs, are not included in the cloud version.
Data Validation related posts
- Converting React Forms to Formik and Yup
- Crafting Forms in React: Vanilla vs. React Hook Form vs. Formik
- Popular Libraries For Building Type-safe Web Application APIs
- Ask HN: Looking for a project to contribute to? (April 2024)
- Lessons from open-source: Replace zod with superstruct if you do not use zod’s advanced capabilities
- Show HN: Data Caterer – Data generation and validation tool
- Show HN: Data Caterer – Data generation and validation tool
-
A note from our sponsor - InfluxDB
www.influxdata.com | 28 Apr 2024
Index
What are some of the best open-source Data Validation projects? This list will help you:
Project | Stars | |
---|---|---|
1 | Yup | 22,201 |
2 | react-jsonschema-form | 13,630 |
3 | ajv | 13,383 |
4 | cleanlab | 8,651 |
5 | Superstruct | 6,810 |
6 | jsonschema | 4,432 |
7 | deepchecks | 3,350 |
8 | Cerberus | 3,108 |
9 | pandera | 3,007 |
10 | schema | 2,831 |
11 | Schematics | 2,571 |
12 | JSON-Splora | 1,862 |
13 | voluptuous | 1,801 |
14 | soda-core | 1,751 |
15 | forgJs | 1,666 |
16 | dry-validation | 1,315 |
17 | tv4 | 1,161 |
18 | is-my-json-valid | 955 |
19 | cleanvision | 921 |
20 | pointblank | 826 |
21 | schema-inspector | 504 |
22 | colander | 440 |
23 | Encord Active | 420 |
Sponsored