data_check
data and pipeline testing with and for SQL (by andrjas)
data-validator
A tool to validate data, built around Apache Spark. (by target)
data_check | data-validator | |
---|---|---|
1 | 2 | |
4 | 98 | |
- | - | |
8.3 | 7.4 | |
about 2 months ago | 21 days ago | |
Python | Scala | |
MIT License | GNU General Public License v3.0 or later |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
data_check
Posts with mentions or reviews of data_check.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-03-18.
-
Anyone aware of any Data Validation Framework with custom SQL capability
Maybe this can help: https://github.com/andrjas/data_check
data-validator
Posts with mentions or reviews of data-validator.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-03-18.
What are some alternatives?
When comparing data_check and data-validator you can also consider the following projects:
soda-sql - Data profiling, testing, and monitoring for SQL accessible data.
F2-Data-Pipeline - Pipeline for Automated Updates of Kaggle's "Formula 2 Dataset"
mmlspark - Simple and Distributed Machine Learning [Moved to: https://github.com/microsoft/SynapseML]
data-caterer - Data generation and validation tool for any data source
HAMB