popmon
datacompy
Metric | popmon | datacompy
---|---|---
Mentions | 1 | 4
Stars | 486 | 386
Growth | 0.6% | 8.8%
Activity | 6.9 | 7.5
Latest commit | 3 months ago | 1 day ago
Language | Python | Python
License | MIT License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
popmon
datacompy
- How to Check 2 SQL Tables Are the Same
- Comparing 2 CSV files
  datacompy is a package to compare 2 pandas dataframes
- Performing Data Tests on External Data/Complex Data Quality Checks
- Best Practice When Comparing Data Across Two SQL Servers in Python
  https://github.com/capitalone/datacompy will allow you to compare two tables/dataframes against one another, and see detailed results on the difference.
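The posts above all revolve around the same task: joining two tables on a key and reporting what differs. As a rough illustration of that idea only, here is a minimal standard-library sketch. It is not datacompy's API; the function name `compare_rows`, the sample CSV contents, and the `id` key column are all hypothetical, and a dedicated library will produce far more detailed reports.

```python
import csv
import io


def compare_rows(left_rows, right_rows, key):
    """Compare two lists of dict rows joined on `key` (hypothetical helper).

    Returns (only_left, only_right, mismatches), where mismatches maps each
    shared key to {column: (left_value, right_value)}.
    If a key appears more than once in a table, the last row wins.
    """
    left = {row[key]: row for row in left_rows}
    right = {row[key]: row for row in right_rows}

    # Keys present in one table but not the other.
    only_left = sorted(left.keys() - right.keys())
    only_right = sorted(right.keys() - left.keys())

    # For shared keys, record every column whose values differ.
    mismatches = {}
    for k in sorted(left.keys() & right.keys()):
        columns = (left[k].keys() | right[k].keys()) - {key}
        diffs = {
            col: (left[k].get(col), right[k].get(col))
            for col in sorted(columns)
            if left[k].get(col) != right[k].get(col)
        }
        if diffs:
            mismatches[k] = diffs
    return only_left, only_right, mismatches


# Hypothetical sample data standing in for two CSV files.
CSV_A = "id,name,amount\n1,alice,10\n2,bob,20\n3,carol,30\n"
CSV_B = "id,name,amount\n1,alice,10\n2,bob,25\n4,dave,40\n"

rows_a = list(csv.DictReader(io.StringIO(CSV_A)))
rows_b = list(csv.DictReader(io.StringIO(CSV_B)))

only_a, only_b, diffs = compare_rows(rows_a, rows_b, key="id")
print("only in A:", only_a)
print("only in B:", only_b)
print("mismatched values:", diffs)
```

Values are compared as strings here because `csv.DictReader` does not infer types; a real comparison would usually parse numbers and allow a tolerance for floating-point columns.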
What are some alternatives?
sweetviz - Visualize and compare datasets, target values and associations, with one line of code.
koalas - Koalas: pandas API on Apache Spark
cape-dataframes - Privacy transformations on Spark and Pandas dataframes backed by a simple policy language.
data-science-ipython-notebooks - Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
ydata-profiling - 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
data-diff - Compare tables within or across databases
dbt-audit-helper - Useful macros when performing data audits
lifetimes - Lifetime value in Python
visualiza - A general-purpose dynamic data visualizer.
Optimus - Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
diffable-sql