datacompy
popmon
Our great sponsors
datacompy | popmon | |
---|---|---|
4 | 1 | |
382 | 486 | |
8.9% | 1.0% | |
7.4 | 6.9 | |
6 days ago | 3 months ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
datacompy
- How to Check 2 SQL Tables Are the Same
-
Comparing 2 CSV files
datacompy is a package to compare 2 pandas dataframes
- Performing Data Tests on External Data/Complex Data Quality Checks
-
Best Practice When Comparing Data Across Two SQL Servers in Python
https://github.com/capitalone/datacompy will allow you to compare two tables/dataframes against one another, and see detailed results on the difference.
popmon
What are some alternatives?
koalas - Koalas: pandas API on Apache Spark
sweetviz - Visualize and compare datasets, target values and associations, with one line of code.
data-science-ipython-notebooks - Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
cape-dataframes - Privacy transformations on Spark and Pandas dataframes backed by a simple policy language.
data-diff - Compare tables within or across databases
ydata-profiling - 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
dbt-audit-helper - Useful macros when performing data audits
visualiza - A general-purpose dynamic data visualizer.
lifetimes - Lifetime value in Python
diffable-sql
Optimus - :truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark