datacompy vs dbt-audit-helper
Metric | datacompy | dbt-audit-helper
---|---|---
Mentions | 4 | 2
Stars | 386 | 283
Growth | 8.8% | 3.9%
Activity | 7.5 | 7.9
Last commit | 5 days ago | 17 days ago
Language | Python |
License | Apache License 2.0 | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
datacompy
- How to Check 2 SQL Tables Are the Same
- Comparing 2 CSV files
  datacompy is a package to compare 2 pandas dataframes
- Performing Data Tests on External Data/Complex Data Quality Checks
- Best Practice When Comparing Data Across Two SQL Servers in Python
  https://github.com/capitalone/datacompy will allow you to compare two tables/dataframes against one another, and see detailed results on the difference.
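To make "detailed results on the difference" concrete, here is a minimal sketch of a key-based comparison of two CSV files. It uses only the Python standard library and does not call datacompy; the function name `compare_csv`, the sample data, and the result layout are assumptions made for illustration. datacompy itself works on pandas dataframes and produces a much richer report.

```python
import csv
import io


def compare_csv(text_a, text_b, key):
    """Compare two CSV documents row by row, joined on the column `key`.

    Hypothetical helper for illustration only; values are compared as strings.
    """
    rows_a = {row[key]: row for row in csv.DictReader(io.StringIO(text_a))}
    rows_b = {row[key]: row for row in csv.DictReader(io.StringIO(text_b))}

    # Keys that appear on one side only.
    only_in_a = sorted(rows_a.keys() - rows_b.keys())
    only_in_b = sorted(rows_b.keys() - rows_a.keys())

    # For keys present on both sides, record each column of A whose value differs in B.
    mismatches = {}
    for k in sorted(rows_a.keys() & rows_b.keys()):
        diff = {
            col: (rows_a[k][col], rows_b[k].get(col))
            for col in rows_a[k]
            if rows_a[k][col] != rows_b[k].get(col)
        }
        if diff:
            mismatches[k] = diff

    return {"only_in_a": only_in_a, "only_in_b": only_in_b, "mismatches": mismatches}


# Made-up sample data: row 2 changed, row 3 was dropped, row 4 was added.
OLD_CSV = "id,amount\n1,10\n2,20\n3,30\n"
NEW_CSV = "id,amount\n1,10\n2,25\n4,40\n"

result = compare_csv(OLD_CSV, NEW_CSV, key="id")
print(result)
```

The sketch keeps three separate buckets (rows only in the first file, rows only in the second, and per-column mismatches for shared keys) because those are the questions a table comparison usually has to answer.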
dbt-audit-helper
- How to Check 2 SQL Tables Are the Same
  In case you haven't tried dbt (www.getdbt.com / "Data Build Tool") - there's a whole package ecosystem that solves for things like this. The one that came to mind is "dbt-audit-helper": https://github.com/dbt-labs/dbt-audit-helper#compare_relatio...
  It's kind of like PyPI/DockerHub for SQL. Lots of cool stuff in there...here's a link to the package hub: https://hub.getdbt.com/
- What do you like and dislike about your job as a DE?
  Cool, I’ve never used row(), I’ll definitely check that out! I’ve tried the dbt audit helper package and I liked it except that I really don’t like the dbt cloud console. Anyone know how to use that package in a local shell?
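For the SQL side of "are these two tables the same", the underlying idea can be sketched with plain set operations. The example below is not the dbt-audit-helper macro and does not use dbt; it runs against an in-memory SQLite database from the Python standard library, and the table names, column names, and sample rows are assumptions made for illustration.

```python
import sqlite3

# In-memory database with two made-up tables that should hold the same rows.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE table_a (id INTEGER, amount INTEGER);
    CREATE TABLE table_b (id INTEGER, amount INTEGER);
    INSERT INTO table_a VALUES (1, 10), (2, 20), (3, 30);
    INSERT INTO table_b VALUES (1, 10), (2, 25), (4, 40);
    """
)

# Classify whole rows: present in both tables, only in A, or only in B.
QUERY = """
    WITH a_only AS (SELECT * FROM table_a EXCEPT SELECT * FROM table_b),
         b_only AS (SELECT * FROM table_b EXCEPT SELECT * FROM table_a),
         shared AS (SELECT * FROM table_a INTERSECT SELECT * FROM table_b)
    SELECT 1 AS in_a, 1 AS in_b, COUNT(*) AS row_count FROM shared
    UNION ALL
    SELECT 1, 0, COUNT(*) FROM a_only
    UNION ALL
    SELECT 0, 1, COUNT(*) FROM b_only
"""

summary = conn.execute(QUERY).fetchall()
for in_a, in_b, row_count in summary:
    print(f"in_a={in_a} in_b={in_b} rows={row_count}")
```

The tables match exactly when the two one-sided counts are both zero. Note that `EXCEPT` and `INTERSECT` treat rows as sets, so duplicate rows are collapsed; a table with repeated rows would need a grouped count instead.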
What are some alternatives?
koalas - Koalas: pandas API on Apache Spark
merkle-tree-solidity - JS - Solidity sha3 merkle tree bridge. Generate proofs in JS; verify in Solidity.
data-science-ipython-notebooks - Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
data-diff - Compare tables within or across databases
dbhub.io - A "Cloud" for SQLite databases. Collaborative development for your data. 😊
visualiza - A general-purpose dynamic data visualizer.
handy_sql_queries
popmon - Monitor the stability of a Pandas or Spark dataframe ⚙︎
diffable-sql
blog