Metric | flytekit | datacompy |
---|---|---|
Mentions | 2 | 4 |
Stars | 205 | 386 |
Growth | 3.9% | 8.8% |
Activity | 9.7 | 7.5 |
Last commit | 3 days ago | 6 days ago |
Language | Python | Python |
License | Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
flytekit
- From Incubation to Graduation, and Beyond: FlytePath
  Modin: Speeds up Pandas
- Release the TextHTMLPress package to PyPI
  Based on references on setting up a Python project, package structure, and a production-level Python package, I refactored the package as shown below:
datacompy
- How to Check 2 SQL Tables Are the Same
- Comparing 2 CSV files
  datacompy is a package to compare 2 pandas dataframes
- Performing Data Tests on External Data/Complex Data Quality Checks
- Best Practice When Comparing Data Across Two SQL Servers in Python
  https://github.com/capitalone/datacompy will allow you to compare two tables/dataframes against one another, and see detailed results on the difference.
What are some alternatives?
pymilvus - Python SDK for Milvus.
koalas - Koalas: pandas API on Apache Spark
caer - High-performance Vision library in Python. Scale your research, not boilerplate.
data-science-ipython-notebooks - Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
pan-os-python - The PAN-OS SDK for Python is a package to help interact with Palo Alto Networks devices (including physical and virtualized Next-generation Firewalls and Panorama). The pan-os-python SDK is object oriented and mimics the traditional interaction with the device via the GUI or CLI/API.
data-diff - Compare tables within or across databases
popmon - Monitor the stability of a Pandas or Spark dataframe ⚙︎
dbt-audit-helper - Useful macros when performing data audits
warehouse - The Python Package Index
visualiza - A general-purpose dynamic data visualizer.
TextHTMLPress - A command-line static site generator for generating a complete HTML web site from raw data and files.