swiple
soda-core
swiple | soda-core | |
---|---|---|
1 | 5 | |
78 | 1,768 | |
- | 2.5% | |
0.0 | 8.9 | |
5 days ago | 6 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
swiple
-
FastAPI + Poetry Docker Image, 3.7x size reduction
Just finished modifying a FastAPI + Poetry Docker image and reduced the image size by 3.7x. The Docker image originated from Jason Adam and thought other might find it valuable. All the code can be found here: https://github.com/Swiple/swiple/blob/main/backend/Dockerfile
soda-core
- Looking for Unit Testing framework in Database Migration Process
-
Data profiling tools / approaches?
Tools like Soda Core could be really helpful for this. For example, it allows you to set up a change over time threshold which could take the form of: change avg last 3 for missing_count(column_name) < 20%
-
Data QC? Great Expectations?
You can give https://github.com/sodadata/soda-core - open source and (in my opinion) easy to get a lot of value with minimum effort.
- Show HN: Soda Core is now GA – Test data like you would test your code
-
Soda Core (OSS) is now GA! So, why should you add checks to your data pipelines?
Give Soda Core a try! It's really easy. If you only have 2 minutes, check out our docs or interactive demo (pretty cool no?). If you have a bit more time, install it and give it a spin! Want to look at it later? Star on Github. Got stuck? As in our Slack community.
What are some alternatives?
Flight-Test-Data-Analytics-Module-01 - Code to support Module 01 of the Daedalus Aerospace Flight Test Data Analytics course.
great_expectations - Always know what to expect from your data.
soda-sql - Data profiling, testing, and monitoring for SQL accessible data.
dbt-data-reliability - dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
data-diff - Compare tables within or across databases
dictum - Describe business metrics with YAML, query and visualize in Jupyter with zero SQL
cleanlab - The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
cuallee - Possibly the fastest DataFrame-agnostic quality check library in town.
panda_patrol
dbt-snowflake-monitoring - A dbt package from SELECT to help you monitor Snowflake performance and costs
pointblank - Data quality assessment and metadata reporting for data frames and database tables