SaaSHub helps you find the best software and product alternatives Learn more β
Top 11 dataquality Open-Source Projects
-
OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
-
soda-core
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
fastapi-greatexpectations
Run greatexpectations.io on ANY SQL Engine using REST API. Supported by FastAPI, Pydantic and SQLAlchemy as best data quality tool
-
setup-duckdb-action
π¦ Blazing Fast and highly customizable Github Action to setup a DuckDb runtime
Project mention: How to Dynamically Adjust the Height of a Textarea in ReactJS | dev.to | 2023-10-25In this blog post, I have demonstrated how I addressed the challenge of dynamically adjusting the height of a textarea element based on its content, preventing the need for vertical scrolling in the title section of the OpenMetadata Knowledge article page.
If the issue happen a lot, there is also: https://github.com/datafold/data-diff
That is a nice tool to do it cross database as well.
I think it's based on checksum method.
Project mention: Show HN: Snowflake Data Quality Checks in Python | news.ycombinator.com | 2024-02-11
View on GitHub
dataquality related posts
-
How to Dynamically Adjust the Height of a Textarea in ReactJS
-
Blog - Project Nessie: A Look in the Depths
-
What is your favorite data catalog?
-
Data Governance Hands On with Amazon DataZone
-
What OSS are you using for data contracts?
-
Data Quality at Scale with Great Expectations, Spark, and Airflow on EMR
-
Thoughts around decube.io (data observability and catalog platform)
-
A note from our sponsor - SaaSHub
www.saashub.com | 12 May 2024
Index
What are some of the best open-source dataquality projects? This list will help you:
Project | Stars | |
---|---|---|
1 | great_expectations | 9,497 |
2 | OpenMetadata | 4,227 |
3 | deequ | 3,138 |
4 | data-diff | 2,862 |
5 | soda-core | 1,768 |
6 | re_data | 1,527 |
7 | zingg | 886 |
8 | cuallee | 110 |
9 | fastapi-greatexpectations | 12 |
10 | setup-duckdb-action | 5 |
11 | data_check | 4 |
Sponsored