SaaSHub helps you find the best software and product alternatives Learn more →
HTML data-quality Projects
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
Project mention: Open-Source Observability for the Semantic Layer | news.ycombinator.com | 2024-01-16Think of Datadrift as a simple & open-source Monte Carlo for the semantic layer era. The repo is at https://github.com/data-drift/data-drift
Datadrift started as an internal tool built at our former company, a large European B2B Fintech. We had data reliability challenges impacting key metrics used for financial and regulatory reporting.
However, when we tried existing data quality tools we where always frustrated. They provide row-level static testing (eg. uniqueness or nullness) which does not address time-varying metrics like revenues. And commercial observability solutions costs $manyK a month and brings compliance and security overhead.
We designed Datadrift to solve these problems. Datadrift works by simply adding a monitor where your metric is computed. It then understands how your metric is computed and on which upstream tables it depends. When an issue occurs, it pinpoints exactly which rows have been updated and introducing the change.
You can also set up alerting and customise it. For example, you can decide to open and assign an Github issue to the analyst owning the revenue metric when a +10% change is detected. We tried to make it easy to customise and developer friendly.
We are thinking of adding features around root cause analysis automation/issues pattern analysis to help data teams improve metrics quality overtime. We’d love to hear your feature requests.
Datadrift is built with Python and Go, and licensed under GPL. Our docs are here: https://github.com/data-drift/data-drift?tab=readme-ov-file#...
Dev set up and demo : https://app.claap.io/sammyt/drift-db-demo-a18-c-ApwBh9kt4p-0...
We’re very eager to get your feedback!
HTML data-quality related posts
- Open-Source Observability for the Semantic Layer
- How to design a software for extracting and validating data in existing DB(s)
- whylogs: The open standard for data logging
- Sentry for Data Teams
- How are you guys testing your data?
- I am Alessya Visnjic, co-founder and CEO of WhyLabs. I am here to talk about MLOps, AI Observability and our recent product announcements. Ask me anything!
- Machine learning’s crumbling foundations – by Cory Doctorow
-
A note from our sponsor - SaaSHub
www.saashub.com | 26 Apr 2024
Index
Project | Stars | |
---|---|---|
1 | re_data | 1,521 |
2 | data-drift | 298 |
Sponsored