Advice on a Data Quality framework

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

eurybia

3 203 5.1 Jupyter Notebook

⚓ Eurybia monitors model drift over time and securizes model deployment with data validation

So we just trained a model to try and do the same, and then sort of read its entrails through Shapash. The more it can tell the difference, the more your data has changed. We can know which variable has changed the most, and how much it's important to our models. If all else fails (and also if all else works), we can still know (again, this is all quantified in some way, we need numbers, not eyeballings) how much our models predictions have evolved over time, independantly of particular data changes, legit or not. How can our models predictions change if the data is all clean, you ask ? I mean I asked, but you would have too, in my shoes. What lies beyond data engineering ? What is the meaning of life ? The answer is concept drift, and that's where we're starting to work on now that we have a good grasp on data drift. Anyways, the tool is Eurybia. If any part of my ramblings resemble some of your work, please give it a try and chat us up here or through the repo, we are of course very eager to get feedbacks and possibly even contributions, who knows. See ya !

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Impact of Input Length on the Reasoning Performance of Large Language Models

1 project | news.ycombinator.com | 1 May 2024
Kolmogorov-Arnold Networks

4 projects | news.ycombinator.com | 30 Apr 2024
Quick tip: Write numpy arrays directly to the SingleStore VECTOR data type

1 project | dev.to | 1 May 2024
Navigating the Risky Waters of Loan Defaults: A Predictive Beacon

1 project | dev.to | 30 Apr 2024
Alternative Chunking Methods

1 project | news.ycombinator.com | 30 Apr 2024

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering Post date: 18 May 2022

eurybia

InfluxDB

Related posts

Impact of Input Length on the Reasoning Performance of Large Language Models

Kolmogorov-Arnold Networks

Quick tip: Write numpy arrays directly to the SingleStore VECTOR data type

Navigating the Risky Waters of Loan Defaults: A Predictive Beacon

Alternative Chunking Methods