Advice on a Data Quality framework

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering

  • eurybia

    ⚓ Eurybia monitors model drift over time and secures model deployment with data validation

  • So we just trained a model to try and do the same, and then sort of read its entrails through Shapash. The more it can tell the difference, the more your data has changed. We can know which variable has changed the most, and how important it is to our models. If all else fails (and also if all else works), we can still know (again, this is all quantified in some way, we need numbers, not eyeballings) how much our models' predictions have evolved over time, independently of particular data changes, legit or not. How can our models' predictions change if the data is all clean, you ask? I mean I asked, but you would have too, in my shoes. What lies beyond data engineering? What is the meaning of life? The answer is concept drift, and that's what we're starting to work on now that we have a good grasp on data drift. Anyways, the tool is Eurybia. If any part of my ramblings resembles some of your work, please give it a try and chat us up here or through the repo, we are of course very eager to get feedback and possibly even contributions, who knows. See ya!
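
The idea in the post (train a model to tell an old dataset from a new one, then inspect it) can be sketched in a few lines. This is only an illustration, not Eurybia's API: it swaps in a hand-rolled logistic regression for the real classifier and uses absolute coefficients in place of Shapash explanations. The `age` and `income` columns, the `make_sample` helper, and all numeric settings are made up for the example. A held-out AUC near 0.5 means the model cannot tell the two datasets apart; the closer it gets to 1.0, the more the data has changed. A linear model like this one mostly picks up shifts in the mean.

```python
import math
import random


def standardize(rows):
    """Scale each column to zero mean and unit variance."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]
    return [[(v - m) / s for v, m, s in zip(r, means, stds)] for r in rows]


def predict(weights, bias, x):
    """Probability that row x comes from the 'current' dataset."""
    z = bias + sum(w * v for w, v in zip(weights, x))
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(rows, labels, epochs=200, lr=0.5):
    """Logistic regression fitted with plain batch gradient descent."""
    weights, bias, n = [0.0] * len(rows[0]), 0.0, len(rows)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(weights), 0.0
        for x, y in zip(rows, labels):
            err = predict(weights, bias, x) - y
            grad_w = [g + err * v for g, v in zip(grad_w, x)]
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def auc(scores, labels):
    """Share of (current, baseline) pairs ranked correctly; ties count half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def drift_report(baseline, current, names):
    """Return (held-out AUC, {feature: |coefficient|}) for baseline vs current."""
    rows = standardize(baseline + current)
    labels = [0] * len(baseline) + [1] * len(current)
    # Even-indexed rows train the classifier, odd-indexed rows evaluate it.
    train = [i for i in range(len(rows)) if i % 2 == 0]
    test = [i for i in range(len(rows)) if i % 2 == 1]
    weights, bias = fit_logistic([rows[i] for i in train],
                                 [labels[i] for i in train])
    scores = [predict(weights, bias, rows[i]) for i in test]
    score = auc(scores, [labels[i] for i in test])
    return score, {name: abs(w) for name, w in zip(names, weights)}


def make_sample(rng, n, income_mean):
    """Hypothetical data: 'age' is stable, 'income' may shift between runs."""
    return [[rng.gauss(40, 10), rng.gauss(income_mean, 500)] for _ in range(n)]


rng = random.Random(0)
baseline = make_sample(rng, 300, 3000)
current = make_sample(rng, 300, 3600)  # simulated drift on 'income' only
score, importance = drift_report(baseline, current, ["age", "income"])
print(f"drift AUC: {score:.2f}")
print(importance)
```

The feature with the largest coefficient is the one the classifier leans on most to separate the two datasets, which is a rough stand-in for "which variable has changed the most".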

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.


Related posts

  • Impact of Input Length on the Reasoning Performance of Large Language Models

    1 project | news.ycombinator.com | 1 May 2024
  • Kolmogorov-Arnold Networks

    4 projects | news.ycombinator.com | 30 Apr 2024
  • Quick tip: Write numpy arrays directly to the SingleStore VECTOR data type

    1 project | dev.to | 1 May 2024
  • Navigating the Risky Waters of Loan Defaults: A Predictive Beacon

    1 project | dev.to | 30 Apr 2024
  • Alternative Chunking Methods

    1 project | news.ycombinator.com | 30 Apr 2024