- whylogs
  An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
This is why we've been encouraging people to think about lightweight data logging as a mitigation for data quality problems. We monitor applications with Prometheus, and ML monitoring deserves the same rigor.
Disclaimer: I'm one of the authors. We've put a lot of effort into building a standard for data logging here: https://github.com/whylabs/whylogs. It's meant to be a lightweight, open standard for collecting statistical signatures of your data without having to run SQL or other expensive analysis.
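To make "statistical signatures" a bit more concrete, here is a minimal sketch of the general idea in plain Python. It is not the whylogs API: `ColumnProfile` and `track` are hypothetical names invented for illustration, and the real library tracks far more than this. The sketch only shows how a handful of summary statistics can be maintained in a single pass, without storing raw values or running a query afterwards.

```python
import math
from dataclasses import dataclass


@dataclass
class ColumnProfile:
    """Hypothetical single-pass summary of one numeric column (not the whylogs API)."""
    count: int = 0
    nulls: int = 0
    minimum: float = math.inf
    maximum: float = -math.inf
    mean: float = 0.0
    _m2: float = 0.0  # running sum of squared deviations (Welford's method)

    def track(self, value):
        """Update the summary with one value; the raw value itself is not kept."""
        if value is None:
            self.nulls += 1
            return
        self.count += 1
        self.minimum = min(self.minimum, value)
        self.maximum = max(self.maximum, value)
        delta = value - self.mean
        self.mean += delta / self.count
        self._m2 += delta * (value - self.mean)

    @property
    def stddev(self):
        """Sample standard deviation; 0.0 when fewer than two values were seen."""
        if self.count < 2:
            return 0.0
        return math.sqrt(self._m2 / (self.count - 1))


# Example: profile a small stream of latency readings, one of them missing.
profile = ColumnProfile()
for latency_ms in [12.0, 15.0, None, 11.0, 14.0]:
    profile.track(latency_ms)

print(profile.count, profile.nulls, profile.minimum, profile.maximum)
```

Because each update touches only a few numbers, a summary like this can be emitted alongside regular application metrics and compared across time windows to spot drift or a spike in missing values.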
Related posts
- whylogs: The open standard for data logging
- I am Alessya Visnjic, co-founder and CEO of WhyLabs. I am here to talk about MLOps, AI Observability and our recent product announcements. Ask me anything!
- Launching end-to-end data quality platform
- Show HN: Snowflake Data Quality Checks in Python
- Open-Source Observability for the Semantic Layer