[D] Which statistical test would you use to detect drift in a dataset of images?

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

alibi-detect

9 2,082 7.6 Python

Algorithms for outlier, adversarial and drift detection

Wasserstein distance is not very suitable for drift detection on most problems given that the sample complexity (and estimation error) scales with O(n^(-1/d)) with n the number of instances (100k-10m in your case) and d the feature dimension (192 in your case). More interesting will be to use for instance a detector based on the maximum mean discrepancy (MMD) with estimation error of O(n^(-1/2)). Notice the absence of the feature dimension here. You can find scalable implementations in Alibi Detect (disclosure: I am a contributor): MMD docs, image example. We just added the KeOps backend for the MMD detector to scale and speed up the drift detector further, so if you install from master, you can leverage this backend and easily scale the detector to 1mn instances on e.g. 1 RTX2080Ti GPU. Check this example for more info.

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

[D] Distributions to represent an Image Dataset
1 project | /r/MachineLearning | 24 Feb 2023
What Machine Learning model monitoring tools can you recommend?
1 project | /r/mlops | 2 Dec 2021
[D] Is this a reasonable assumption in machine learning?
1 project | /r/MachineLearning | 5 Jul 2021
[D] How do you deal with covariate shift and concept drift in production?
2 projects | /r/MachineLearning | 28 Oct 2021
Looking for recommendations to monitor / detect data drifts over time
3 projects | /r/datascience | 15 Apr 2023

[D] Which statistical test would you use to detect drift in a dataset of images?

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning
anomaly outlier concept-drift Detection unsupervised-learning
Post date: 24 Aug 2022

alibi-detect

InfluxDB

Related posts

[D] Which statistical test would you use to detect drift in a dataset of images?

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning anomaly outlier concept-drift Detection unsupervised-learning Post date: 24 Aug 2022

alibi-detect

InfluxDB

Related posts

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning
anomaly outlier concept-drift Detection unsupervised-learning
Post date: 24 Aug 2022