Top 5 imbalanced-data Open-Source Projects
-
smote_variants
A collection of 85 minority oversampling techniques (SMOTE) for imbalanced learning with multi-class oversampling and model selection features
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
multi-domain-imbalance
[ECCV 2022] Multi-Domain Long-Tailed Recognition, Imbalanced Domain Generalization, and Beyond
-
xrays-and-gradcam
Classification and Gradient-based Localization of Chest Radiographs using PyTorch.
-
radius-constrained-kmeans
Codes for "No More Than 6FT Apart: Robust K-Means via Radius Upper Bounds", ICASSP 2022
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: HIGHLY unbalanced dataset (>600:1 negative:positive examples), how do I deal with this? | /r/learnmachinelearning | 2023-06-06You can try data augmentation approaches (e.g., smote-variants) or synthetic data generation (e.g., ydata-synthetic). Based on the ratio, I would also try learning the characteristics of you majority class and then generate a smaller sample for it (undersampling).
imbalanced-data related posts
Index
What are some of the best open-source imbalanced-data projects? This list will help you:
Project | Stars | |
---|---|---|
1 | imbalanced-regression | 757 |
2 | smote_variants | 596 |
3 | multi-domain-imbalance | 117 |
4 | xrays-and-gradcam | 44 |
5 | radius-constrained-kmeans | 2 |
Sponsored