[D] Best methods for imbalanced multi-class classification with high dimensional, sparse predictors

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

  1. CloudForest

    Ensembles of decision trees in Go/golang.

    The best method I've seen for dealing with this bias is to create "artificial contrasts": include possibly many permuted copies of each feature, then run a statistical test comparing each feature's random forest importance values against those of its shuffled contrasts. This method is described here: https://www.jmlr.org/papers/volume10/tuv09a/tuv09a.pdf and there is an implementation here: https://github.com/ryanbressler/CloudForest

  2. nodevectors

    Fastest network node embeddings in the west

    The best candidates for it would be UMAP or graph embedding methods.
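
The "artificial contrasts" idea quoted under CloudForest can be sketched in a few lines. This is only a rough illustration under stated assumptions, not the procedure from the linked paper and not CloudForest's implementation: to stay dependency-free it swaps the random forest importance for a single-split Gini decrease (`stump_importance`, a hypothetical stand-in), and it swaps the paper's statistical test for a plain empirical p-value over the shuffled copies. The names `contrast_p_values` and `make_demo` and the toy data are made up for the example.

```python
import random


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = {}
    for label in labels:
        counts[label] = counts.get(label, 0) + 1
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def stump_importance(column, labels):
    """Best Gini decrease from one threshold split on a single feature.

    Assumption: this stands in for a random forest importance score.
    """
    n = len(labels)
    parent = gini(labels)
    pairs = sorted(zip(column, labels))
    best = 0.0
    for i in range(1, n):
        if pairs[i][0] == pairs[i - 1][0]:
            continue  # no threshold fits between equal values
        left = [label for _, label in pairs[:i]]
        right = [label for _, label in pairs[i:]]
        gain = parent - (i / n) * gini(left) - ((n - i) / n) * gini(right)
        best = max(best, gain)
    return best


def contrast_p_values(X, y, n_contrasts=200, seed=0):
    """Empirical p-value per feature against its shuffled contrasts.

    For each feature, build n_contrasts permuted copies and count how
    often a copy scores at least as high as the real feature.
    """
    rng = random.Random(seed)
    p_values = []
    for j in range(len(X[0])):
        column = [row[j] for row in X]
        real = stump_importance(column, y)
        hits = 0
        for _ in range(n_contrasts):
            contrast = column[:]
            rng.shuffle(contrast)  # breaks any link to the labels
            if stump_importance(contrast, y) >= real:
                hits += 1
        # add-one correction keeps the estimate strictly above zero
        p_values.append((hits + 1) / (n_contrasts + 1))
    return p_values


def make_demo(n=60, seed=1):
    """Toy data: feature 0 tracks the label, feature 1 is pure noise."""
    rng = random.Random(seed)
    y = [i % 2 for i in range(n)]
    X = [[label + rng.gauss(0, 0.3), rng.gauss(0, 1)] for label in y]
    return X, y


if __name__ == "__main__":
    X, y = make_demo()
    for j, p in enumerate(contrast_p_values(X, y)):
        print(f"feature {j}: p = {p:.3f}")
```

A feature whose real importance rarely gets beaten by its own shuffled copies gets a small p-value. With an actual forest the contrasts would be added as extra columns and scored inside the same ensemble, so they compete with the real features for splits, which this single-feature stand-in does not capture.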

