yggdrasil-decision-forests
decision-forests
yggdrasil-decision-forests | decision-forests | |
---|---|---|
4 | 1 | |
428 | 651 | |
3.0% | 0.9% | |
9.5 | 8.3 | |
5 days ago | 11 days ago | |
C++ | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
yggdrasil-decision-forests
-
Why do tree-based models still outperform deep learning on tabular data? (2022)
Is it this library https://github.com/google/yggdrasil-decision-forests ?
-
Binary image classification using random forest algorithm
However if you know cpp you can use Yggdrasil https://github.com/google/yggdrasil-decision-forests.
- Why do tree-based models still outperform deep learning on tabular data?
-
[P] Tree compiler that speeds up LightGBM model inference by ~30x
Have you tried to compare with Yggdrasil, the decision forest engine (c++, both training and inference) powering TensorFlow Decision Forests ?
decision-forests
-
Why do tree-based models still outperform deep learning on tabular data?
I can't explain it, but I help maintain TensorFlow Decision Forests [1] and Yggdrasil Decision Forests [2], and in an AutoML system at work that trains models on lots of various users data, decision forest models gets selected as best (after AutoML tries various model types and hyperparameters) somewhere between 20% to 40% of the times, systematically. It's pretty interesting. Other ML types considered are NN, linear models (with auto feature crossings generation), and a couple of other variations.
[1] https://github.com/tensorflow/decision-forests
What are some alternatives?
LightGBM - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Spearmint - Spearmint Bayesian optimization codebase
tensorflow - An Open Source Machine Learning Framework for Everyone
srbench - A living benchmark framework for symbolic regression
decision-tree-classifier - Decision Tree Classifier and Boosted Random Forest
higgs-logistic-regression
flashlight - A C++ standalone library for machine learning [Moved to: https://github.com/flashlight/flashlight]
interpret - Fit interpretable models. Explain blackbox machine learning.
MLBenchmarks.jl - ML models benchmarks on public dataset