Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression. Learn more →
Top 14 Python random-forest Projects
-
Project mention: Book or web book recommendation request: a data visualization cookbook using Python for scientists. | reddit.com/r/Python | 2023-02-22
Have you tried Orange? https://orangedatamining.com/ This is not a direct answer to your question but Orange has Python based stuff for data mining and visualization. It is very intuitive as for being a graphical interface.
-
mljar-supervised
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
Project mention: [P] Build data web apps in Jupyter Notebook with Python only | reddit.com/r/MachineLearning | 2023-02-15Sure, at the bottom of our website you can subscribe for newsletter.
-
InfluxDB
Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.
-
Project mention: awesome-fraud-detection-papers: NEW Extended Research - star count:1277.0 | reddit.com/r/algoprojects | 2023-02-12
-
awesome-gradient-boosting-papers
A curated list of gradient boosting research papers with implementations.
-
You can use Optuna, SMAC or hyperopt
-
decision-forests
A collection of state-of-the-art algorithms for the training, serving and interpretation of Decision Forest models in Keras.
Project mention: Why do tree-based models still outperform deep learning on tabular data? | news.ycombinator.com | 2022-08-03I can't explain it, but I help maintain TensorFlow Decision Forests [1] and Yggdrasil Decision Forests [2], and in an AutoML system at work that trains models on lots of various users data, decision forest models gets selected as best (after AutoML tries various model types and hyperparameters) somewhere between 20% to 40% of the times, systematically. It's pretty interesting. Other ML types considered are NN, linear models (with auto feature crossings generation), and a couple of other variations.
-
Project mention: Open Source Hacktivism, Open Source Gains Traction in the Enterprise, and More: Open Source Matters | dev.to | 2022-05-15
FastTreeSHAP - A Python package from LinkedIn for fast interpretation of the TreeSHAP algorithm.
-
Sonar
Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
-
Project mention: EleutherAI announces it has become a non-profit | news.ycombinator.com | 2023-03-02
> My big gripe, and for obvious reasons, is that we need to step away from cloud-based inference, and it doesn't seem like anyone's working on that.
I think there are steps being taken in this direction (check out [1] and [2] for interesting lightweight transpile / ad-hoc training projects) but there is a lack of centralized community for these constrained problems.
-
Project mention: Ask HN: What is the most impactful thing you've ever built? | news.ycombinator.com | 2022-11-18
-
shapley
The official implementation of "The Shapley Value of Classifiers in Ensemble Games" (CIKM 2021).
-
Project mention: Using machine learning to identify malware in Android applications | reddit.com/r/learnmachinelearning | 2022-05-10
-
HDB_Resale_Prices
Predicted and identified the drivers of Singapore HDB resale prices (2015-2019) with 0.96 Rsquare & $20,000 MAE. Web app deployment using Streamlit for user price prediction.
Just curious, have you seen the projects (e.g. this) using statistical learning to predict HDB resale prices?
-
TitanicPassangerSurvivalPredictor
A Web-App that uses Machine-Learning to predict a persons chances of surviving the Titanic Wreckage as a Passenger
Project mention: I Made a web-app that predicts whether you would have survived the Titanic wreck. | reddit.com/r/titanic | 2022-07-08github repository: https://github.com/karan51ngh/TitanicPassangerSurvivalPredictor
-
Unredactor
In this project we are tryinbg to create unredactor. Unredactor will take a redacted document and the redacted flag as input, inreturn it will give the most likely candidates to fill in redacted location. In this project we are only considered about unredacting names only. The data that we are considering is imdb data set with many review files. These files are used to buils corpora for finding tfidf score. Few files are used to train and in these files names are redacted and written into redact
Interestingly, some years back (perhaps 12-15 years?) someone developed a program that would examine the font a physically redacted document was written in, and the spacing to try to unredact it, with some relatively decent success as only a set combination of words/letters etc. could fill a specific redacted portion. Of course the larger the redacted block, the harder it becomes. It was interesting none the less, not sure what happened to it though. This: https://github.com/gt0410/Unredactor is similar, but not what I was thinking of, and this: https://hackaday.com/2008/08/01/exposing-poorly-redacted-pdfs/ may also prove interesting for you.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python random-forest related posts
- [D] Most Popular AI Research Aug 2022 - Ranked Based On GitHub Stars
- Why do tree-based models still outperform deep learning on tabular data?
- Cold Showers
- Simple and embedded friendly C code for Machine Learning inference algorithms
- Regression with the C64
- Miceforest: Fast, Memory Efficient, Multiple Imputation by Chained Equations
- Show HN: Multiple Imputation with Lightgbm
-
A note from our sponsor - InfluxDB
www.influxdata.com | 28 Mar 2023
Index
What are some of the best open-source random-forest projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | orange | 4,011 |
2 | mljar-supervised | 2,524 |
3 | awesome-fraud-detection-papers | 1,311 |
4 | awesome-gradient-boosting-papers | 880 |
5 | SMAC3 | 813 |
6 | decision-forests | 561 |
7 | FastTreeSHAP | 388 |
8 | emlearn | 242 |
9 | miceforest | 229 |
10 | shapley | 191 |
11 | DroidDetective | 79 |
12 | HDB_Resale_Prices | 15 |
13 | TitanicPassangerSurvivalPredictor | 1 |
14 | Unredactor | 0 |