Top 14 Python random-forest Projects
🍊 📊 💡 Orange: Interactive data analysis
Project mention: Book or web book recommendation request: a data visualization cookbook using Python for scientists. | reddit.com/r/Python | 2023-02-22
Have you tried Orange? https://orangedatamining.com/ This is not a direct answer to your question, but Orange has Python-based tools for data mining and visualization. It is very intuitive, since it is a graphical interface.
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
Project mention: [P] Build data web apps in Jupyter Notebook with Python only | reddit.com/r/MachineLearning | 2023-02-15
Sure, at the bottom of our website you can subscribe for newsletter.
A curated list of data mining papers about fraud detection.
Project mention: awesome-fraud-detection-papers: NEW Extended Research - star count:1277.0 | reddit.com/r/algoprojects | 2023-02-12
A curated list of gradient boosting research papers with implementations.
Project mention: [R] Boosted Trees Literature | reddit.com/r/MachineLearning | 2023-02-14
SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization
Project mention: [D]How to optimize an ANN? | reddit.com/r/MachineLearning | 2022-08-12
You can use Optuna, SMAC or hyperopt
A collection of state-of-the-art algorithms for the training, serving and interpretation of Decision Forest models in Keras.
Project mention: Why do tree-based models still outperform deep learning on tabular data? | news.ycombinator.com | 2022-08-03
I can't explain it, but I help maintain TensorFlow Decision Forests and Yggdrasil Decision Forests, and in an AutoML system at work that trains models on lots of various users' data, decision forest models get selected as best (after AutoML tries various model types and hyperparameters) somewhere between 20% and 40% of the time, systematically. It's pretty interesting. Other ML types considered are NN, linear models (with auto feature crossings generation), and a couple of other variations.
Fast SHAP value computation for interpreting tree-based models
Project mention: Open Source Hacktivism, Open Source Gains Traction in the Enterprise, and More: Open Source Matters | dev.to | 2022-05-15
FastTreeSHAP - A Python package from LinkedIn for fast interpretation of the TreeSHAP algorithm.
Machine Learning inference engine for Microcontrollers and Embedded devices
Project mention: EleutherAI announces it has become a non-profit | news.ycombinator.com | 2023-03-02
> My big gripe, and for obvious reasons, is that we need to step away from cloud-based inference, and it doesn't seem like anyone's working on that.
I think there are steps being taken in this direction (check out  and  for interesting lightweight transpile / ad-hoc training projects) but there is a lack of centralized community for these constrained problems.
Multiple Imputation with LightGBM in Python
Project mention: Ask HN: What is the most impactful thing you've ever built? | news.ycombinator.com | 2022-11-18
The official implementation of "The Shapley Value of Classifiers in Ensemble Games" (CIKM 2021).
A machine learning malware analysis framework for Android apps.
Project mention: Using machine learning to identify malware in Android applications | reddit.com/r/learnmachinelearning | 2022-05-10
Predicted and identified the drivers of Singapore HDB resale prices (2015-2019) with an R-squared of 0.96 and an MAE of $20,000. Web app deployment using Streamlit for user price prediction.
Project mention: What's that HDB worth? | reddit.com/r/singaporefi | 2022-04-29
Just curious, have you seen the projects (e.g. this) using statistical learning to predict HDB resale prices?
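For readers unfamiliar with the two metrics quoted for the HDB project (R-squared and MAE), here is a minimal sketch of how they are usually computed. The function names and the price figures are made up for illustration and are not taken from the project or its data.

```python
def mean_absolute_error(actual, predicted):
    """Average absolute difference between actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def r_squared(actual, predicted):
    """Coefficient of determination: 1 - (residual SS / total SS)."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1 - ss_res / ss_tot


# Hypothetical resale prices in dollars (illustrative values only).
actual = [400_000, 450_000, 500_000, 550_000]
predicted = [410_000, 440_000, 520_000, 540_000]

mae = mean_absolute_error(actual, predicted)
r2 = r_squared(actual, predicted)
print(f"MAE: ${mae:,.0f}  R-squared: {r2:.3f}")
```

An MAE is expressed in the same units as the target (dollars here), while R-squared is unitless, which is why the two are often reported together.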
A web app that uses machine learning to predict a person's chances of surviving the Titanic wreck as a passenger
Project mention: I Made a web-app that predicts whether you would have survived the Titanic wreck. | reddit.com/r/titanic | 2022-07-08
github repository: https://github.com/karan51ngh/TitanicPassangerSurvivalPredictor
In this project we are trying to create an unredactor. The unredactor takes a redacted document and the redacted flag as input, and in return gives the most likely candidates to fill in the redacted location. This project is concerned with unredacting names only. The data considered is the IMDB data set with many review files. These files are used to build corpora for finding TF-IDF scores. A few files are used for training, and in these files names are redacted and written into redact
Project mention: Redacted and Sanitized | reddit.com/r/conspiracyNOPOL | 2022-10-24
Interestingly, some years back (perhaps 12-15 years?) someone developed a program that would examine the font a physically redacted document was written in, and the spacing, to try to unredact it, with some relatively decent success, as only a set combination of words/letters etc. could fill a specific redacted portion. Of course, the larger the redacted block, the harder it becomes. It was interesting nonetheless; not sure what happened to it though. This: https://github.com/gt0410/Unredactor is similar, but not what I was thinking of, and this: https://hackaday.com/2008/08/01/exposing-poorly-redacted-pdfs/ may also prove interesting for you.
Python random-forest related posts
[D] Most Popular AI Research Aug 2022 - Ranked Based On GitHub Stars
5 projects | reddit.com/r/MachineLearning | 3 Sep 2022
Why do tree-based models still outperform deep learning on tabular data?
5 projects | news.ycombinator.com | 3 Aug 2022
4 projects | news.ycombinator.com | 18 Jun 2022
Simple and embedded friendly C code for Machine Learning inference algorithms
1 project | reddit.com/r/C_Programming | 1 Jan 2022
Regression with the C64
1 project | news.ycombinator.com | 27 Dec 2021
Miceforest: Fast, Memory Efficient, Multiple Imputation by Chained Equations
1 project | news.ycombinator.com | 15 Dec 2021
Show HN: Multiple Imputation with Lightgbm
1 project | news.ycombinator.com | 15 Oct 2021
What are some of the best open-source random-forest projects in Python? This list will help you: