Python random-forest

Open-source Python projects categorized as random-forest

Top 14 Python random-forest Projects

  • orange

    🍊 :bar_chart: :bulb: Orange: Interactive data analysis

    Project mention: Book or web book recommendation request: a data visualization cookbook using Python for scientists. | reddit.com/r/Python | 2023-02-22

    Have you tried Orange? https://orangedatamining.com/ This is not a direct answer to your question but Orange has Python based stuff for data mining and visualization. It is very intuitive as for being a graphical interface.

  • mljar-supervised

    Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation

    Project mention: [P] Build data web apps in Jupyter Notebook with Python only | reddit.com/r/MachineLearning | 2023-02-15

    Sure, at the bottom of our website you can subscribe for newsletter.

  • InfluxDB

    Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.

  • awesome-fraud-detection-papers

    A curated list of data mining papers about fraud detection.

    Project mention: awesome-fraud-detection-papers: NEW Extended Research - star count:1277.0 | reddit.com/r/algoprojects | 2023-02-12
  • awesome-gradient-boosting-papers

    A curated list of gradient boosting research papers with implementations.

    Project mention: [R] Boosted Trees Literature | reddit.com/r/MachineLearning | 2023-02-14
  • SMAC3

    SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization

    Project mention: [D]How to optimize an ANN? | reddit.com/r/MachineLearning | 2022-08-12

    You can use Optuna, SMAC or hyperopt

  • decision-forests

    A collection of state-of-the-art algorithms for the training, serving and interpretation of Decision Forest models in Keras.

    Project mention: Why do tree-based models still outperform deep learning on tabular data? | news.ycombinator.com | 2022-08-03

    I can't explain it, but I help maintain TensorFlow Decision Forests [1] and Yggdrasil Decision Forests [2], and in an AutoML system at work that trains models on lots of various users data, decision forest models gets selected as best (after AutoML tries various model types and hyperparameters) somewhere between 20% to 40% of the times, systematically. It's pretty interesting. Other ML types considered are NN, linear models (with auto feature crossings generation), and a couple of other variations.

    [1] https://github.com/tensorflow/decision-forests

  • FastTreeSHAP

    Fast SHAP value computation for interpreting tree-based models

    Project mention: Open Source Hacktivism, Open Source Gains Traction in the Enterprise, and More: Open Source Matters | dev.to | 2022-05-15

    FastTreeSHAP - A Python package from LinkedIn for fast interpretation of the TreeSHAP algorithm.

  • Sonar

    Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.

  • emlearn

    Machine Learning inference engine for Microcontrollers and Embedded devices

    Project mention: EleutherAI announces it has become a non-profit | news.ycombinator.com | 2023-03-02

    > My big gripe, and for obvious reasons, is that we need to step away from cloud-based inference, and it doesn't seem like anyone's working on that.

    I think there are steps being taken in this direction (check out [1] and [2] for interesting lightweight transpile / ad-hoc training projects) but there is a lack of centralized community for these constrained problems.

    [1] https://github.com/emlearn/emlearn

  • miceforest

    Multiple Imputation with LightGBM in Python

    Project mention: Ask HN: What is the most impactful thing you've ever built? | news.ycombinator.com | 2022-11-18
  • shapley

    The official implementation of "The Shapley Value of Classifiers in Ensemble Games" (CIKM 2021).

  • DroidDetective

    A machine learning malware analysis framework for Android apps.

    Project mention: Using machine learning to identify malware in Android applications | reddit.com/r/learnmachinelearning | 2022-05-10
  • HDB_Resale_Prices

    Predicted and identified the drivers of Singapore HDB resale prices (2015-2019) with 0.96 Rsquare & $20,000 MAE. Web app deployment using Streamlit for user price prediction.

    Project mention: What's that HDB worth? | reddit.com/r/singaporefi | 2022-04-29

    Just curious, have you seen the projects (e.g. this) using statistical learning to predict HDB resale prices?

  • TitanicPassangerSurvivalPredictor

    A Web-App that uses Machine-Learning to predict a persons chances of surviving the Titanic Wreckage as a Passenger

    Project mention: I Made a web-app that predicts whether you would have survived the Titanic wreck. | reddit.com/r/titanic | 2022-07-08

    github repository: https://github.com/karan51ngh/TitanicPassangerSurvivalPredictor

  • Unredactor

    In this project we are tryinbg to create unredactor. Unredactor will take a redacted document and the redacted flag as input, inreturn it will give the most likely candidates to fill in redacted location. In this project we are only considered about unredacting names only. The data that we are considering is imdb data set with many review files. These files are used to buils corpora for finding tfidf score. Few files are used to train and in these files names are redacted and written into redact

    Project mention: Redacted and Sanitized | reddit.com/r/conspiracyNOPOL | 2022-10-24

    Interestingly, some years back (perhaps 12-15 years?) someone developed a program that would examine the font a physically redacted document was written in, and the spacing to try to unredact it, with some relatively decent success as only a set combination of words/letters etc. could fill a specific redacted portion. Of course the larger the redacted block, the harder it becomes. It was interesting none the less, not sure what happened to it though. This: https://github.com/gt0410/Unredactor is similar, but not what I was thinking of, and this: https://hackaday.com/2008/08/01/exposing-poorly-redacted-pdfs/ may also prove interesting for you.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-03-02.

Python random-forest related posts

Index

What are some of the best open-source random-forest projects in Python? This list will help you:

Project Stars
1 orange 4,011
2 mljar-supervised 2,524
3 awesome-fraud-detection-papers 1,311
4 awesome-gradient-boosting-papers 880
5 SMAC3 813
6 decision-forests 561
7 FastTreeSHAP 388
8 emlearn 242
9 miceforest 229
10 shapley 191
11 DroidDetective 79
12 HDB_Resale_Prices 15
13 TitanicPassangerSurvivalPredictor 1
14 Unredactor 0
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com