Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at www.getonboard.dev. Learn more →
Top 22 Python Xgboost Projects
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic DocumentationProject mention: Show HN: Web App with GUI for AutoML on Tabular Data | news.ycombinator.com | 2023-08-24
Web App is using two open-source packages that I've created:
- MLJAR AutoML - Python package for AutoML on tabular data https://github.com/mljar/mljar-supervised
- Mercury - framework for converting Jupyter Notebooks into Web App https://github.com/mljar/mercury
You can run Web App locally. What is more, you can adjust notebook's code for your needs. For example, you can set different validation strategies or evalutaion metrics or longer training times. The notebooks in the repo are good starting point for you to develop more advanced apps.
Check out: https://github.com/BayesWitnesses/m2cgen
Learn any GitHub repo in 59 seconds. Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at www.getonboard.dev.
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Standardized Serverless ML Inference Platform on KubernetesProject mention: Show HN: Software for Remote GPU-over-IP | news.ycombinator.com | 2022-12-14
Inference servers essentially turn a model running on CPU and/or GPU hardware into a microservice.
Many of them support the kserve API standard that supports everything from model loading/unloading to (of course) inference requests across models, versions, frameworks, etc.
So in the case of Triton you can have any number of different TensorFlow/torch/tensorrt/onnx/etc models, versions, and variants. You can have one or more Triton instances running on hardware with access to local GPUs (for this example). Then you can put standard REST and or grpc load balancers (or whatever you want) in front of them, hit them via another API, whatever.
Now all your applications need to do to perform inference is do an HTTP POST (or use a client) for model input, Triton runs it on a GPU (or CPU if you want), and you get back whatever the model output is.
Not a sales pitch for Triton but it (like some others) can also do things like dynamic batching with QoS parameters, automated model profiling and performance optimization, really granular control over resources, response caching, python middleware for application/biz logic, accelerated media processing with Nvidia DALI, all kinds of stuff.
Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.Project mention: I need help lol | /r/AskStatistics | 2023-03-07
AutoViz: A Python library that automatically visualizes any dataset in just one line of code.
MLBox is a powerful Automated Machine Learning python library.
A curated list of gradient boosting research papers with implementations.Project mention: [R] Boosted Trees Literature | /r/MachineLearning | 2023-02-14
Collect and Analyze Billions of Data Points in Real Time. Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge.
Scalable machine 🤖 learning for time series forecasting.Project mention: Sales forecast for next two years | /r/datascience | 2023-06-25
An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and moreProject mention: Multi-model serving options | /r/mlops | 2023-02-12
You've already mentioned Seldon Core which is well worth looking at but if you're just after the raw multi-model serving aspect rather than a fully-fledged deployment framework you should maybe take a look at the individual inference servers: Triton Inference Server and MLServer both support multi-model serving for a wide variety of frameworks (and custom python models). MLServer might be a better option as it has an MLFlow runtime but only you will be able to decide that. There also might be other inference servers that do MMS that I'm not aware of.
Automatically Build Multiple ML Models with a Single Line of Code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.
An optimization and data collection toolbox for convenient and fast prototyping of computationally expensive models.Project mention: Hyperactive Version 4.5 Released | news.ycombinator.com | 2023-08-27
Fast SHAP value computation for interpreting tree-based models
:ledger: The MLOps stack component for experiment trackingProject mention: Show HN: A gallery of dev tool marketing examples | news.ycombinator.com | 2023-10-07
Whenever I'd start a new marketing project I found myself going over a list of 20+ companies I knew could have done something well to “copy-paste” their approach as a baseline (think Tailscale, DigitalOCean, Vercel, Algolia, CircleCi, Supabase, Posthog, Auth0).
So past year and a half, I’ve been screenshoting examples of how companies that are good at dev marketing do things like pricing, landing page design, ads, videos, blog conversion ideas. And for each example I added a note as to why I thought it was good.
Now, it is ~140 examples organized by tags so you can browse all or get stuff for a particular topic.
Hope it is helpful to some dev tool founders and marketers in here.
Also, I am always looking for new companies/marketing ideas to add to this, so if you’d like to share good examples I’d really appreciate it.
Distributed XGBoost on Ray
MLOps Python Library (by SeldonIO)
A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)Project mention: DeepMind has open-sourced the heart of AlphaGo and AlphaZero | news.ycombinator.com | 2023-02-15
> I came up with a nifty implementation in Python that outperforms the naive impl by 30x, allowing a pure python MCTS/NN interop implementation. See https://www.moderndescartes.com/essays/deep_dive_mcts/
Chasing pointers in the MCTS tree is definitely a slow approach. Although typically there are < 900 "considerations" per move for alphazero. I've found getting value/policy predictions from a neural network (or GBDT) for the node expansions during those considerations is at least an order of magnitude slower than the MCTS tree-hopping logic.
A python tool using XGboost and sentence-transformers to perform schema matching task on tables.
Contextual Multi-Armed Bandit Platform for Scoring, Ranking & DecisionsProject mention: It's Okay to Make Something Nobody Wants | news.ycombinator.com | 2023-09-23
I’ll push back on this. I think most of us want some external validation that what we’re creating is useful and valuable.
I spent 2 solid years working on https://improve.ai and it’s gotten very little uptake. It’s taken a long time to emotionally get over putting that much blood, sweat, and money into something that almost no one values.
Going forward will be failing much faster and not giving my life energy to products that aren’t showing traction.
Convert XGBoost's DMatrix format to np.array
Optimize, convert and deploy machine learning models as fast inference API using Triton and ORT. Currently support Hugging Face transformers, PyToch, Tensorflow, SKLearn and XGBoost models.
Language Identification classification using XGBoost
Predicts chess960 or crazyhouse ratings given bullet or blitz and others for either Lichess.org or Chess.com servers.
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Xgboost related posts
Show HN: A gallery of dev tool marketing examples
1 project | news.ycombinator.com | 7 Oct 2023
How to structure/manage a machine learning experiment? (medical imaging)
1 project | /r/learnmachinelearning | 29 Aug 2023
How to grow a developer blog to 3M annual visitors? with Jakub Czakon (Neptune.ai)
1 project | dev.to | 24 Aug 2023
[D] Is there any all in one deep learning platform or software
1 project | /r/MachineLearning | 9 Jul 2023
New Data Scientist, want to get into MLOps, where to start?
1 project | /r/developer | 4 Jul 2023
Does a fully sentient (Or at least as sentient as you and me) AI with free will have a soul?
1 project | /r/ArtificialInteligence | 20 May 2023
[D] The hype around Mojo lang
2 projects | /r/MachineLearning | 5 May 2023
A note from our sponsor - Onboard AI
getonboard.dev | 9 Dec 2023
What are some of the best open-source Xgboost projects in Python? This list will help you: