hummingbird
winget-pkgs
hummingbird | winget-pkgs | |
---|---|---|
9 | 98 | |
3,304 | 8,029 | |
0.5% | 1.2% | |
7.1 | 10.0 | |
17 days ago | 6 days ago | |
Python | PowerShell | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
hummingbird
- Treebomination: Convert a scikit-learn decision tree into a Keras model
-
[D] GPU-enabled scikit-learn
If are interested in just predictions you can try Hummingbird. It is part of the PyTorch ecosystem. We get already trained scikit-learn models and translate them into PyTorch models. From them you can run your model on any hardware support by PyTorch, export it into TVM, ONNX, etc. Performance on hardware acceleration is quite good (orders of magnitude better than scikit-learn is some cases)
-
Machine Learning with PyTorch and Scikit-Learn – The *New* Python ML Book
I think Rapids AI's cuML tried to go into this direction (essentially scikit-learn on the GPU): https://docs.rapids.ai/api/cuml/stable/api.html#logistic-reg.... For some reason it never took really off though.
Btw., going on a tangent, you might like Hummingbird (https://github.com/microsoft/hummingbird). It allows you trained scikit-learn tree-based models to PyTorch. I watched the SciPy talk last year, and it's a super smart & elegant idea.
-
Export and run models with ONNX
ONNX opens an avenue for direct inference using a number of languages and platforms. For example, a model could be run directly on Android to limit data sent to a third party service. ONNX is an exciting development with a lot of promise. Microsoft has also released Hummingbird which enables exporting traditional models (sklearn, decision trees, logistical regression..) to ONNX.
-
Supreme Court, in a 6–2 ruling in Google v. Oracle, concludes that Google’s use of Java API was a fair use of that material
And Python.
-
[D] Here are 3 ways to Speed Up Scikit-Learn - Any suggestions?
For inference, you can convert your models to other formats that support GPU acceleration. See Hummingbird https://github.com/microsoft/hummingbird
-
[D] Microsoft library, Hummingbird, compiles trained ML models into tensor computation for faster inference.
The surprising thing is that Hummingbird can be faster than the GPU implementation of LightGBM (and XGBoost) if you use tensor compilers such as TVM. [The paper](https://www.usenix.org/conference/osdi20/presentation/nakandala) describes our findings. We have also open sourced the [benchmark code](https://github.com/microsoft/hummingbird/tree/main/benchmarks) so you try yourself!
-
I learned about Microsoft's Hummingbird library today. 1000x performance??
I took their sample code from Github and tweaked it to spit out times for each model's prediction, as well as increase the number of rows to 5 million. I used Google's Colab and selected GPU for my hardware accelerator. This gives an option to run code on GPU, not that all computations will happen on the GPU.
winget-pkgs
-
FFmpeg 7.0 Released
7.0 is now available: https://github.com/microsoft/winget-pkgs/pull/147886
-
Packaging up NVIDIA driver updates...
I researched this for a WinGet thing: https://github.com/microsoft/winget-pkgs/pull/110618
-
2 spaces? 4 spaces? One tab?
Ah, reminds me of that time I requested a .editorconfig file in a Microsoft repo: https://github.com/microsoft/winget-pkgs/issues/329
-
MS and Windows gets a lot of (well deserved) hate, but winget is just fantastic!
Take dropbox as an example. This is what the yaml manifest looks like for that if you install it through winget. It literally has a hardcoded link to an .exe installer hosted by dropbox and then just set the flags to silent. I am not spreading misinformation, you are.
-
Windows is the malware compatibility layer for everything
It's not quite the same though, as there are different considerations when using a repository of things a unified group has decided should be included and built (or slightly modified existing) packages for and a repo where anyone can submit a package that will go through some level of vetting. In the end I still believe most this discussion is really about individuals and how much trust they apply towards different groups and sources and is not really about Linux or Windows in particular as much.
1: https://github.com/microsoft/winget-pkgs
- PowerToys Release 0.71
-
installed from winget, where is it located?
I never used winget, but probably: - https://github.com/microsoft/winget-pkgs/issues/107858 - https://github.com/Genymobile/scrcpy/issues/4027
-
The Unreasonable Effectiveness of VLC - A Comprehensive Exploration of a Multimedia Powerhouse
It's probably not on the Store, winget pulls from both the Store and a community collection of manifests on GitHub: https://github.com/microsoft/winget-pkgs
-
Seven.zip
I think that's part of the problem, if you don't have that package manager to bootstrap your signature key ring, DNS is your next best bootstrap. It is, of course, a terrible bootstrap for trust, but it is one so many users on Windows have been relying on for such a long time.
For power users on any modern Windows 10/Windows 11 there is at least WinGet now. Its manifests repo is becoming a very interesting (open) source of truth for common Windows applications. Admittedly, it in most cases doesn't seem to be checking specific code signatures in most cases either, but at least includes SHA checksums.
For instance, 7zip's manifests: https://github.com/microsoft/winget-pkgs/tree/master/manifes...
It's too bad there's still not a great option for "average user that doesn't know/trust how to use a CLI", given how sadly polluted the Microsoft Store can be for many common, especially Open Source, applications. For direct instance, because winget kindly includes Microsoft Store results when searching, there is a "7zip 22" in the Microsoft Store that costs some amount of money (winget details say "PaidUnknownPrice" for the pricing information; I'm on a corporate machine right now with the actual Store access locked so can't search in the actual Store right now) and the Publisher is listed as RepackagerExpress.com. (That website currently doesn't go anywhere, giving it a spot check.)
Having seen this, I may boot up my personal machine and try to report this specific Store listing for violating the Store's Open Source policies, though I'm unsure if such whackamole is all that useful. (Seems like it might be a useful winget feature request for it to provide Store Report URLs.)
-
App deployment switches
For example, see that Firefox has /S here.
What are some alternatives?
onnx - Open standard for machine learning interoperability
ansible.windows - Windows core collection for Ansible
swift - The Swift Programming Language
Scoop - A command-line installer for Windows.
sentence-transformers - Multilingual Sentence & Image Embeddings with BERT
ctags - A maintained ctags implementation
cuml - cuML - RAPIDS Machine Learning Library
appget - Free and open package manager for Windows.
docker - Docker - the open-source application container engine
winget-intune-win32 - Repository containing examples of how to use winget from Intune, also in system context.
chemprop - Message Passing Neural Networks for Molecule Property Prediction
gsudo - Sudo for Windows