-
Hi all,
Currently some works is being done to improve computational primitives of scikit-learn to enhance its overhaul performances natively.
You can have a look at this exploratory PR: https://github.com/scikit-learn/scikit-learn/pull/20254
This other PR is a clear revamp of this previous one:
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
scikit-learn-intelex
Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
Looks like they are responding to https://github.com/intel/scikit-learn-intelex#-acceleration
I completely agree. I hope some Intel competitor funds a scikit-learn developer to read this code and extract all the portable performance improvements.
-
https://github.com/rapidsai/cuml
> cuML is a suite of libraries that implement machine learning algorithms and mathematical primitives functions that share compatible APIs with other RAPIDS projects. cuML enables data scientists, researchers, and software engineers to run traditional tabular ML tasks on GPUs without going into the details of CUDA programming. In most cases, cuML's Python API matches the API from scikit-learn. For large datasets, these GPU-based implementations can complete 10-50x faster than their CPU equivalents. For details on performance, see the cuML Benchmarks Notebook.
-
> Intel are focused on data-parallel C++ for delivering high performance, rightly or wrongly.
They also invest efforts in making it possible to write high performance kernels in Python using an extension to the numba Python compiler:
https://github.com/IntelPython/numba-dppy
Related posts
-
Marimo: Open-source reactive notebook for Python
-
Must-Know 2025 Developer’s Roadmap and Key Programming Trends
-
Building a Sarcasm Detection System with LSTM and GloVe: A Complete Guide
-
Marimo raises $5M to build an open-source reactive Python notebook
-
20 Open Source Tools I Recommend to Build, Share, and Run AI Projects