NVTabular
powershap
Our great sponsors
NVTabular | powershap | |
---|---|---|
1 | 1 | |
1,006 | 175 | |
1.4% | 2.3% | |
5.5 | 4.2 | |
5 days ago | 8 months ago | |
Python | Python | |
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
NVTabular
-
ETL Pipelines with Airflow: The Good, the Bad and the Ugly
If you have GPUs, NVTabular outperforms most of the frameworks out there: https://github.com/NVIDIA/NVTabular
powershap
-
[R] PowerShap: A power-full Shapley feature selection method.
We are glad to present our novel shap-based wrapper feature selection method called PowerShap! This method uses statistical hypothesis testing and power calculations on Shapley values, enabling fast and intuitive wrapper-based feature selection. The complete library and methods are fully compatible with Sklearn, LightGBM, CatBoost, and more are coming in further following releases and the library can be found here: https://github.com/predict-idlab/powershap! The library is open-source and usable out-of-the-box as shown in the video!
What are some alternatives?
dbt-expectations - Port(ish) of Great Expectations to dbt test macros
cascade - Lightweight and modular MLOps library targeted at small teams or individuals
Scio - A Scala API for Apache Beam and Google Cloud Dataflow.
evalml - EvalML is an AutoML library written in python.
cuetils - CLI and library for diff, patch, and ETL operations on CUE, JSON, and Yaml
upgini - Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs
scikit-learn - scikit-learn: machine learning in Python
federeco - implementation of federated neural collaborative filtering algorithm
Sklearn-genetic-opt - ML hyperparameters tuning and features selection, using evolutionary algorithms.
daggy
mljar-supervised - Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation