datasciencecoursera vs vtreat
| | datasciencecoursera | vtreat |
|---|---|---|
| | 44 | 1 |
| Stars | 2,196 | 281 |
| Growth | - | 0.4% |
| Activity | 0.0 | 3.0 |
| Last commit | about 1 year ago | 9 months ago |
| Language | HTML | HTML |
| License | - | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub.
Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
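The page does not publish the formula behind the activity number, so the following is only an illustrative sketch of the two ideas described above: recent commits count more than older ones, and the final number reflects a project's rank among all tracked projects. The exponential half-life, the function names, and the 0 to 10 scaling are all assumptions, not the site's actual method.

```python
from bisect import bisect_left


def weighted_commit_score(commit_ages_days, half_life_days=30.0):
    """Sum of commit weights; a commit loses half its weight every half-life.

    The exponential decay and the 30-day half-life are assumptions.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_index(raw_score, all_raw_scores):
    """Scale a raw score to 0-10 by the share of projects scoring strictly lower.

    Under this assumed scaling, 9.0 means 90% of tracked projects score lower,
    i.e. the project sits in the top 10%.
    """
    ranked = sorted(all_raw_scores)
    below = bisect_left(ranked, raw_score)  # count of strictly lower scores
    return round(10.0 * below / len(ranked), 1)
```

For instance, `weighted_commit_score([0, 30])` counts a commit made today at full weight and a 30-day-old commit at half weight.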
datasciencecoursera
-
vtreat
[Q] Reducing Categorical Variables with Many Levels
vtreat was created to solve this problem. IIRC it uses a form of contrast encoding.
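As a rough illustration of the problem in that thread, one common way to reduce a categorical variable with many levels is to replace each level with a single number derived from the outcome, often called impact or target encoding. The sketch below is plain Python and is not vtreat's implementation; the function names and the `smoothing` parameter are hypothetical, chosen only for this example.

```python
from collections import defaultdict


def fit_impact_codes(levels, targets, smoothing=10.0):
    """Map each level to its smoothed mean outcome minus the global mean.

    `smoothing` acts like a count of pseudo-observations at the global mean,
    so rare levels are pulled toward 0. (Hypothetical helper.)
    """
    if len(levels) != len(targets) or not levels:
        raise ValueError("levels and targets must be non-empty and equal length")
    global_mean = sum(targets) / len(targets)
    sums = defaultdict(float)
    counts = defaultdict(int)
    for level, y in zip(levels, targets):
        sums[level] += y
        counts[level] += 1
    codes = {}
    for level, n in counts.items():
        shrunk_mean = (sums[level] + smoothing * global_mean) / (n + smoothing)
        codes[level] = shrunk_mean - global_mean
    return codes


def apply_impact_codes(levels, codes):
    """Encode levels; levels unseen during fitting get 0.0 (no deviation)."""
    return [codes.get(level, 0.0) for level in levels]
```

Fitting the codes on the same rows later used to train a model leaks the outcome into the feature, so in practice the codes are usually estimated on held-out or cross-validated data.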
What are some alternatives?
random-dose-of-knowledge - Using the latest Software Engineering practices to create a modern and simple app.
lme4cens - Simple Mixed Effect Models and Censoring
data-science-interviews - Data science interview questions and answers
intro_stats - Introduction to Statistics: an integrated textbook and workbook using R
H2O - H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
my-little-crony - A visualization of the connections between Tory politicians and companies being awarded government contracts during the pandemic.
Data-science-best-resources - Carefully curated resource links for data science in one place
metaflow - Build and manage real-life ML, AI, and data science projects with ease!
ML-Workspace - 🛠 All-in-one web-based IDE specialized for machine learning and data science.
surveydown - An attempt to build a markdown-based survey platform using Quarto & Shiny
sololearn - Compilation of all SoloLearn courses with their respective projects and practices and all 72 code challenges for all 7 supported languages.