vowpal_wabbit
Recommender
Our great sponsors
vowpal_wabbit | Recommender | |
---|---|---|
11 | 0 | |
8,394 | 259 | |
0.4% | - | |
8.3 | 0.0 | |
7 days ago | over 1 year ago | |
C++ | C | |
GNU General Public License v3.0 or later | BSD 2-clause "Simplified" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
vowpal_wabbit
-
Microsoft Reinforcement Learning Open Source Fest 2022 – Native CSV Parser
My project here at the Reinforcement Learning Open Source Fest 2022 is to add the native CSV parsing feature for the Vowpal Wabbit.
-
Predicting numerical values to a very high accuracy
If you only have 198 possible values then extreme multiclass models might benefit here with better precision and faster convergence. For example probabilistic label trees might have some relevance. Vowpal Wabbit also has specific reductions for extreme multi class problems. Might be worth a try if other alternatives still don't work out.
-
Performance comparison: counting words in Python, Go, C++, C, AWK, Forth, and Rust
You're likely correct, but I do recall attending a lecture by John Langford of https://vowpalwabbit.org/ running some form of an N-gram C++ based NLP model, including summary statistics on performance, in less time than wc -l took on the same data. Must have some neat hashing tricks, but still was cool
Recommender
We haven't tracked posts mentioning Recommender yet.
Tracking mentions began in Dec 2020.
What are some alternatives?
catboost - A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
xgboost - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
SHOGUN - Shōgun
mxnet - Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
MeTA - A Modern C++ Data Sciences Toolkit
Caffe - Caffe: a fast open framework for deep learning.
faiss-server - faiss serving :)
mlpack - mlpack: a fast, header-only C++ machine learning library
CCV - C-based/Cached/Core Computer Vision Library, A Modern Computer Vision Library