catboost
mxnet
Our great sponsors
catboost | mxnet | |
---|---|---|
8 | 4 | |
7,731 | 20,644 | |
1.4% | - | |
9.9 | 4.1 | |
7 days ago | 6 months ago | |
Python | C++ | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
catboost
- CatBoost: Open-source gradient boosting library
- Boosting Algorithms
-
What's New with AWS: Amazon SageMaker built-in algorithms now provides four new Tabular Data Modeling Algorithms
CatBoost is another popular and high-performance open-source implementation of the Gradient Boosting Decision Tree (GBDT). To learn how to use this algorithm, please see example notebooks for Classification and Regression.
-
Writing the fastest GBDT libary in Rust
Here are our benchmarks on training time comparing Tangram's Gradient Boosted Decision Tree Library to LightGBM, XGBoost, CatBoost, and sklearn.
-
Data Science toolset summary from 2021
Catboost - CatBoost is an open-source software library developed by Yandex. It provides a gradient boosting framework which attempts to solve for Categorical features using a permutation driven alternative compared to the classical algorithm. Link - https://catboost.ai/
-
CatBoost Quickstart — ML Classification
CatBoost is an open source algorithm based on gradient boosted decision trees. It supports numerical, categorical and text features. Check out the docs.
-
[D] What are your favorite Random Forest implementations that support categoricals
If you considering GBDT check out catboost, unfortunately RF mode is not available but library implement lots of interesting categorical encoding tricks that boost accuracy.
-
CatBoost and Water Pumps
The data contains a large number of categorical features. The most suitable for obtaining a base-line model, in my opinion, is CatBoost. It is a high-performance, open-source library for gradient boosting on decision trees.
mxnet
-
List of AI-Models
Click to Learn more...
-
Introduction to deep learning hardware in the cloud
Build – Choose a machine learning framework (such as TensorFlow, PyTorch, Apache MXNet, etc.)
-
just released my Clojure AI book
Clojure and Python also have bindings to the Apache MXNet library. Is there a reason why you didn't use them in some of your projects?
-
Can Apple's M1 help you train models faster and cheaper than Nvidia's V100?
> But you still lose something, e.g. if you use half precision on V100 you get virtually double speed, if you do on a 1080 / 2080 you get... nothing because it's not supported.
That's not true. FP16 is supported and can be fast on 2080, although some frameworks fail to see the speed-up. I filed a bug report about this a year ago: https://github.com/apache/incubator-mxnet/issues/17665
What consumer GPUs lack is ECC and fast FP64.
What are some alternatives?
xgboost - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Caffe - Caffe: a fast open framework for deep learning.
Recommender - A C library for product recommendations/suggestions using collaborative filtering (CF)
Pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration
Keras - Deep Learning for humans
Caffe2
Prophet - Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
mlpack - mlpack: a fast, header-only C++ machine learning library
vowpal_wabbit - Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
Porcupine - On-device wake word detection powered by deep learning
CCV - C-based/Cached/Core Computer Vision Library, A Modern Computer Vision Library
Theano - Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It is being continued as PyTensor: www.github.com/pymc-devs/pytensor