automlbenchmark
ai-seed
automlbenchmark | ai-seed | |
---|---|---|
3 | 5 | |
380 | 113 | |
2.4% | 0.0% | |
6.7 | 1.8 | |
8 days ago | about 1 year ago | |
Python | Jupyter Notebook | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
automlbenchmark
-
Show HN: Web App with GUI for AutoML on Tabular Data
Here is benchmark done by independent team of researchers https://openml.github.io/automlbenchmark/
I think most of overfitting is avoided with early stoppoing technique.
The underfitting can be avoidwd with using large training time.
-
Show HN: AutoAI
Your list excludes most of well-known open-source AutoML tools such as auto-sklearn, AutoGluon, LightAutoML, MLJarSupervised, etc. These tools have been very extensively benchmarked by the OpenML AutoML Benchmark (https://github.com/openml/automlbenchmark) and have papers published, so they are pretty well-known to the AutoML community.
Regarding H2O.ai: Frankly, you don't seem to understand H2O.ai's AutoML offerings.
I'm the creator of H2O AutoML, which is open source, and there's no "enterprise version" of H2O AutoML. The interface is simple -- all you need to specify is the training data and target. We have included DNNs in our set of models since the first release of the tool in 2017. Read more here: https://docs.h2o.ai/h2o/latest-stable/h2o-docs/automl.html We also offer full explainability for our models: https://docs.h2o.ai/h2o/latest-stable/h2o-docs/explain.html
H2O.ai develops another AutoML tool called Driverless AI, which is proprietary. You might be conflating the two. Neither of these tools need to be used on the H2O AI Cloud. Both tools pre-date our cloud by many years and can be used on a user's own laptop/server very easily.
Your Features & Roadmap list in the README indicates that your tool does not yet offer DNNs, so either you should update your post here or update your README if it's incorrect: https://github.com/blobcity/autoai/blob/main/README.md#featu...
Lastly, I thought I would mention that there's already an AutoML tool called "AutoAI" by IBM. Generally, it's not a good idea to have name collisions in a small space like the AutoML community. https://www.ibm.com/support/producthub/icpdata/docs/content/...
-
Show HN: Mljar Automated Machine Learning for Tabular Data (Explanation,AutoDoc)
I'm also curious how does it compare! The package will be included in the newest comparison done by OpenML people https://github.com/openml/automlbenchmark
I have some old comparison of closed-source old system
ai-seed
-
Show HN: AutoAI
Thanks for your question. Yes, we did research the space a lot before making AutoAI. Here is what we found:
PyCaret: Semi-automatic. You do the first run; then you figure the next set of runs. Ensemble models require manual configuration.
Tpot: Does a great job. Generates 4-5 lines of py code too. But does not support Neural Networks / DNN. So works only for problems where GOFAI works.
H2O.ai: They have an open-source flavor, but the best way to use it is the enterprise version on the H2O cloud. The interface is confusing, and the final output is black-box.
Now there are many in the enterprise category, such as DataRobot, AWS SageMaker, Azure etc. Most are unaffordable to Data Scientists unless your employer is sponsoring the platform.
AutoAI: This is 100% automated. Uses GOFAI, Neural Networks and DNN, all in one box. It is 100% White-box. It is the only AutoML framework that generates high-quality (1000s of lines) of Jupyter Notebook code. You can check some example codes here: https://cloud.blobcity.com
- [P] Comparison for all Sklearn Classifiers
-
Ready AI Code Templates
Hi, this is the team at BlobCity. Creators of A.I. Cloud (https://cloud.blobcity.com). We just released 400+ ready to use AI seed projects. Code templates provide newbie data scientists a great starting reference. We ourselves find them super useful. Let us know what you all think!
- Show HN: Ready code templates for your next AI Experiment
What are some alternatives?
autogluon - Fast and Accurate ML in 3 Lines of Code
ML-For-Beginners - 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
nni - An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
ydata-profiling - 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
autokeras - AutoML library for deep learning
adanet - Fast and flexible AutoML with learning guarantees.
MindsDB - The platform for customizing AI from enterprise data
Time-series-classification-and-clustering-with-Reservoir-Computing - Library for implementing reservoir computing models (echo state networks) for multivariate time series classification and clustering.
HungaBunga - HungaBunga: Brute-Force all sklearn models with all parameters using .fit .predict!
mljar-supervised - Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
autoai - Python based framework for Automatic AI for Regression and Classification over numerical data. Performs model search, hyper-parameter tuning, and high-quality Jupyter Notebook code generation.