FLAML
darts
FLAML | darts | |
---|---|---|
9 | 47 | |
3,679 | 7,294 | |
1.3% | 1.9% | |
7.9 | 9.1 | |
27 days ago | 2 days ago | |
Jupyter Notebook | Python | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
FLAML
-
AutoGen: Enabling Next-Gen GPT-X Applications
I really like the simplicity of this framework, and they hit on a lot of common problems found in other agent-based frameworks. Most intrigued by the RAG improvements.
Seems like Microsoft was frustrated with the pace of movement in this space and the shitty results of agents (which admittedly kept my interest turned away from agents for the last few months). I'm interested again because it makes practical sense, and from looking at the example notebooks, seems fairly easy to integrate into existing applications.
Maybe this is the 'low code' approach that might actually work, and bridge together engineering and non-engineering resources.
This example was what caught my eye: https://github.com/microsoft/FLAML/blob/main/notebook/autoge...
-
Elevate Your Python Skills: Machine Learning Packages That Transformed My Journey as ML Engineer
4. FLAML
- Show HN: AutoML Python Package for Tabular Data with Automatic Documentation
- [D] If there’s one practical tip you wish should have been drilled deeply into you when you first started out learning about deep learning, what would it be?
-
what is the future of ML.NET?
Improved AutoML - Again, with collaboration from Microsoft Research, we used FLAML to update our existing AutoML solutions. What does this mean for you? You're using the latest techniques but all you need is a problem to solve and some data to get started.
-
Automated Machine Learning (AutoML) - 9 Different Ways with Microsoft AI
For a complete tutorial, navigate to this Jupyter Notebook: https://github.com/microsoft/FLAML/blob/main/notebook/flaml_automl.ipynb
-
[N] Fast AutoML with Microsoft's FLAML + Ray Tune
Microsoft Researchers have developed FLAML (Fast Lightweight AutoML) which can now utilize Ray Tune for distributed hyperparameter tuning to scale up FLAML’s resource-efficient & seamlessly parallelizable algorithms across a cluster.
-
[R] FLAML - Fast and Lightweight AutoML library
Looks nice but I wonder if this is practical for non-tiny problems. The papers are a bit hard to follow but it looks like training is restarted with every new architecture choice. As for the library itself, the only large neural net example is a finetune of an NLP model that only searches over ADAM's optimizer params - which could be useful but it's a stretch to call that AutoML.
- Flaml – Cost-effective hyperparameter optimization AutoML
darts
- Darts: Python lib for forecasting and anomaly detection on time series
-
[D] Doubts on the implementation of LSTMs for timeseries prediction (like including weather forecasts)
Don't use an LSTM. Get up to date with SoTA methods and read the papers in the field. LSTMs are not the way forward. Read the papers I suggested. It would be very useful to come to grips with both the Time Series Repository (https://github.com/thuml/Time-Series-Library) and Darts (https://github.com/unit8co/darts) as these are widely used for research and in industry.
-
Facebook Prophet: library for generating forecasts from any time series data
As others have pointed out, Prophet is not a particularly good model for forecasting, and has been superseded by a multitude of other models. If you want to do time series forecasting, I'd recommend using Darts: https://github.com/unit8co/darts. Darts implements a wide range of models and is fairly easy to use.
The problem with time series forecasting in general is that they make a lot of assumptions on the shape of your data, and you'll find you're spending a lot of time figuring out mutating your data. For example, they expect that your data comes at a very regular interval. This is fine if it's, say, the data from a weather station. This doesn't work well in clinical settings (imagine a patient admitted into the ER -- there is a burst of data, followed by no data).
That said, there's some interesting stuff out there that I've been experimenting with that seems to be more tolerant of irregular time series and can be quite useful. If you're interested in exchanging ideas, drop me a line (email in my profile).
-
Elevate Your Python Skills: Machine Learning Packages That Transformed My Journey as ML Engineer
3. darts
-
Aeon: A unified framework for machine learning with time series
Looking forward to checking this out! How does this compare with darts[1]?
[1] https://unit8co.github.io/darts/
- [D] Hybrid forecasting framework ARIMA-LSTM
-
[D] Do any of you have experience using Darts for forecasting?
Darts is an open-source Python library by Unit8 for easy handling, pre-processing, and forecasting of time series. It contains an array of models, from standard statistical models such as ARIMA to deep neural networks. https://unit8co.github.io/darts/
-
gluonts VS darts - a user suggested alternative
2 projects | 13 Apr 2023
active support
-
A Simple Guide to Feature Engineering in the Forecast Menu
The new Forecast menu, featuring the open-source Darts Time Series library, offers script-friendly functionality. It's also easy to use. Don't have any data to load yet? Enter through the Stocks or Crypto menus.
-
Ask HN: Data Scientists, what libraries do you use for timeseries forecasting?
I would recommend Darts in Python [1]. It's easy to use (think fit()/predict()) and includes
* Statistical models (ETS, (V)ARIMA(X), etc)
* ML models (sklearn models, LGBM, etc)
* Many recent deep learning models (N-BEATS, TFT, etc)
* Seamlessly works on multi-dimensional series
* Models can be trained on multiple series
* Many models offer rich support for probabilistic forecasts
* Model evaluation is easy: Darts has many metrics, offers backtest etc
* Deep learning scales to large datasets, using GPUs, TPUs, etc
* There's even now an explainability module for some of the models - showing you what matters for computing the forecasts
* (coming soon): an anomaly detection module :)
* (also, it even include FB Prophet if you really want to use it)
Warning: I'm probably biased because I'm Darts creator.
[1] https://github.com/unit8co/darts
What are some alternatives?
autogluon - Fast and Accurate ML in 3 Lines of Code
sktime - A unified framework for machine learning with time series
nni - An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
pytorch-forecasting - Time series forecasting with PyTorch
H2O - H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Prophet - Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
ML-For-Beginners - 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Kats - Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristics, detecting change points and anomalies, to forecasting future trends.
Made-With-ML - Learn how to design, develop, deploy and iterate on production-grade ML applications.
tsai - Time series Timeseries Deep Learning Machine Learning Pytorch fastai | State-of-the-art Deep Learning library for Time Series and Sequences in Pytorch / fastai
nitroml - NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (AutoML) pipelines.
statsforecast - Lightning ⚡️ fast forecasting with statistical and econometric models.