SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 feature-engineering Open-Source Projects
-
nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
mljar-supervised
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
-
metarank
A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine
-
SGX-Full-OrderBook-Tick-Data-Trading-Strategy
Providing the solutions for high-frequency trading (HFT) strategies using data science approaches (Machine Learning) on Full Orderbook Tick Data.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
OpenMLDB
OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.
-
hamilton
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
-
Deep_Learning_Machine_Learning_Stock
Deep Learning and Machine Learning stocks represent promising opportunities for both long-term and short-term investors and traders.
-
NVTabular
NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
-
functime
Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.
-
intelligent-trading-bot
Intelligent Trading Bot: Automatically generating signals and trading based on machine learning and feature engineering
-
temporian
Temporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applications 🤖
-
Hyperactive
An optimization and data collection toolbox for convenient and fast prototyping of computationally expensive models.
-
serverless-ml-course
Serverless Machine Learning Course for building AI-enabled Prediction Services from models and features
-
mistql
A query / expression language for performing computations on JSON-like structures. Tuned for clientside ML feature extraction.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Featuretools – A Python Library for Automated Feature Engineering | news.ycombinator.com | 2023-09-20
Project mention: Show HN: Web App with GUI for AutoML on Tabular Data | news.ycombinator.com | 2023-08-24Web App is using two open-source packages that I've created:
- MLJAR AutoML - Python package for AutoML on tabular data https://github.com/mljar/mljar-supervised
- Mercury - framework for converting Jupyter Notebooks into Web App https://github.com/mljar/mercury
You can run Web App locally. What is more, you can adjust notebook's code for your needs. For example, you can set different validation strategies or evalutaion metrics or longer training times. The notebooks in the repo are good starting point for you to develop more advanced apps.
Project mention: HFT: High frequency trading. Extended Research - star count:1469.0 | /r/algoprojects | 2023-07-08
Project mention: Comparative Analysis of Memory Consumption: OpenMLDB vs Redis Test Report | dev.to | 2024-04-03b. Pull the testing code
Note that this uses simple OpenAI calls — you can replace this with Langchain, LlamaIndex, Hamilton (or something else) if you prefer more abstraction, and delegate to whatever LLM you like to use. And, you should probably use something a little more concrete (E.G. instructor) to guarantee output shape.
Project mention: Deep_Learning_Machine_Learning_Stock: NEW Deep Learning And Reinforcement Learning - star count:1017.0 | /r/algoprojects | 2023-12-10
I agree that the conventional (numeric) forecasting can hardly benefit from the newest approaches like transformers and LLMs. I made such a conclusion while working on the intelligent trading bot [0] by experimenting with many ML algorithms. Yet, there exist some cases where transformers might provide significant advantages. They could be useful where the (numeric) forecasting is augmented with discrete event analysis and where sequences of events are important. Another use case is where certain patterns are important like those detected in technical analysis. Yet, for these cases much more data is needed.
[0] https://github.com/asavinov/intelligent-trading-bot Intelligent Trading Bot: Automatically generating signals and trading based on machine learning and feature engineering
Project mention: Temporian: Google's Python package for time series preprocessing | news.ycombinator.com | 2024-02-13
feature-engineering related posts
- Comparative Analysis of Memory Consumption: OpenMLDB vs Redis Test Report
- Ultra High-Performance Database OpenM(ysq)LDB: Seamless Compatibility with MySQL Protocol and Multi-Language MySQL Client
- Mastering Distributed Database Development in 10 Minutes with OpenMLDB Developer Docker Image
- Temporian: Google's Python package for time series preprocessing
- OpenMLDB new release v0.8.4
- temporian: NEW Data - star count:283.0
- temporian: NEW Data - star count:283.0
-
A note from our sponsor - SaaSHub
www.saashub.com | 29 Apr 2024
Index
What are some of the best open-source feature-engineering projects? This list will help you:
Project | Stars | |
---|---|---|
1 | nni | 13,742 |
2 | featuretools | 7,022 |
3 | mljar-supervised | 2,929 |
4 | metarank | 1,985 |
5 | feathr | 1,928 |
6 | SGX-Full-OrderBook-Tick-Data-Trading-Strategy | 1,749 |
7 | featureform | 1,674 |
8 | OpenMLDB | 1,550 |
9 | hamilton | 1,312 |
10 | Deep_Learning_Machine_Learning_Stock | 1,148 |
11 | hopsworks | 1,074 |
12 | NVTabular | 1,006 |
13 | functime | 891 |
14 | tsfel | 855 |
15 | intelligent-trading-bot | 737 |
16 | evalml | 712 |
17 | temporian | 624 |
18 | deltapy | 527 |
19 | Hyperactive | 490 |
20 | serverless-ml-course | 483 |
21 | tsflex | 361 |
22 | mistql | 345 |
23 | hrv-analysis | 342 |
Sponsored