Top 23 Python feature-engineering Projects
-
mljar-supervised
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
I'm working on an AI data analyst, MLJAR Studio. It is a conversational UI with an AI agent that uses Python to provide data insights. It is available as a desktop application: https://mljar.com
-
intelligent-trading-bot
Intelligent Trading Bot: Automatically generating signals and trading based on machine learning and feature engineering
-
pixeltable
Project mention: Stop Gluing Data Infrastructure Tools: Build Multimodal AI Workloads and Application with One Declarative Python SDK | dev.to | 2025-07-06
Star us on GitHub: https://github.com/pixeltable/pixeltable
-
functime
Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.
-
NVTabular
NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
-
temporian
Temporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applications 🤖
-
Tabular-data-generation
GANs are well known for their success in realistic image generation. However, they can also be applied to tabular data generation. We review and examine some recent papers about tabular GANs in action.
-
upgini
Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs
-
CAAFE
Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
-
NitroFE
NitroFE is a Python feature engineering engine that provides a variety of modules designed to internally save past-dependent values in order to provide continuous calculation.
-
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
-
Skyulf
Build and ship production ML pipelines faster: a pipeline library with an optional self-hosted visual layer for modular, reproducible workflows, local testing, and experiment tracking.
-
social-media-ai-engineering-etl
Real-world AI engineering dataset creation, SFT fine-tuning, and GRPO alignment ETL pipeline.
Project mention: Real-world dataset creation, SFT fine-tuning, and GRPO alignment pipeline | news.ycombinator.com | 2025-08-28
-
dpq
dpq is an open-source Python library that makes prompt-based data transformations and feature engineering easy.
Python feature-engineering discussion
Python feature-engineering related posts
-
Show HN: Dataclr – Python library simplifying feature selection for ML
-
Dataclr – New feature selection algorithm for ML achieving SOTA results
-
Temporian: Google's Python package for time series preprocessing
-
temporian: NEW Data - star count:283.0
Index
What are some of the best open-source feature-engineering projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | featuretools | 7,655 |
| 2 | mljar-supervised | 3,265 |
| 3 | intelligent-trading-bot | 1,705 |
| 4 | pixeltable | 1,568 |
| 5 | functime | 1,178 |
| 6 | NVTabular | 1,146 |
| 7 | tsfel | 1,094 |
| 8 | evalml | 849 |
| 9 | temporian | 712 |
| 10 | Tabular-data-generation | 570 |
| 11 | Hyperactive | 550 |
| 12 | hrv-analysis | 445 |
| 13 | tsflex | 440 |
| 14 | upgini | 350 |
| 15 | feathub | 349 |
| 16 | CAAFE | 182 |
| 17 | NitroFE | 108 |
| 18 | prosto | 93 |
| 19 | bytehub | 61 |
| 20 | ds2 | 50 |
| 21 | Skyulf | 44 |
| 22 | social-media-ai-engineering-etl | 34 |
| 23 | dpq | 25 |