Top 23 Python feature-engineering Projects
-
mljar-supervised
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
Project mention: Python, notebooks, no code recipes, AI = new desktop app for data analysis | news.ycombinator.com | 2025-06-01
-
intelligent-trading-bot
Intelligent Trading Bot: Automatically generating signals and trading based on machine learning and feature engineering
-
functime
Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.
-
NVTabular
NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
-
pixeltable
Pixeltable - AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.
Project mention: Stop Gluing Data Infrastructure Tools: Build Multimodal AI Workloads and Application with One Declarative Python SDK | dev.to | 2025-07-06
Star us on GitHub: https://github.com/pixeltable/pixeltable
-
temporian
Temporian is an open-source Python library for preprocessing and feature engineering temporal data for machine learning applications
-
Hyperactive
An optimization and data collection toolbox for convenient and fast prototyping of computationally expensive models.
-
upgini
Data search & enrichment library for Machine Learning - Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs
-
CAAFE
Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
-
NitroFE
NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for providing continuous calculation.
-
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
-
dpq
dpq is an open-source Python library that makes prompt-based data transformations and feature engineering easy
-
HDB_Resale_Prices
Predicts Singapore HDB resale prices (2015-2019) and identifies their drivers, with an R² of 0.96 and an MAE of $20,000. Deployed as a Streamlit web app for user price prediction.
-
Project mention: Show HN: Dataclr - Python library simplifying feature selection for ML | news.ycombinator.com | 2025-01-06
Python feature-engineering discussion
Python feature-engineering related posts
-
Show HN: Dataclr - Python library simplifying feature selection for ML
-
Dataclr - New feature selection algorithm for ML achieving SOTA results
-
Temporian: Google's Python package for time series preprocessing
-
temporian: NEW Data - star count:283.0
Index
What are some of the best open-source feature-engineering projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | featuretools | 7,528 |
| 2 | mljar-supervised | 3,193 |
| 3 | intelligent-trading-bot | 1,474 |
| 4 | functime | 1,116 |
| 5 | NVTabular | 1,097 |
| 6 | tsfel | 1,038 |
| 7 | evalml | 824 |
| 8 | pixeltable | 741 |
| 9 | temporian | 695 |
| 10 | Hyperactive | 529 |
| 11 | tsflex | 427 |
| 12 | hrv-analysis | 416 |
| 13 | feathub | 338 |
| 14 | upgini | 337 |
| 15 | CAAFE | 168 |
| 16 | NitroFE | 106 |
| 17 | prosto | 91 |
| 18 | bytehub | 61 |
| 19 | ds2 | 50 |
| 20 | dpq | 24 |
| 21 | lambdo | 24 |
| 22 | HDB_Resale_Prices | 23 |
| 23 | dataclr | 17 |