-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
I find this project quite interesting because sklearn has a good general design including data transformations and it does make sense to provide compatible functionality for Go.
Feature engineering in general is a hot topic and especially if features are not simple hard-coded transformations but rather can be learned from data. For example, I developed a toolkit intended for combining feature engineering and ML:
https://github.com/asavinov/lambdo - Feature engineering and machine learning: together at last!
(Currently, it is not actively developed and the focus is moved to a similar project - https://github.com/asavinov/prosto - also focused on data preprocessing and feature engineering)
Related posts
-
Show HN: JupySQL – a SQL client for Jupyter (ipython-SQL successor)
-
Featuretools – A Python Library for Automated Feature Engineering
-
New to large SW projects in Python, best practices to organize code
-
A three-part series on deploying a Data Science Platform on AWS
-
Ploomber Cloud - Parametrizing and running notebooks in the cloud in parallel