No-Code Self-Service BI/Data Analytics Tool

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

prosto

9 89 3.6 Python

Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby

Most of the self-service or no-code BI, ETL, data wrangling tools are am aware of (like airtable, fieldbook, rowshare, Power BI etc.) were thought of as a replacement for Excel: working with tables should be as easily as working with spreadsheets. This problem can be solved when defining columns within one table: ``ColumnA=ColumnB+ColumnC, ColumnD=ColumnAColumnE`` we get a graph of column computations* similar to the graph of cell dependencies in spreadsheets.
Yet, the main problem is in working multiple tables: how can we define a column in one table in terms of columns in other tables? For example: ``Table1::ColumnA=FUNCTION(Table2::ColumnB, Table3::ColumnC)`` Different systems provided different answers to this question but all of them are highly specific and rather limited.
Why it is difficult to define new columns in terms of other columns in other tables? Short answer is that working with columns is not the relational approach. The relational model is working with sets (rows of tables) and not with columns.
One generic approach to working with columns in multiple tables is provided in the concept-oriented model of data which treats mathematical functions as first-class elements of the model. Previously it was implemented in a data wrangling tool called Data Commander. But them I decided to implement this model in the *Prosto* data processing toolkit which is an alternative to map-reduce and SQL:
https://github.com/asavinov/prosto
It defines data transformations as operations with columns in multiple tables. Since we use mathematical functions, no joins and no groupby operations are needed and this significantly simplifies and makes more natural the task of data transformations.
Moreover, now it provides *Column-SQL* which makes it even easier to define new columns in terms of other columns:
https://github.com/asavinov/prosto/blob/master/notebooks/col...

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Functions matter – an alternative to SQL and map-reduce for data processing

1 project | /r/datascience | 19 May 2021
NoSQL Data Modeling Techniques

1 project | news.ycombinator.com | 10 Apr 2021
[P] Open data transformations in Python, no SQL required

3 projects | /r/MachineLearning | 1 Mar 2022
Show HN: Hamilton, a Microframework for Creating Dataframes

6 projects | news.ycombinator.com | 8 Nov 2021
Azure data lake - Data Share

1 project | /r/dataengineering | 29 Jun 2023

No-Code Self-Service BI/Data Analytics Tool

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Workflow Data processing map-reduce Spark Pandas
Post date: 13 Nov 2021

prosto

InfluxDB

Related posts

Functions matter – an alternative to SQL and map-reduce for data processing

NoSQL Data Modeling Techniques

[P] Open data transformations in Python, no SQL required

Show HN: Hamilton, a Microframework for Creating Dataframes

Azure data lake - Data Share

No-Code Self-Service BI/Data Analytics Tool

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Workflow Data processing map-reduce Spark Pandas Post date: 13 Nov 2021

prosto

InfluxDB

Related posts

Functions matter – an alternative to SQL and map-reduce for data processing

NoSQL Data Modeling Techniques

[P] Open data transformations in Python, no SQL required

Show HN: Hamilton, a Microframework for Creating Dataframes

Azure data lake - Data Share

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Workflow Data processing map-reduce Spark Pandas
Post date: 13 Nov 2021