Launch HN: Elementary (YC W22) – Open-source data observability

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

elementary

30 1,736 9.8 HTML

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

Sure, please compare: https://re-data.github.io/dbt-re-data/#!/overview?g_v=1 and https://docs.elementary-data.com/ graph png.
Elementary models like data_monitors_thread1, data_monitors_thread2, data_monitors_thread3, data_monitors_thread4, data_monitoring_metrics, latest_metrics, metrics_stats_for_anomalies, z_score, anomaly_detection, schema_schenages, etc.

re_data

15 1,521 7.1 HTML

re_data - fix data issues before your users & CEO would discover them 😊

Nice project, at re_data we just got over a lot of your new updates and it seems a quite large part of your project is "inspired" by code from our library https://github.com/re-data/re-data. Even with parts, we are not especially proud of ;)
If you decide to copy not only ideas but a big part of internal implementation, I think you should include that information in your LICENSE.
Cheers

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
dbt-data-reliability

2 338 9.7 Python

dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

For any dbt users, their reliability package has the best and most comprehensive way to upload artifacts directly to the warehouse after a dbt invocation.
https://github.com/elementary-data/dbt-data-reliability

sqllineage

3 1,120 8.6 Python

SQL Lineage Analysis Tool powered by Python

Is the idea here that it's inspired by re_data due to using dbt transformations underneath or because it's reposted looking nearly the same? (or both?)
Looks like much of the lineage code is also largely a wrapper around this library: https://github.com/reata/sqllineage
Would be curious to understand the project's purpose and unique contributions vs. the underlying dependencies powering it as there seems to be some ambiguity. Is this just a wrapper around dbt transformations and a lineage library in one package? Can I just use them directly?

deequ

17 3,119 7.5 Scala

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Does this in essence similar to the aws deeque project but fancier and more inclusive of edge cases, common scenarios? (https://github.com/awslabs/deequ)

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Snowflake SQL AST parser?
2 projects | /r/dataengineering | 5 Apr 2022
Multiwoven Reverse ETL (0.2.0) – Open-Source Alternative to Hightouch and Census
1 project | news.ycombinator.com | 19 Apr 2024
Show HN: Privacy-first analytics in natural language in the browser
1 project | news.ycombinator.com | 17 Apr 2024
Plotting Financial Data in Kotlin with Kandy
3 projects | dev.to | 9 Apr 2024
Sqlime: Online SQLite Playground
5 projects | news.ycombinator.com | 9 Apr 2024

Launch HN: Elementary (YC W22) – Open-source data observability

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
data-observability dataquality data-lineage Data Analysis data-reliability
Post date: 4 Mar 2022

elementary

re_data

WorkOS

dbt-data-reliability

sqllineage

deequ

InfluxDB

Related posts

Launch HN: Elementary (YC W22) – Open-source data observability

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com data-observability dataquality data-lineage Data Analysis data-reliability Post date: 4 Mar 2022

elementary

re_data

WorkOS

dbt-data-reliability

sqllineage

deequ

InfluxDB

Related posts

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
data-observability dataquality data-lineage Data Analysis data-reliability
Post date: 4 Mar 2022