Show HN: First open source data discovery and observability platform

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

odd-platform

33 1,115 8.7 Java

First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.

Thank you!
Actually everything is working on a push basis in ODD now. ODD Platform implements ODD Specification (https://github.com/opendatadiscovery/opendatadiscovery-speci...) and all agents, custom scripts and integrations, Airflow/Spark listeners, etc are pushing metadata to specific ODD Platform's endpoint (https://github.com/opendatadiscovery/opendatadiscovery-speci...). ODD Collectors (agents) are pushing metadata on a configurable schedule.
ODD Specification is a standard for collecting and gathering such metadata, ETL included. We gather metadata for lineage on an entity level now, but we plan to expand this to the column-level lineage at the end 2022 — start 2023. Specification allows us to make the system open and it's really easy to write your own integration by taking a look in what format metadata needs to be injected in the Platform.
ODD Platform has its own OpenAPI specification (https://github.com/opendatadiscovery/odd-platform/tree/main/...) so that the already indexed and layered metadata could be extracted via platform's API.
Also, thank you for sharing links with us! I'm thrilled to take a look how BMW solved a problem of lineage gathering from Spark, that's something we are improving in our product right now.

opendatadiscovery-specification

2 117 6.2

ODD Specification is a universal open standard for collecting metadata.

Thank you!
Actually everything is working on a push basis in ODD now. ODD Platform implements ODD Specification (https://github.com/opendatadiscovery/opendatadiscovery-speci...) and all agents, custom scripts and integrations, Airflow/Spark listeners, etc are pushing metadata to specific ODD Platform's endpoint (https://github.com/opendatadiscovery/opendatadiscovery-speci...). ODD Collectors (agents) are pushing metadata on a configurable schedule.
ODD Specification is a standard for collecting and gathering such metadata, ETL included. We gather metadata for lineage on an entity level now, but we plan to expand this to the column-level lineage at the end 2022 — start 2023. Specification allows us to make the system open and it's really easy to write your own integration by taking a look in what format metadata needs to be injected in the Platform.
ODD Platform has its own OpenAPI specification (https://github.com/opendatadiscovery/odd-platform/tree/main/...) so that the already indexed and layered metadata could be extracted via platform's API.
Also, thank you for sharing links with us! I'm thrilled to take a look how BMW solved a problem of lineage gathering from Spark, that's something we are improving in our product right now.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
opendatadiscovery-speci

2 - -

Thank you!
Actually everything is working on a push basis in ODD now. ODD Platform implements ODD Specification (https://github.com/opendatadiscovery/opendatadiscovery-speci...) and all agents, custom scripts and integrations, Airflow/Spark listeners, etc are pushing metadata to specific ODD Platform's endpoint (https://github.com/opendatadiscovery/opendatadiscovery-speci...). ODD Collectors (agents) are pushing metadata on a configurable schedule.
ODD Specification is a standard for collecting and gathering such metadata, ETL included. We gather metadata for lineage on an entity level now, but we plan to expand this to the column-level lineage at the end 2022 — start 2023. Specification allows us to make the system open and it's really easy to write your own integration by taking a look in what format metadata needs to be injected in the Platform.
ODD Platform has its own OpenAPI specification (https://github.com/opendatadiscovery/odd-platform/tree/main/...) so that the already indexed and layered metadata could be extracted via platform's API.
Also, thank you for sharing links with us! I'm thrilled to take a look how BMW solved a problem of lineage gathering from Spark, that's something we are improving in our product right now.

spline

1 582 6.3 Scala

Data Lineage Tracking And Visualization Solution (by AbsaOSS)

We found a way by leveraging the Spline Agent (https://github.com/AbsaOSS/spline) to make use of the Execution Plans, transform them into a suiting data model for our set of requirements and developed a UI to explore these relationships. We also open-sourced our approach in a

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

OpenDataDiscovery 0.15 with Data Deprecation and Metadata Stale

1 project | news.ycombinator.com | 4 Aug 2023
ODD Platform - An open-source data discovery and observability service - v0.12 release

1 project | /r/aipromptprogramming | 27 May 2023
ODD Platform - An open-source data discovery and observability service - v0.12 release

1 project | /r/artificial | 26 May 2023
ODD Platform - An open-source data discovery and observability service - v0.12 release

1 project | /r/dataengineering | 22 May 2023
ODD Platform - An open-source data discovery and observability service - v0.12 release

1 project | /r/opensource | 17 May 2023

Show HN: First open source data discovery and observability platform

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
data platform lineage Metadata metadata-management Bigdata
Post date: 22 Oct 2022

odd-platform

opendatadiscovery-specification

InfluxDB

opendatadiscovery-speci

spline

Related posts

OpenDataDiscovery 0.15 with Data Deprecation and Metadata Stale

ODD Platform - An open-source data discovery and observability service - v0.12 release

ODD Platform - An open-source data discovery and observability service - v0.12 release

ODD Platform - An open-source data discovery and observability service - v0.12 release

ODD Platform - An open-source data discovery and observability service - v0.12 release

Show HN: First open source data discovery and observability platform

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com data platform lineage Metadata metadata-management Bigdata Post date: 22 Oct 2022

odd-platform

opendatadiscovery-specification

InfluxDB

opendatadiscovery-speci

spline

Related posts

OpenDataDiscovery 0.15 with Data Deprecation and Metadata Stale

ODD Platform - An open-source data discovery and observability service - v0.12 release

ODD Platform - An open-source data discovery and observability service - v0.12 release

ODD Platform - An open-source data discovery and observability service - v0.12 release

ODD Platform - An open-source data discovery and observability service - v0.12 release

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
data platform lineage Metadata metadata-management Bigdata
Post date: 22 Oct 2022