odd-platform
opendatadiscovery-specification
odd-platform | opendatadiscovery-specification | |
---|---|---|
33 | 2 | |
1,115 | 116 | |
2.0% | 1.7% | |
8.7 | 6.2 | |
3 days ago | 17 days ago | |
Java | ||
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
odd-platform
- OpenDataDiscovery 0.15 with Data Deprecation and Metadata Stale
- ODD Platform - An open-source data discovery and observability service - v0.12 release
- ODD Platform - An open-source data discovery and observability service - v0.11.1 release
-
Release 0.11 of OpenDataDiscovery Platform w/ metrics, search explanations & new dataset structure
Get to know about OpenDataDiscovery: https://opendatadiscovery.org/
opendatadiscovery-specification
-
Show HN: First open source data discovery and observability platform
Thank you!
Actually everything is working on a push basis in ODD now. ODD Platform implements ODD Specification (https://github.com/opendatadiscovery/opendatadiscovery-speci...) and all agents, custom scripts and integrations, Airflow/Spark listeners, etc are pushing metadata to specific ODD Platform's endpoint (https://github.com/opendatadiscovery/opendatadiscovery-speci...). ODD Collectors (agents) are pushing metadata on a configurable schedule.
ODD Specification is a standard for collecting and gathering such metadata, ETL included. We gather metadata for lineage on an entity level now, but we plan to expand this to the column-level lineage at the end 2022 — start 2023. Specification allows us to make the system open and it's really easy to write your own integration by taking a look in what format metadata needs to be injected in the Platform.
ODD Platform has its own OpenAPI specification (https://github.com/opendatadiscovery/odd-platform/tree/main/...) so that the already indexed and layered metadata could be extracted via platform's API.
Also, thank you for sharing links with us! I'm thrilled to take a look how BMW solved a problem of lineage gathering from Spark, that's something we are improving in our product right now.
What are some alternatives?
OpenMetadata - Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
awesome-data-catalogs - 📙 Awesome Data Catalogs and Observability Platforms.
datahub-helm - Repository of helm charts for deploying DataHub on a Kubernetes cluster
spline - Data Lineage Tracking And Visualization Solution
CKAN - CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data.humdata.org among many other sites.
opendatadiscovery-speci
datahub - The Metadata Platform for your Data Stack
metadata-guardian - Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️