odd-platform
awesome-data-catalogs
odd-platform | awesome-data-catalogs | |
---|---|---|
33 | 9 | |
1,115 | 586 | |
2.0% | 4.3% | |
8.7 | 4.2 | |
3 days ago | 8 months ago | |
Java | ||
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
odd-platform
- OpenDataDiscovery 0.15 with Data Deprecation and Metadata Stale
- ODD Platform - An open-source data discovery and observability service - v0.12 release
- ODD Platform - An open-source data discovery and observability service - v0.11.1 release
-
Release 0.11 of OpenDataDiscovery Platform w/ metrics, search explanations & new dataset structure
Get to know about OpenDataDiscovery: https://opendatadiscovery.org/
awesome-data-catalogs
-
How to map out data pipeline of 500-person BI Excel team?
Check out this GitHub awesome list of Data Catalogs.
-
Standalone lineage tool
Maybe what you want i some specification from which you can build something? In that way, perhaps this can help you https://github.com/opendatadiscovery/awesome-data-catalogs. Airflow uses OpenLineage as a way to send their metadata, and Marquez collects them to show them in their UI (https://openlineage.io/docs/guides/airflow), so I suppose you would want to do something similar? But maybe in that GitHub you can find other specifications that can help you better.
- Our data catalog is difficult to manage and not built for the wider org - what can we do?
-
Looking for an "offline" data discovery platform
In order to gain a understanding of the tables and their contents in our company, I have implemented one of the existing [data discovery platforms](https://github.com/opendatadiscovery/awesome-data-catalogs) (in my case [Amundsen](https://www.amundsen.io/))). Unfortunately, Amundsen can only display the tables it has access to.
-
Open source data catalog
I got nice data catalog summary in case anyone would be interested - https://github.com/opendatadiscovery/awesome-data-catalogs. It is probably biased since author is also author of one of the data catalogs, but still can be quite useful :)
- Data Catalog High level feature comparison
- Data Catalog Comparison List
- Awesome-data-catalogs – A curated list of data catalogs
-
Ask HN: Is there any data catalog that targets ML as the first citizen?
Hi, I would like to know is there any opensource data catalog systems that targets machine learning applications (datasets (unstructral, e.g., text, image, and video) and models) as the citizen?
I have read the awesome-data-catalogs ([1]) list but found none of them is treating ML as 1st cizten and the support for datasets and models are not specific enough.
[1]: https://github.com/opendatadiscovery/awesome-data-catalogs
What are some alternatives?
OpenMetadata - Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
android-analytics-debugger - The Avo Android analytics debugger
datahub-helm - Repository of helm charts for deploying DataHub on a Kubernetes cluster
grai-core
CKAN - CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data.humdata.org among many other sites.
datahub - The Metadata Platform for your Data Stack
opendatadiscovery-specification - ODD Specification is a universal open standard for collecting metadata.
opendatadiscovery-speci
awesome-italian-public-datasets - A selection of interesting Open dataset from the Italian Public Administration and Civic Data use cases