datadiscovery

Open-source projects categorized as datadiscovery

Top 3 datadiscovery Open-Source Projects

datadiscovery
  1. OpenMetadata

    OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

    Project mention: Show HN: OpenMetadata – OSS platform for data discovery observability governance | news.ycombinator.com | 2024-07-17

    * It seems like DataHub has an async Kafka ingestion approach while OpenMetadata is API

    We do not use Kafka by default. If someone needs kafka they can add it. However for Metadata APIs, we do not feel like Kafka is needed. Lot of projects are getting dependent on Kafka and calling it as real-time. Its unnecessary burden on users who are going to operate in production for 99% of use-cases Kafka is not needed, coming from a Kafka committer :)

    2. Yes all of our APIs and Entity definitions are generated using JsonSchema. For us, Json Schema has been awesome, all of our backend / ingestion and UI is generated from JsonSchema and its easy to extend and add new models when needed

    3. IMO, we have much more coverage , you can look at the types available here https://github.com/open-metadata/OpenMetadata/tree/main/open... and we are support JsonSchema as a type from a long time

  2. Civic Auth

    Auth in Less Than 5 Minutes. Civic Auth comes with multiple SSO options, optional embedded wallets, and user management — all implemented with just a few lines of code. Start building today.

    Civic Auth logo
  3. awesome-data-catalogs

    📙 Awesome Data Catalogs and Observability Platforms.

  4. metacrafter

    Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully customizable and flexible rules

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

datadiscovery discussion

Log in or Post with

datadiscovery related posts

  • Show HN: OpenMetadata – OSS platform for data discovery observability governance

    2 projects | news.ycombinator.com | 17 Jul 2024
  • OpenMetadata: Join the #1 Open Source Data Community

    1 project | news.ycombinator.com | 20 Jun 2024
  • How to Dynamically Adjust the Height of a Textarea in ReactJS

    1 project | dev.to | 25 Oct 2023
  • Blog - Project Nessie: A Look in the Depths

    1 project | /r/bigdata | 11 Jul 2023
  • What is your favorite data catalog?

    2 projects | /r/dataengineering | 25 Jun 2023
  • Data Governance Hands On with Amazon DataZone

    1 project | dev.to | 22 May 2023
  • What OSS are you using for data contracts?

    1 project | /r/dataengineering | 3 May 2023
  • A note from our sponsor - Civic Auth
    www.civic.com | 24 Apr 2025
    Civic Auth comes with multiple SSO options, optional embedded wallets, and user management — all implemented with just a few lines of code. Start building today. Learn more →

Index

What are some of the best open-source datadiscovery projects? This list will help you:

# Project Stars
1 OpenMetadata 6,481
2 awesome-data-catalogs 829
3 metacrafter 44

Sponsored
Auth in Less Than 5 Minutes
Civic Auth comes with multiple SSO options, optional embedded wallets, and user management — all implemented with just a few lines of code. Start building today.
www.civic.com

Did you know that TypeScript is
the 1st most popular programming language
based on number of references?