data-discovery

Open-source projects categorized as data-discovery

Top 10 data-discovery Open-Source Projects

  • applied-ml

    📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

  • datahub

    The Metadata Platform for your Data Stack

  • Project mention: Ask HN: Looking for DB schema management tool | news.ycombinator.com | 2023-10-24

    Sounds like you are looking for a data catalog tool instead of db schema management tool. You can check out Amundsen (https://www.amundsen.io/), DataHub (https://datahubproject.io/)

    If you are looking for schema change management tool, then you can check out Bytebase (bytebase.com). But it can't answer questions like "which collections contain links to bigmongo.user.id?"

  • amundsen

    Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

  • Project mention: Quick Start Guide to Amundsen Demo 🚀 | dev.to | 2023-05-09

    We'll be using WSL2 for this guide, and we'll start by cloning this repo and its submodules:

  • OpenMetadata

    Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

  • Project mention: How to Dynamically Adjust the Height of a Textarea in ReactJS | dev.to | 2023-10-25

    In this blog post, I have demonstrated how I addressed the challenge of dynamically adjusting the height of a textarea element based on its content, preventing the need for vertical scrolling in the title section of the OpenMetadata Knowledge article page.

  • marquez

    Collect, aggregate, and visualize a data ecosystem's metadata

  • sqllineage

    SQL Lineage Analysis Tool powered by Python

  • Project mention: FLaNK Stack Weekly for 12 September 2023 | dev.to | 2023-09-12
  • odd-platform

    First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.

  • Project mention: OpenDataDiscovery 0.15 with Data Deprecation and Metadata Stale | news.ycombinator.com | 2023-08-04
  • awesome-data-catalogs

    📙 Awesome Data Catalogs and Observability Platforms.

  • recap

    Work with your web service, database, and streaming schemas in a single format.

  • Project mention: Recap: A python library for describing database tables and serialization formats with minimal type coercion. | /r/dataengineering | 2023-07-12

    The Github Repo: https://github.com/recap-build/recap

  • opendatadiscovery-specification

    ODD Specification is a universal open standard for collecting metadata.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source data-discovery projects? This list will help you:

Rank  Project                          Stars
1     applied-ml                       25,984
2     datahub                          9,230
3     amundsen                         4,277
4     OpenMetadata                     4,180
5     marquez                          1,618
6     sqllineage                       1,126
7     odd-platform                     1,115
8     awesome-data-catalogs            586
9     recap                            307
10    opendatadiscovery-specification  116
