Top 7 data-catalog Open-Source Projects

datahub

34 9,230 9.9 Java

The Metadata Platform for your Data Stack

Project mention: Ask HN: Looking for DB schema management tool | news.ycombinator.com | 2023-10-24

Sounds like you are looking for a data catalog tool instead of db schema management tool. You can check out Amundsen (https://www.amundsen.io/), DataHub (https://datahubproject.io/)
If you are looking for schema change management tool, then you can check out Bytebase (bytebase.com). But it can't answer questions like "which collections contain links to bigmongo.user.id?"

amundsen

7 4,277 7.8 Python

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Project mention: Quick Start Guide to Amundsen Demo 🚀 | dev.to | 2023-05-09

We'll be using WSL2 for this guide, and we'll start by cloning this repo and its submodules:

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
OpenMetadata

26 4,180 10.0 TypeScript

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

Project mention: How to Dynamically Adjust the Height of a Textarea in ReactJS | dev.to | 2023-10-25

In this blog post, I have demonstrated how I addressed the challenge of dynamically adjusting the height of a textarea element based on its content, preventing the need for vertical scrolling in the title section of the OpenMetadata Knowledge article page.

odd-platform

33 1,115 8.7 Java

First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.

Project mention: OpenDataDiscovery 0.15 with Data Deprecation and Metadata Stale | news.ycombinator.com | 2023-08-04

awesome-data-catalogs

9 586 4.2

📙 Awesome Data Catalogs and Observability Platforms.
recap

2 306 8.7 Python

Work with your web service, database, and streaming schemas in a single format.

Project mention: Recap: A python library for describing database tables and serialization formats with minimal type coercion. | /r/dataengineering | 2023-07-12

The Github Repo: https://github.com/recap-build/recap

meteor

1 171 6.7 Go

Meteor is an easy-to-use, plugin-driven metadata collection framework to extract data from different sources and sink to any data catalog. (by raystack)
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

data-catalog related posts

Ask HN: Looking for DB schema management tool

1 project | news.ycombinator.com | 24 Oct 2023
Which open source or commercial tools are used for Data Governance and access management

1 project | /r/dataengineering | 22 Jun 2023
ODD Platform - An open-source data discovery and observability service - v0.12 release

2 projects | /r/dataengineering | 10 May 2023
Quick Start Guide to Amundsen Demo 🚀

1 project | dev.to | 9 May 2023
How to map out data pipeline of 500-person BI Excel team?

1 project | /r/dataengineering | 19 Mar 2023
Standalone lineage tool

2 projects | /r/dataengineering | 16 Mar 2023
Apache Atlas or OpenMetaData?

1 project | /r/dataengineering | 10 Mar 2023
A note from our sponsor - InfluxDB
www.influxdata.com | 7 May 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source data-catalog projects? This list will help you:

	Project	Stars
1	datahub	9,230
2	amundsen	4,277
3	OpenMetadata	4,180
4	odd-platform	1,115
5	awesome-data-catalogs	586
6	recap	306
7	meteor	171

data-catalog

Top 7 data-catalog Open-Source Projects

data-catalog related posts

Ask HN: Looking for DB schema management tool

Which open source or commercial tools are used for Data Governance and access management

ODD Platform - An open-source data discovery and observability service - v0.12 release

Quick Start Guide to Amundsen Demo 🚀

How to map out data pipeline of 500-person BI Excel team?

Standalone lineage tool

Apache Atlas or OpenMetaData?

Index