Top 10 data-discovery Open-Source Projects
-
applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
-
amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
-
OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
-
odd-platform
The first open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
-
opendatadiscovery-specification
ODD Specification is a universal open standard for collecting metadata.
Sounds like you are looking for a data catalog tool instead of a DB schema management tool. You can check out Amundsen (https://www.amundsen.io/) or DataHub (https://datahubproject.io/).
If you are looking for a schema change management tool, then you can check out Bytebase (bytebase.com). But it can't answer questions like "which collections contain links to bigmongo.user.id?"
We'll be using WSL2 for this guide, and we'll start by cloning this repo and its submodules:
Project mention: How to Dynamically Adjust the Height of a Textarea in ReactJS | dev.to | 2023-10-25
In this blog post, I have demonstrated how I addressed the challenge of dynamically adjusting the height of a textarea element based on its content, preventing the need for vertical scrolling in the title section of the OpenMetadata Knowledge article page.
Project mention: OpenDataDiscovery 0.15 with Data Deprecation and Metadata Stale | news.ycombinator.com | 2023-08-04
Project mention: Recap: A python library for describing database tables and serialization formats with minimal type coercion. | /r/dataengineering | 2023-07-12
The GitHub repo: https://github.com/recap-build/recap
data-discovery related posts
-
Ask HN: Looking for DB schema management tool
-
Which open source or commercial tools are used for Data Governance and access management
-
ODD Platform - An open-source data discovery and observability service - v0.12 release
-
Quick Start Guide to Amundsen Demo 🚀
-
How to map out data pipeline of 500-person BI Excel team?
-
Standalone lineage tool
-
Apache Atlas or OpenMetaData?
Index
What are some of the best open-source data-discovery projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | applied-ml | 25,984 |
2 | datahub | 9,230 |
3 | amundsen | 4,277 |
4 | OpenMetadata | 4,180 |
5 | marquez | 1,618 |
6 | sqllineage | 1,126 |
7 | odd-platform | 1,115 |
8 | awesome-data-catalogs | 586 |
9 | recap | 307 |
10 | opendatadiscovery-specification | 116 |