datahub VS amundsen

Compare datahub and amundsen and see how they differ.

amundsen

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data. (by amundsen-io)
                 datahub               amundsen
Mentions         35                    7
Stars            9,977                 4,452
Growth           1.4%                  0.7%
Activity         10.0                  7.5
Last commit      3 days ago            21 days ago
Language         Java                  Python
License          Apache License 2.0    Apache License 2.0
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
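As a rough illustration of the Growth figure, here is a minimal sketch of a month-over-month percentage change in stars. The exact formula the site uses is not stated, so the function below is an assumption, and the previous-month star count (9,839) is a hypothetical value chosen only for the example.

```python
def month_over_month_growth(stars_now: int, stars_last_month: int) -> float:
    """Return the percentage change in stars relative to last month's count.

    Assumed formula: (now - previous) / previous * 100.
    """
    if stars_last_month <= 0:
        raise ValueError("stars_last_month must be positive")
    return (stars_now - stars_last_month) / stars_last_month * 100.0


# 9,977 is datahub's star count from the comparison above;
# 9,839 is a hypothetical count for the previous month.
growth = month_over_month_growth(9977, 9839)
print(f"{growth:.1f}%")
```

Under this assumed formula, a project that goes from 9,839 to 9,977 stars in a month would show roughly the 1.4% growth listed for datahub.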

datahub

Posts with mentions or reviews of datahub. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-10.

amundsen

Posts with mentions or reviews of amundsen. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-07-19.
  • Quick Start Guide to Amundsen Demo 🚀
    1 project | dev.to | 9 May 2023
    We'll be using WSL2 for this guide, and we'll start by cloning this repo and its submodules:
  • Apache Atlas or OpenMetaData?
    1 project | /r/dataengineering | 10 Mar 2023
    You can use the Amundsen databuilder to send data to Apache Atlas: https://github.com/amundsen-io/amundsen/blob/main/databuilder/example/scripts/sample_atlas_search_extractor.py. If you don't have to configure Apache Atlas, then why not, but server-side validation was absent the last time I used it: you couldn't validate the JSON body sent to the REST API endpoints.
  • Searching for Delta Lake Cataloging
    1 project | /r/dataengineering | 26 Apr 2022
    Other than that, maybe you could try amundsen (https://github.com/amundsen-io/amundsen/issues/608) which now has a connector to extract delta lake metadata via Spark.
  • Help with Data Discoverability in a Data Lake
    1 project | /r/dataengineering | 17 Aug 2021
  • Launch YC S21: Meet the Batch, Thread #6
    1 project | news.ycombinator.com | 12 Aug 2021
    How does it differ from something like Amundsen: https://github.com/amundsen-io/amundsen
  • Metadata and how to capture it
    3 projects | /r/dataengineering | 19 Jul 2021
    Metadata engines: Datahub (https://github.com/linkedin/datahub), Amundsen (https://github.com/amundsen-io/amundsen/), Marquez (https://marquezproject.github.io/), Egeria - Open Metadata and Governance (https://egeria.odpi.org)
  • The State of Data Engineering in 2021
    3 projects | /r/Python | 6 May 2021
    A final category worth highlighting is Discovery, where it seems every notable company developed an internal Data Catalogue tool that now is available as an open-source or paid service. Some examples are Amundsen (Lyft), Datahub (LinkedIn), Metacat (Netflix), Databook (Uber), and Dataportal (Airbnb).

What are some alternatives?

When comparing datahub and amundsen you can also consider the following projects:

OpenMetadata - OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

OpenLineage - An Open Standard for lineage metadata collection

marquez - Collect, aggregate, and visualize a data ecosystem's metadata

atlas - Manage your database schema as code

metacat

sickbeard_mp4_automator - Automatically convert video files to a standardized format with metadata tagging to create a beautiful and uniform media library

Atlas - 🚀 An open and lightweight modification to Windows, designed to optimize performance, privacy and usability.

Medusa - The world's most flexible commerce platform.

monosi - Open source data observability platform

amundsendatabuilder - Data ingestion library for Amundsen to build graph and search index

