Looking for an open-source data lineage app, where objects and connections can be manually defined (not just automatically ingested)

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering

Our great sponsors
  • Onboard AI - Learn any GitHub repo in 59 seconds
  • InfluxDB - Collect and Analyze Billions of Data Points in Real Time
  • SaaSHub - Software Alternatives and Reviews
  • kedro-viz

    Visualise your Kedro data and machine-learning pipelines and track your experiments.

    At this point, I'll even be happy with a pure visualization engine, like for instance if I can repurpose kedro-viz so that it can take a csv or json of object relationships as an input. I'd also be happy if any of the aforementioned lineage tools I mentioned above have this functionality and I just missed it.

  • OpenMetadata

    Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

    Hello everyone, I'm looking for an open-source data lineage app (e.g. tokern, datahubproject, openmetadata).

  • Onboard AI

    Learn any GitHub repo in 59 seconds. Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at www.getonboard.dev.

  • datahub

    The Metadata Platform for the Modern Data Stack

    Hello everyone, I'm looking for an open-source data lineage app (e.g. tokern, datahubproject, openmetadata).

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts