What data governance tool are you folks using?

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering

Scout Monitoring - Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com
featured
InfluxDB - Purpose built for real-time analytics at any scale.
InfluxDB Platform is powered by columnar analytics, optimized for cost-efficient storage, and built with open data standards.
www.influxdata.com
featured
  • fides

    The Privacy Engineering & Compliance Framework

    I’ve also been impressed with the approach of Fides, an open source privacy management framework that ties into ci/cd, though I haven’t used it myself yet. The thing about it that stood out was Fideslang, their language and taxonomy for representing data privacy primitives.

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
  • fideslang

    Open-source description language for privacy to declare data types and data behaviors in your tech stack in order to simplify data privacy globally. Supports GDPR, CCPA, LGPD and ISO 19944.

    I’ve also been impressed with the approach of Fides, an open source privacy management framework that ties into ci/cd, though I haven’t used it myself yet. The thing about it that stood out was Fideslang, their language and taxonomy for representing data privacy primitives.

  • datahub

    The Metadata Platform for your Data Stack

    I’m a huge fan of DataHub, the open source data catalogue spun out of LinkedIn, but it’s best thought of as an observability layer for data assets that can be shared by data engineers and analyst-types. For data users: it’s a stellar search/discovery interface (what datasets are there on this keyword, which are most broadly used across the organization, what downstream products are made with this data, what’s it usually joined to, are it’s upstream pipelines reliable). For data engineers, it’s a comprehensive asset cataloger, crawling your warehouse, orchestrator, modeling layers, features, and reports, matching the lineage into a graph where it can.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Which data lineage tool did you implement at your company

    2 projects | /r/dataengineering | 29 Mar 2022
  • Metadata extraction and management

    2 projects | /r/dataengineering | 16 Feb 2022
  • Open Source takes center stage at United Nations

    5 projects | news.ycombinator.com | 17 Jul 2024
  • This Week In Python

    5 projects | dev.to | 12 Jul 2024
  • Show HN: Automatically extract data from APIs with dlt and OpenAPI

    2 projects | news.ycombinator.com | 29 May 2024

Did you konow that Python is
the 1st most popular programming language
based on number of metions?