metricflow VS dbt

Compare metricflow vs dbt and see what are their differences.

metricflow

MetricFlow allows you to define, build, and maintain metrics in code. (by dbt-labs)

dbt

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications. [Moved to: https://github.com/dbt-labs/dbt-core] (by fishtown-analytics)
Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
metricflow dbt
4 1
1,073 3,802
2.5% -
9.8 10.0
4 days ago over 2 years ago
Python Python
GNU General Public License v3.0 or later Apache License 2.0
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

metricflow

Posts with mentions or reviews of metricflow. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-04-06.
  • MetricFlow allows you to define, build, and maintain metrics in code.
    1 project | /r/CKsTechNews | 6 Apr 2022
  • Show HN: MetricFlow – open-source metric framework
    4 projects | news.ycombinator.com | 6 Apr 2022
    Three things:

    First, MetricFlow does not currently support MySQL. We launched with support for BigQuery, Redshift, and Snowflake. I have opened an issue to add support for MySQL (and similar issues for other SQL engines are coming): https://github.com/transform-data/metricflow/issues/27

    Second, what we call a data source is more similar to a table in a database, rather than the underlying database service itself. Metricflow itself is useful when you're using a single SQL engine - indeed, that's all we support today - but it is most useful when you're in a world where joins are a thing. That said, if you have one big data table you might still find it useful to have declarative metric definitions defined in Metricflow. Suppose, for example, you had a big NoSQL style table filled with JSON objects. You might define a few data sources that normalize those JSON objects into top level elements (identifiers, dimensions, aggregated measures) using the sql_query data source config attribute, and then that'd allow you to support structured queries on the data consumption end while pushing unstructured blobs from your application layer. This will be slow at query time, and only as reliable as the level of discipline exerted in your application development workflow, but it's possible.

    Third, if we did support MySQL you'd basically connect to it via standard connection parameters - we have a config file where you can store the required information and then we'll manage the connections for you. However, I'm not familiar with uxwizz, and a quick perusal of their documentation did not turn up how one goes about connecting to the underlying DB. It's likely I just missed this, but at any rate I don't know how it is done. If they don't support standard MySQL client connections you'd need to write an adapter of some kind against whatever DB connection APIs they provide, in which case you'd likely need to roll a custom implementation of MetricFlow's SqlClient interface and initialize the MetricFlowEngine with that.

dbt

Posts with mentions or reviews of dbt. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-11-25.
  • Open Source Analytics Stack: Bringing Control, Flexibility, and Data-Privacy to Your Analytics
    15 projects | dev.to | 25 Nov 2021
    Due to the rise in cloud-based data warehouses, businesses can directly load all the raw data into the data warehouse without prior transformations. This process is known as ELT (Extract, Load, Transform) and gives data and analytics teams freedom to develop ad-hoc transformations based on their particular needs. ELT became popular as the cloud's processing power and scale became better suited to transforming data. DBT (website, GitHub) is a popular open-source tool recommended for ELT and allows businesses to transform data in their warehouses more effectively. It's a great pairing with with RudderStack's Cloud Extract ETL tool.

What are some alternatives?

When comparing metricflow and dbt you can also consider the following projects:

dbt-core - dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Apache Kafka - Mirror of Apache Kafka

dbt_metrics - Macros for calculating metrics

airbyte - The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Apache Superset - Apache Superset is a Data Visualization and Data Exploration Platform [Moved to: https://github.com/apache/superset]

superset - Apache Superset is a Data Visualization and Data Exploration Platform

dictum - Describe business metrics with YAML, query and visualize in Jupyter with zero SQL

Snowplow - The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP

datafluent_pg - Build a better understanding of your data in PostgreSQL.

nbdev - Create delightful software with Jupyter Notebooks

rudderstack-docs - Documentation repository for RudderStack - the Customer Data Platform for Developers.

ClickHouse - ClickHouse® is a free analytics DBMS for big data