sqlfluff vs dbt-utils

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

sqlfluff		dbt-utils
	Project
35	Mentions	7
7,199	Stars	1,196
2.1%	Growth	3.9%
9.6	Activity	5.4
4 days ago	Latest Commit	7 days ago
Python	Language	Python
MIT License	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

sqlfluff

Posts with mentions or reviews of sqlfluff. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-10-19.

🐍🐍 23 issues to grow yourself as an exceptional open-source Python expert 🧑‍💻 🥇
10 projects | dev.to | 19 Oct 2023

Repo : https://github.com/sqlfluff/sqlfluff
SQL Reserved Words – The Empirical List
2 projects | news.ycombinator.com | 11 Oct 2023

I'm surprised sqlfluff hasn't been mentioned yet. Perhaps not a comprehensive list, but it's worked for everything I've thrown at it. There's an ANSI keyword list [0], and then dialect-specific lists for everything from DB2 [1] to Snowflake [2].
[0]: https://github.com/sqlfluff/sqlfluff/blob/main/src/sqlfluff/...
Show HN: Postgres Language Server
21 projects | news.ycombinator.com | 6 Aug 2023

It has tons of annoying quirks, but I couldn't imagine running a DBT project without it: https://github.com/sqlfluff/sqlfluff
Front page news headline scraping data engineering project
3 projects | /r/dataengineering | 13 May 2023

Move SQL queries to sql files and read from files (Use sqlfluff to lint the code https://github.com/sqlfluff/sqlfluff)
Anything like SQLFluff written in Rust?
1 project | /r/rust | 9 Mar 2023
Code autoformatter for SQL in VSCode that plays nicely with dbt
1 project | /r/dataengineering | 23 Feb 2023

SQLFluff is a good CLI tool for this and includes support for jinja and dbt. I don't think there's a VSCode plugin for it yet.
Ask HN: How do you test SQL?
18 projects | news.ycombinator.com | 31 Jan 2023

This linter can really enforce some best practices https://github.com/sqlfluff/sqlfluff
A list of best practices:
What is something you would learn at college but not a bootcamp (hard skills)
2 projects | /r/cscareerquestions | 12 Jan 2023

BigQuery SQL and SQLFluff
Is the knowledge on how Compilers work applicable to the role of a Data Engineer?
2 projects | /r/dataengineering | 11 Jan 2023

There's a SQL parser/linter called SQLFluff that my team uses for our CI/CD. I've made a few pull requests to fix the parser for the particular SQL dialect we used, and my college compiler classes definitely helped.
sqlfluff VS ANTLR - a user suggested alternative
2 projects | 12 Dec 2022

dbt-utils

Posts with mentions or reviews of dbt-utils. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-08.

Show HN: Nasty, a cross warehouse, type checked, unit testable analytics library
2 projects | news.ycombinator.com | 8 Mar 2024
// To get around this, we can use the approach outlined by how dbt does ansi sql generate_series
```
  // https://github.com/dbt-labs/dbt-utils/blob/main/macros/sql/generate_series.sql
```
Anything one should know before going for self-hosted dbt?
1 project | /r/dataengineering | 10 Jul 2023

I got bit by dbt-utils/deduplicate naively removing any row that contained a null in it recently, but fortunately there was a workaround for Databricks and a few other flavors of SQL.
Managing SQL Tests
2 projects | /r/dataengineering | 30 Mar 2023

I'm used to utilising dbt and defining my tests there (along with dbt-utils or https://github.com/calogica/dbt-expectations): I simply add a list item to a column definition and can already define a great number of tests without having to copy code. I can even extend the pre-defined using generic tests. Writing custom tests also integrates nicely. Additionally it's very convenient to tag tests or define a severity. The learning curve for a business engineer is almost flat as long as they know some SQL.
Dbt to acquire Transform to build out its semantic layer
2 projects | news.ycombinator.com | 9 Feb 2023

My top three:
- Dev/stag/prod env check numbers before pushing to production.
- Unions between two sources that are not the same shape can be done without the headache. https://github.com/dbt-labs/dbt-utils#union_relations-source
- Macros for common case when statements.
Analytics Stacks for Startups
8 projects | dev.to | 21 Feb 2022

Add tests: unit tests in SQL are still not really practical, but testing the data, before allowing users to see it, is possible. dbt has some basic tests like Non-NULL and so on. dbt_utils supports comparing data across tables. If you need more, there is Great Expectation and similar tools. dbt also supports writing SQL queries which output “bad” rows. Use this to, e.g. check a specific order against manually checked correct data. Tests give you confidence that your pipelines produce correct results: nothing is worse than waking up with a Slack message from your boss that the graphs look wrong… They are especially useful in case you have to refactor a data pipeline. Basically every query you would run during the QA phase of a change request has a high potential to become an automatic test.
Why is Data Build Tool (DBT) is so popular? What are some other alternatives?
4 projects | /r/dataengineering | 4 Dec 2021
Unit testing SQL in DBT
3 projects | /r/dataengineering | 6 Feb 2021

The equality test macro is also in the dbt-utils package from fishtown at https://github.com/fishtown-analytics/dbt-utils/blob/master/macros/schema_tests/equality.sql

What are some alternatives?

When comparing sqlfluff and dbt-utils you can also consider the following projects:

vscode-sqlfluff - An extension to use the sqlfluff linter in vscode.

dbt-expectations - Port(ish) of Great Expectations to dbt test macros

sqlparse - A non-validating SQL parser module for Python

dbt-oracle - A dbt adapter for oracle db backend

ale - Check syntax in Vim/Neovim asynchronously and fix files, with Language Server Protocol (LSP) support

nodejs-bigquery - Node.js client for Google Cloud BigQuery: A fast, economical and fully-managed enterprise data warehouse for large-scale data analytics.

soda-sql - Data profiling, testing, and monitoring for SQL accessible data.

streamlit - Streamlit — A faster way to build and share data apps.

Metabase - The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:

airbyte - The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

spark-fast-tests - Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)