dbt_salesforce_source
dbt-expectations
dbt_salesforce_source | dbt-expectations | |
---|---|---|
1 | 10 | |
11 | 972 | |
- | 2.6% | |
7.4 | 6.5 | |
2 months ago | 8 days ago | |
Shell | Shell | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
dbt_salesforce_source
-
Dimensional Data Model Design for Salesforce
Fyi, have used salesforce integrations from Fivetran, Segment, Stitch Data, as well as in a house integration for different companies. Fivetran has also published some analytical models: https://github.com/fivetran/dbt_salesforce_source/tree/master/models.
dbt-expectations
-
Dbt tests vs Soda SQL
Have not used Soda, but dbt indeed is pretty good especially when adding dbt-expectations
-
Data-eng related highlights from the latest Thoughtworks Tech Radar
dbt-expectations
-
Data Quality Dimensions: Assuring Your Data Quality with Great Expectations
I highly.. highly.. recommend the dbt-expectations extension from Catologica for dbt. It's a port of Great Expectations, except you can quickly thunk it in your schema.yml's and have it run as part of your dbt test process. Super powerful and it's prevented us from shipping bad data many times.
-
Managing SQL Tests
I'm used to utilising dbt and defining my tests there (along with dbt-utils or https://github.com/calogica/dbt-expectations): I simply add a list item to a column definition and can already define a great number of tests without having to copy code. I can even extend the pre-defined using generic tests. Writing custom tests also integrates nicely. Additionally it's very convenient to tag tests or define a severity. The learning curve for a business engineer is almost flat as long as they know some SQL.
-
What are some Data Quality check related frameworks for datasets ranging from 100GB to 1TB in size?
Use dbt's testing functionality during your transformations with catalogica/dbt-expectations (Great Expectations framework ported to dbt)
-
Great Expectations is annoyingly cumbersome
Check out dbt-expectations https://github.com/calogica/dbt-expectations
-
CI/CD in data engineering - help a noob
There are certain things I would like to add such as data quality, I can use something like dbt great expectations, but I am not sure how much more I should force it before getting an airflow setup..
- How do you query and quality check data produced in intermediate steps in analytics pipeline?
-
ETL Pipelines with Airflow: The Good, the Bad and the Ugly
[dbt Labs employee here]
Check out dbt-expectations package[1]. It's a port of the Great Expectations checks to dbt as tests. The advantage of this is you don't need another tool for these pretty standard tests, and can be early incorporated into dbt workflows.
[1] https://github.com/calogica/dbt-expectations
-
Unit testing SQL in DBT
Also check out dbt-expectations that is a port of Great Expectations that greatly expands the configurable (non-assert) tests.
What are some alternatives?
dbt_salesforce - Fivetran's Salesforce dbt package
dbt-utils - Utility functions for dbt projects.
dbt-project-evaluator - This package contains macros and models to find DAG issues automatically
materialize - The data warehouse for operational workloads.
dbt-bigquery-gitpod-config - A gitpod config for using dbt-bigquery in vscode
dbt-oracle - A dbt adapter for oracle db backend
dbt_stripe - Data models for Stripe built using dbt.
Scio - A Scala API for Apache Beam and Google Cloud Dataflow.
dbt_hubspot - Data models for Hubspot built using dbt.
NVTabular - NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
cuetils - CLI and library for diff, patch, and ETL operations on CUE, JSON, and Yaml
dbt-fal - do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.