| | targets | dbt |
|---|---|---|
| Mentions | 10 | 1 |
| Stars | 869 | 3,802 |
| Growth | 1.6% | - |
| Activity | 9.6 | 10.0 |
| Latest commit | 9 days ago | over 2 years ago |
| Language | R | Python |
| License | GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
targets
-
Advice on Best Practices
Is this it https://github.com/ropensci/targets?
-
Does anyone else feel in a tricky spot about their use of R?
I'll chime in with others to say that using targets can help with the memory load as well. If you partition your data adequately (e.g. grouping by subjects), you can take advantage of the way targets maps data so it only loads what it needs to. Moreover, if you use the memory = "transient" option, it will unload objects between steps -- adding a little bit of time overhead but saving you on memory. targets and tidytable together have enabled me to work on pretty sizeable datasets while rarely running into memory issues. In fact, the only time I ran into a data memory hog was because I didn't adequately partition the data across worker nodes.
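To make that concrete, here is a minimal sketch of a `_targets.R` that combines per-subject partitioning with transient memory; `read_subject()` and `summarize_subject()` are hypothetical stand-ins for your own loading and analysis functions.

```r
# _targets.R -- minimal sketch; read_subject() and summarize_subject()
# are hypothetical placeholders for your own loading and analysis code.
library(targets)

# Unload each target from memory when the current step no longer needs it,
# trading a little reload time for a much smaller memory footprint.
tar_option_set(memory = "transient")

list(
  tar_target(subject_ids, c("s01", "s02", "s03")),
  # Dynamic branching: one branch per subject, so each build step only
  # loads the data for the branch it is currently working on.
  tar_target(subject_data, read_subject(subject_ids),
             pattern = map(subject_ids)),
  tar_target(subject_summary, summarize_subject(subject_data),
             pattern = map(subject_data))
)
```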
-
What are your favorite R Libraries?
targets
-
Is there a better way to update an entire series of scripts?
I highly recommend the holy grail of workflow orchestrators / executors in the R ecosystem: targets.
-
The new Drake: ropensci/targets (Function-oriented Make-like declarative workflows for R) {R}
-
How do you manage, distribute and schedule jobs written in R?
That said, you might want to check out the ‘targets’ package, which provides a DSL for specifying complex workflow descriptions in R. When repeatedly running the same jobs on changing data, this package helps ensure that only necessary work is performed (suitable intermediate results are reused), and that scripts run reproducibly. This might help with scheduling.
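As a rough illustration of that only-necessary-work behavior, the sketch below (with a hypothetical `data.csv`) builds everything on the first run and skips every up-to-date target on the second.

```r
# Sketch of incremental recomputation with targets; "data.csv" and the
# model formula are hypothetical.
library(targets)
tar_script({
  list(
    # format = "file" tells targets to watch the file's contents.
    tar_target(data_file, "data.csv", format = "file"),
    tar_target(raw, read.csv(data_file)),     # reruns only when data.csv changes
    tar_target(model, lm(y ~ x, data = raw))  # reruns only when `raw` changes
  )
})
tar_make()  # first run: builds everything
tar_make()  # second run: all targets are up to date, nothing is recomputed
```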
-
How do I do something like this as a parallel programming in R?
It may be worth putting these individual steps into a targets pipeline. targets is designed to support parallelization through future, and it makes it easier to visualize downstream dependencies.
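For instance, assuming a project already has a valid `_targets.R` whose branches are independent, a parallel build might look like this sketch:

```r
# Sketch: build independent targets in parallel via the future backend.
library(targets)
library(future)
plan(multisession)            # background R sessions on the local machine

tar_make_future(workers = 4)  # distribute outdated targets across 4 workers
tar_visnetwork()              # inspect the downstream dependency graph
```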
-
Tips re: workflow, organization, file hygiene and similar?
Given your requirements, I recommend you check out ‘targets’, which specifically addresses the needs of reusable workflows in R; it seems to fit them to a T.
-
Your impression of {targets}? (r package)
The targets package is the official successor to drake and has the same primary author (Will Landau). He has explained why he created targets, citing stronger guardrails for users and a better UX.
-
Data engineering with R?
I use it for ETL: targets is my workflow management software and, like others, I have a cron job set up to run nightly builds.
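A nightly build along those lines can be a one-line R script invoked from cron; the paths below are hypothetical placeholders.

```r
# build.R -- hypothetical nightly build script, scheduled from cron with e.g.:
#   0 2 * * * Rscript /path/to/project/build.R >> /path/to/project/build.log 2>&1
setwd("/path/to/project")  # hypothetical project root containing _targets.R
targets::tar_make()        # rebuilds only the targets that are out of date
```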
dbt
-
Open Source Analytics Stack: Bringing Control, Flexibility, and Data-Privacy to Your Analytics
With the rise of cloud-based data warehouses, businesses can load all their raw data directly into the warehouse without prior transformation. This process is known as ELT (Extract, Load, Transform) and gives data and analytics teams the freedom to develop ad-hoc transformations for their particular needs. ELT became popular as the cloud's processing power and scale became better suited to transforming data. dbt is a popular open-source tool for ELT that lets businesses transform data in their warehouses more effectively, and it pairs well with RudderStack's Cloud Extract ETL tool.
What are some alternatives?
dbt-core - dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Apache Kafka - Mirror of Apache Kafka
drake - An R-focused pipeline toolkit for reproducibility and high-performance computing
airbyte - The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
awesome-pipeline - A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
superset - Apache Superset is a Data Visualization and Data Exploration Platform
tidyverse - Easily install and load packages from the tidyverse
Snowplow - The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP
fastverse - An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
nbdev - Create delightful software with Jupyter Notebooks
targets-tutorial - Short course on the targets R package
rudderstack-docs - Documentation repository for RudderStack - the Customer Data Platform for Developers.