retentioneering-tools
dbt-fal
Our great sponsors
retentioneering-tools | dbt-fal | |
---|---|---|
1 | 12 | |
762 | 851 | |
2.4% | - | |
5.9 | 7.7 | |
5 months ago | 24 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
retentioneering-tools
-
My Favorite Off-the-Shelf Data Science Repos, What Are Yours?
Here are my top off-the-shelf data science models for Marketing. Would be interested which other marketing data science tools you use?
Product Recommendation on Your Website with Metarank (https://github.com/metarank/metarank)
Metarank is a tool that helps you easily build an advanced recommendation engine for your products or content on your website. To get started you only need historical performance data of your products (e.g. number of clicks) and additional metadata like product rating, genre, ingredients or price. In a YAML file, you define the features and the model parameters (e.g. number of iterations, modeling technique). The API service integrates with Apache Flink and can be easily integrated into Kubernetes clusters.
User Journey Analysis on your Website with Retentioneering (https://github.com/retentioneering/retentioneering-tools)
Retentioneering helps you to understand the user journey on your website. Retentioneering is a Python library that allows you to easily connect your Google Analytics data (in Bigquery). You define user-id, event-type and time stamp. From this data input a comprehensive graph network is created with gains and losses as you know it from a customer journey. In addition, customer segments are created that have a similar customer journey. This reduces the complexity of a purely descriptive view of the data.
Marketing Mix Modeling with Robyn (https://github.com/facebookexperimental/Robyn)
Less third-party cookie means less attribution models. The answer to this is Marketing Mix Modeling. Marketing mix models are regression models that use statistical probability to calculate the effect size of marketing channels and other independent variables. The advantage is that business context can be modeled much more realistically. For example, Google Searches for the own brand can be integrated to determine the share of the own brand strength in the revenue. Likewise, offline advertising measures can be modeled with other metrics in this context (e.g. offline advertising with GRPs). Robyn takes into account adstock effects, ROAS calculation and multicollinarity in the marketing channels. In addition, with simple functionality, budgets can be optimized using the predictions and results from marketing tests can be integrated into the model for calibration.
dbt-fal
-
machine learning in snowflake, unhappy data scientists
Happy data scientists use fal and dbt
-
dbt for ML Engineering
fal (https://github.com/fal-ai/fal) helps with this! In fact we wrote a blog post about feature engineering with fal and dbt recently
-
Dbt-fal: a dbt Python adapter with local code execution
We built a dbt adapter that helps you run local Python code with your dbt project with any other data warehouse. You can see it here: https://github.com/fal-ai/fal/tree/main/adapter
This new adapter helps you run your dbt Python models with isolated Python environments using our open source library: https://github.com/fal-ai/isolate
-
Data Stack for Python Scripts (and other transformations)
Have you considered fal? https://github.com/fal-ai/fal
-
Comparing dbt with Delta Live Tables for doing transformations
Something to maybe comment on the post is that dbt is introducing Python transformations on the data warehouse offering (e.g. Snowspark) soon and that there are tools like fal that enable these Python transformations to run in a different environment which you have control over.
-
What are the hottest dbt Repositories you should star on Github 2022? - Here are mine.
Fal-AI ( https://github.com/fal-ai/fal ) Fal helps to run Python scripts directly from the dbt project. For example, you can load dbt models directly into the Python context which helps to apply Data Science libraries like SKlearn and Prophet in the dbt models. This especially improves the data science capabilities within a data pipeline. What I extremely like about fal is that it extends dbt from a interesting angle.
-
What are your hottest dbt repositories in 2022 so far? Here are mine!
- π fal ai: Fal helps to run Python scripts directly from the dbt project. For example you can load dbt models directly into the Python context which helps to apply Data Science libaries like SKlearn and Prophet in the dbt models.
-
Wanting to move away from SQL
I havenβt tried it yet but I know https://fal.ai/ helps you run python alongside dbt.
-
Do I need orchestration for a Fivetran-dbt stack?
Yes I agree with you that having fivetran/airbyte and dbt covers a lot of the airflow use cases.. That being said you might still want to run some scripts after the DBT transformation is over, we ran into this exact problem and built a useful CLI tool for running python scripts alongside the dbt run.
-
Why is Data Build Tool (DBT) is so popular? What are some other alternatives?
Great write-up! For your logging integration, you might have a look at fal. There's an example of sending events to Datadog
What are some alternatives?
metarank - A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine
dbt-metabase - dbt + Metabase integration
Robyn - Robyn is an experimental, AI/ML-powered and open sourced Marketing Mix Modeling (MMM) package from Meta Marketing Science. Our mission is to democratise modeling knowledge, inspire the industry through innovation, reduce human bias in the modeling process & build a strong open source marketing science community.
dbt-expectations - Port(ish) of Great Expectations to dbt test macros
orange - π :bar_chart: :bulb: Orange: Interactive data analysis
kuwala - Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demographics data b) Point of Interests from Open Street Map c) Google Popular Times
sweetviz - Visualize and compare datasets, target values and associations, with one line of code.
evidence - Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown
Contactless-Attendance-System - β¨ A Contactless Attendance System where your face is identified for Attendance.
Pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
airflow-dbt - Apache Airflow integration for dbt
re_data - re_data - fix data issues before your users & CEO would discover them π