rudderstack-docs
BigQuery-Python
Our great sponsors
rudderstack-docs | BigQuery-Python | |
---|---|---|
20 | 1 | |
20 | 449 | |
- | - | |
3.5 | 1.8 | |
17 days ago | over 2 years ago | |
JavaScript | Python | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
rudderstack-docs
-
MDS Newsletter #12
2/ Featured tools this week - Transform and RudderStack
-
How To Event Stream From Your Gatsby Website Using Open Source RudderStack
RudderStack is an open-source Customer Data Pipeline that allows you to track and send real-time events from your web, mobile, and server-side sources to your entire customer data stack. Our primary repository - rudder-server - is open-sourced on GitHub.
-
Customer Data Pipelines Play a Key Role in Data Privacy
This post will explain how your customer data pipeline can help improve your data privacy and how to ensure your data privacy with RudderStack.
-
Open Source Analytics Stack: Bringing Control, Flexibility, and Data-Privacy to Your Analytics
However, limitations to traditional CDPs, especially around connecting to best-of-breed customer tooling and exposing data for use across an organization have driven a new generation of non-CDPs. Solutions like Snowplow's (website, GitHub) data delivery platform and RudderStack's (website, GitHub) customer data platform for developers ingest data from a multitude of sources, apply in-stream transformations, and route data to your data warehouse, like Snowplow, or your warehouse plus your preferred customer tooling destinations for activation, like RudderStack.
-
RudderStack + Blendo: Better Together
I learned many lessons from this journey - lessons that deserve a post of their own - but there's one lesson that I learned early on that stands out. In this blog, I talk about why we merged Blendo with RudderStack, building the team and working together to build a great product.
-
The Open Source Story - Open Sourcing RudderStack Blog and Docs
In fact, developers have already started contributing to our documentation. Recently, Benedikt from the Userlist team created the docs for the Userlist destination for RudderStack (see the pull request here). They also built the Userlist integration, submitted a pull request, and it is now live on our platform! This is the beauty of open source!
-
How to plan and implement a customer data tracking strategy for your Micro-SaaS
TLDR: general steps/starting point for setting up an app with Rudderstack ( or Segment) to track customer events
-
Developing a Custom Plugin using Flutter
As a part of our SDK roadmap at RudderStack, we wanted to develop a Flutter SDK. Our existing SDKs include features such as storing event details and persisting user details on the database, and much more. However, these features are already implemented in our Android and iOS SDKs.
-
Visualize Stripe Payments Data in Postgres using SQL
To load Stripe data into Postgres, you can use platforms such as Stitch Data and Rudderstack. In this guide, we will use Stitch Data because it is a cheap and a fast solution.
-
Dogfooding at RudderStack: Tracking Plans Part 1
With your Tracking Plans in place, you can use the existing Data Governance API's to evaluate your inbound events, payload samples and metadata to compare them against your plans. You can also use the RudderTyper tool we're releasing alongside Tracking Plans. RudderTyper is a tool for generating strongly-typed RudderStack analytics library wrappers based on your published tracking plan specs, meaning your data will conform to your defined schema upon capture.
BigQuery-Python
-
How To Access And Query Your Google BigQuery Data Using Python And R
To query your Google BigQuery data using Python, we need to connect the Python client to our BigQuery instance. We do so using a cloud client library for the Google BigQuery API. You can also choose to use any other third-party option to connect BigQuery with Python; the BigQuery-Python library by tylertreat is also a great option.
What are some alternatives?
pub-dev - The pub.dev website
ethereum-etl - Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
dbt-spark - dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks
datavault4dbt - Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including the Staging Area, DV2.0 main entities, PITs and Snapshot Tables.
dbt - dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications. [Moved to: https://github.com/dbt-labs/dbt-core]
bitcoin-etl - ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
nodejs-bigquery - Node.js client for Google Cloud BigQuery: A fast, economical and fully-managed enterprise data warehouse for large-scale data analytics.
bigrquery - An interface to Google's BigQuery from R.
WordPress - WordPress, Git-ified. This repository is just a mirror of the WordPress subversion repository. Please do not send pull requests. Submit pull requests to https://github.com/WordPress/wordpress-develop and patches to https://core.trac.wordpress.org/ instead.
bigquery-schema-generator - Generates the BigQuery schema from newline-delimited JSON or CSV data records.
dbt-sessionization - Using DBT for Creating Session Abstractions on RudderStack - an open-source, warehouse-first customer data pipeline and Segment alternative.
somm_airdrop - Sommelier Finance (SOMM) token distribution for the airdrop proposed in Sips-002