spark-bigquery-connector vs streamify
Metric | spark-bigquery-connector | streamify
---|---|---
Mentions | 2 | 4
Stars | 351 | 474
Growth | 2.3% | -
Activity | 8.9 | 0.0
Last commit | 5 days ago | about 2 years ago
Language | Java | Python
License | Apache License 2.0 | -
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
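As a rough illustration of the Growth figure described above, month-over-month growth can be read as the percentage change in star count between two consecutive months. The sketch below assumes that plain definition; the site's exact calculation is not stated here, and the function name `month_over_month_growth` and the prior-month count of 343 are hypothetical values chosen only for illustration.

```python
def month_over_month_growth(previous_stars: int, current_stars: int) -> float:
    """Return the percentage change in stars from one month to the next."""
    if previous_stars <= 0:
        raise ValueError("previous_stars must be positive")
    return (current_stars - previous_stars) / previous_stars * 100


# Hypothetical figures: 343 is an illustrative prior-month count, not real data.
growth = month_over_month_growth(previous_stars=343, current_stars=351)
print(f"{growth:.1f}%")
```

A project with no growth value (shown as "-") would simply lack one of the two monthly counts under this reading.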
spark-bigquery-connector
- What the hell is Spark?
  Why not just swap Spark and BigQuery in your comment? There's even a connector.
- Completed my first Data Engineering project with Kafka, Spark, GCP, Airflow, dbt, Terraform, Docker and more!
streamify
- Where can I find online projects end-to-end?
- Completed my first Data Engineering project with Kafka, Spark, GCP, Airflow, dbt, Terraform, Docker and more!
What are some alternatives?
eventsim - Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.
terraform - Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared amongst team members, treated as code, edited, reviewed, and versioned.
Apache Spark - A unified analytics engine for large-scale data processing
Airflow - Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
nodejs-bigquery - Node.js client for Google Cloud BigQuery: A fast, economical and fully-managed enterprise data warehouse for large-scale data analytics.
ApacheKafka - A curated resources list for awesome Apache Kafka
finnhub-streaming-data-pipeline - Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more
Docker Compose - Define and run multi-container applications with Docker
tfl-bikes-data-pipeline - Processing TFL data for bike usage with Google Cloud Platform.