data_engineering_on_gcp_book
Arthur
Our great sponsors
data_engineering_on_gcp_book | Arthur | |
---|---|---|
12 | 5 | |
116 | 7 | |
- | - | |
2.6 | 8.5 | |
about 3 years ago | about 3 years ago | |
Python | ||
- | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
data_engineering_on_gcp_book
-
How possible is it for a beginner to establish pipelines, data warehouse, and visualization solution as a team of 1?
This book will walk you through setting up a complete data engineering stack on GCP: https://github.com/Nunie123/data_engineering_on_gcp_book
-
Python & SQL knowledge needed for ETL?
As for resources, this book goes over a lot of these: https://github.com/Nunie123/data_engineering_on_gcp_book. However, this goes over the 'how', not the 'why'. The only method I know for understanding the 'why' is experience. Whether at work or personal projects.
-
Learning Python and SQL: What should be my next step?
Here's a good book to follow along to introduce you to common tooling and design patterns: https://github.com/Nunie123/data_engineering_on_gcp_book
-
Github Repo with All Data tranformation,Cleaning,Validation
I'm not sure if this is exactly what you're looking for, but here's a book on GitHub that talks about the tools and steps for building data pipelines into a data warehouse: https://github.com/Nunie123/data_engineering_on_gcp_book
-
What is the low hanging fruit for a brand new GCP data engineer to learn?
Check out this book: https://github.com/Nunie123/data_engineering_on_gcp_book
-
Unsure about overall process of data engineering
If you're interested in example of how to build a complete data engineering infrastructure, you should check out this book: https://github.com/Nunie123/data_engineering_on_gcp_book
-
[HELP] Airflow Reverse proxy + load balancer +docker
If you want to try Airflow without the setup headache, you can try Composer on GCP, which is a hosted version of Airflow. I wrote some info on how to do that here: https://github.com/Nunie123/data_engineering_on_gcp_book/blob/master/ch_2_orchestration.md
-
Transition from a Quality engineer to Data engineer
This book might be a good resource for you: https://github.com/Nunie123/data_engineering_on_gcp_book
-
Accepted a data engineer intern role at a Big N company - how do I learn as much as possible?
If you want a place to start on personal projects you can check out this book, https://github.com/Nunie123/data_engineering_on_gcp_book, which will walk you through the basics of setting up a full data engineering stack.
-
What tools, software, programming languages, and etc. does a data engineer need to have in 2021
If you are interested in tooling, here's a free book on setting up a basic data engineering tech stack on GCP: https://github.com/Nunie123/data_engineering_on_gcp_book
Arthur
-
Ask HN: What is your “I don't care if this succeeds” project?
Here are three hobby projects I've worked on during the last 2 years. I've written extensive guides for all of them:
https://github.com/maxvfischer/DIY-CNC-machine A CNC-machine I built from scratch, using 40x 3d-printed parts.
https://github.com/maxvfischer/Arthur An AI art installation I built from scratch using a GAN network, Samsung The Frame, a button and a PIR-sensor.
https://github.com/maxvfischer/DIY-arcade A full-size Arcade Machine I built from scratch.
-
Problem With False Positives On My Soldered
Here you have the complete electronic setup: https://github.com/maxvfischer/Arthur#electronic-components
-
What is your “I don't care if this succeeds” project?
When I set out to learn new skills, I usually try to wrap them in a project. I also try to document and open-source the whole process, both for my own learning, but to enable other to leverage my failures and learnings.
Here are the projects I've done so far:
https://github.com/maxvfischer/Arthur An AI art installation I built from scratch using a GAN network, Samsung The Frame, a button and a PIR-sensor (including, code, images and tutorial). The main draft is almost done, but quite some polishing to do.
https://github.com/maxvfischer/shibusa An automatic Zen Garden drawing infinite patterns in sand. Using stepper motors, inverse kinematics and a Raspberry Pi Zero W (including, code, images and tutorial). I'm almost done building the robot, but still have quite some implementation to do. Also, the guide is far from done, I've mostly uploaded images so far.
https://github.com/maxvfischer/DIY-arcade A full-size Arcade Machine I built from scratch (including, code, images and tutorial). I don't know where you draw the life of "half baked". It's done, but there's a lot of improvements that can be done.
-
Ask HN: What Are You Working On?
https://github.com/maxvfischer/Arthur An AI art installation I built from scratch using a GAN network, Samsung The Frame, a button and a PIR-sensor (including, code, images and tutorial). The main draft is almost done, but quite some polishing to do.
https://github.com/maxvfischer/shibusa An automatic Zen Garden drawing infinite patterns in sand. Using stepper motors, inverse kinematics and a Raspberry Pi Zero W (including, code, images and tutorial). I'm almost done building the robot, but still have quite some implementation to do. Also, the guide is far from done, I've mostly uploaded images so far.
-
Ask HN: Show me your Half Baked project
https://github.com/maxvfischer/Arthur
What are some alternatives?
shotcaller - A moddable RTS/MOBA game made with bracket-lib and minigene.
vopono - Run applications through VPN tunnels with temporary network namespaces
FactGraph - FactGraph monorepo (backend + frontend + landing page + blog)
listudy - Listudy - chess training server
beubo - Beubo is a free, simple, and minimal CMS with unlimited extensibility using plugins
scraper - A scraper for EmulationStation written in Go using hashing
distribyted - Torrent client with HTTP, fuse, and WebDAV interfaces. Start exploring your torrent files right away, even zip, rar, or 7zip archive contents!
dflex - The sophisticated Drag and Drop library you've been waiting for 🥳
go-plugin - Golang plugin system over RPC.
UsTaxes - Tax filing web application
dali - Indie assembler/linker for Dalvik VM .dex & .apk files (Work In Progress)
electron-browser-shell - A minimal, tabbed web browser with support for Chrome extensions—built on Electron.