data_engineering_on_gcp_book
exomind
data_engineering_on_gcp_book | exomind | |
---|---|---|
12 | 5 | |
116 | 58 | |
- | - | |
2.6 | 9.3 | |
about 3 years ago | 8 days ago | |
TypeScript | ||
- | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
data_engineering_on_gcp_book
-
How possible is it for a beginner to establish pipelines, data warehouse, and visualization solution as a team of 1?
This book will walk you through setting up a complete data engineering stack on GCP: https://github.com/Nunie123/data_engineering_on_gcp_book
-
Python & SQL knowledge needed for ETL?
As for resources, this book goes over a lot of these: https://github.com/Nunie123/data_engineering_on_gcp_book. However, this goes over the 'how', not the 'why'. The only method I know for understanding the 'why' is experience. Whether at work or personal projects.
-
Learning Python and SQL: What should be my next step?
Here's a good book to follow along to introduce you to common tooling and design patterns: https://github.com/Nunie123/data_engineering_on_gcp_book
-
Github Repo with All Data tranformation,Cleaning,Validation
I'm not sure if this is exactly what you're looking for, but here's a book on GitHub that talks about the tools and steps for building data pipelines into a data warehouse: https://github.com/Nunie123/data_engineering_on_gcp_book
-
What is the low hanging fruit for a brand new GCP data engineer to learn?
Check out this book: https://github.com/Nunie123/data_engineering_on_gcp_book
-
Unsure about overall process of data engineering
If you're interested in example of how to build a complete data engineering infrastructure, you should check out this book: https://github.com/Nunie123/data_engineering_on_gcp_book
-
[HELP] Airflow Reverse proxy + load balancer +docker
If you want to try Airflow without the setup headache, you can try Composer on GCP, which is a hosted version of Airflow. I wrote some info on how to do that here: https://github.com/Nunie123/data_engineering_on_gcp_book/blob/master/ch_2_orchestration.md
-
Transition from a Quality engineer to Data engineer
This book might be a good resource for you: https://github.com/Nunie123/data_engineering_on_gcp_book
-
Accepted a data engineer intern role at a Big N company - how do I learn as much as possible?
If you want a place to start on personal projects you can check out this book, https://github.com/Nunie123/data_engineering_on_gcp_book, which will walk you through the basics of setting up a full data engineering stack.
-
What tools, software, programming languages, and etc. does a data engineer need to have in 2021
If you are interested in tooling, here's a free book on setting up a basic data engineering tech stack on GCP: https://github.com/Nunie123/data_engineering_on_gcp_book
exomind
-
Ask HN: What is your current side-project?
Is your PDF reader open sourced? It's a feature I'd like to implement at some point in my own personal project (https://github.com/appaquet/exomind)
-
What is your “I don't care if this succeeds” project?
I just added a few screenshots in the README: https://github.com/appaquet/exomind
As for the Gmail integration, it is quite crude at the moment. I use it mostly to organize incoming emails, but I still use Gmail to send or reply to my emails. Exomind inbox is synchronized with Gmail, so all emails that you remove from one or the other get removed / archived on the other side. It also supports multiple accounts.
If you are interested to try and not afraid of the rough edges, just let me know. I added Discussions to the GitHub repository.
-
Ask HN: What Are You Working On?
Exomind[1], a personal knowledge management tool that takes the form of a unified inbox in which you can have your emails, tasks, notes and bookmarks organized into collections. I have an iOS and a web/electron client at the moment. I plan to eventually add files (blobs), definitions and support extensibility via WASM applications.
Its backend (Exocore[2]) is built on top of a personal / private blockchain and is made from the ground up to be hosted in a semi-decentralized fashion on your own personal devices (your computer, raspberry pi, a cloud instance, etc.)
It has very rough edges, but I'm using it daily to organize my life. It has also been my learning playground to improve my Rust skills over the last two years. If all goes well, I'm a few months away from some kind of tech preview.
[1] https://github.com/appaquet/exomind
What are some alternatives?
shotcaller - A moddable RTS/MOBA game made with bracket-lib and minigene.
openmiko - Open source firmware for Ingenic T20 based devices such as WyzeCam V2, Xiaomi Xiaofang 1S, iSmartAlarm's Spot+ and others.
FactGraph - FactGraph monorepo (backend + frontend + landing page + blog)
listudy - Listudy - chess training server
beubo - Beubo is a free, simple, and minimal CMS with unlimited extensibility using plugins
DsHidMini - Virtual HID Mini-user-mode-driver for Sony DualShock 3 Controllers
distribyted - Torrent client with HTTP, fuse, and WebDAV interfaces. Start exploring your torrent files right away, even zip, rar, or 7zip archive contents!
ExtPay - The JavaScript library for ExtensionPay.com — payments for your browser extensions, no server needed.
go-plugin - Golang plugin system over RPC.
electron-browser-shell - A minimal, tabbed web browser with support for Chrome extensions—built on Electron.
dali - Indie assembler/linker for Dalvik VM .dex & .apk files (Work In Progress)
Video-Hub-App - Official repository for Video Hub App