data_engineering_on_gcp_book
distribyted
Metric | data_engineering_on_gcp_book | distribyted
---|---|---
Mentions | 12 | 3
Stars | 116 | 1,008
Growth | - | 1.7%
Activity | 2.6 | 8.6
Last commit | about 3 years ago | 1 day ago
Language | - | Go
License | - | GNU General Public License v3.0 only
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
data_engineering_on_gcp_book
- What is your “I don't care if this succeeds” project?
A book for setting up data engineering infrastructure on GCP: https://github.com/Nunie123/data_engineering_on_gcp_book
It's almost done, and I do plan to spend a little effort promoting it when it's complete, but it's been a great focus for me even if no one ever reads it.
I use a fair bit of infrastructure at my job that was set up by others. It was nice to go through the practice of setting it up myself.
I learned a good bit, but also it's nice to have all this knowledge written down in a place not owned by the company I work for. If I use GCP at future jobs I'm sure I'll reference this book myself.
distribyted
- What is your “I don't care if this succeeds” project?
A torrent client that exposes torrent content as files: https://github.com/distribyted/distribyted
It's pretty fun to work on and to implement new use cases. Right now it supports FUSE mounts, but I'm thinking of making it work as a WebDAV server too.
Also, I'm working on several demos, like SQLite compatibility, similar to https://github.com/lmatteis/torrent-net, or CSV analysis using Jupyter notebooks for huge datasets like https://ghtorrent.org/
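Because a FUSE mount presents torrent content as ordinary files, existing tools can read it with plain file I/O. The following is a minimal sketch of the CSV-analysis idea, not part of distribyted itself: the function name `count_by_column`, the file path, and the column name are all hypothetical, and the code streams rows one at a time so a huge file never has to fit in memory.

```python
import csv
import sys
from collections import Counter
from typing import Iterable


def count_by_column(lines: Iterable[str], column: str) -> Counter:
    """Stream CSV rows and count how often each value of `column` occurs.

    `lines` can be an open text file (for example one under a FUSE mount)
    or any other iterable of CSV lines. The first line must be a header.
    """
    reader = csv.DictReader(lines)
    counts: Counter = Counter()
    for row in reader:  # one row at a time; memory use stays flat
        counts[row[column]] += 1
    return counts


if __name__ == "__main__":
    # Usage (hypothetical): python count.py <path-under-your-mount> <column>
    if len(sys.argv) == 3:
        path, column = sys.argv[1], sys.argv[2]
        with open(path, newline="") as f:
            for value, n in count_by_column(f, column).most_common(10):
                print(value, n)
```

The same function works unchanged in a Jupyter notebook cell: open the mounted file and pass the file object in. Whether reads are fast depends on how much of the torrent has already been downloaded, which this sketch does not address.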
What are some alternatives?
Video-Hub-App - Official repository for Video Hub App
btfs - A bittorrent filesystem based on FUSE.
fusell-seed - FUSE (the low-level interface) file system boilerplate
check-all-the-things - check all of the things!
shotcaller - A moddable RTS/MOBA game made with bracket-lib and minigene.
FactGraph - FactGraph monorepo (backend + frontend + landing page + blog)
beubo - Beubo is a free, simple, and minimal CMS with unlimited extensibility using plugins
go-plugin - Golang plugin system over RPC.
scraper - Node.js web scraper. Contains a command line, Docker container, Terraform module and Ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, jsdom.
dali - Indie assembler/linker for Dalvik VM .dex & .apk files (Work In Progress)
listudy - Listudy - chess training server