DataEngineeringProject
openwisp-monitoring
DataEngineeringProject | openwisp-monitoring | |
---|---|---|
5 | 1 | |
985 | 142 | |
- | 2.1% | |
0.0 | 7.1 | |
over 1 year ago | 4 days ago | |
Python | Python | |
MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DataEngineeringProject
- What are your favourite GitHub repos that shows how data engineering should be done?
- Is it me or are beginner-friendly ETL pipeline guides that explain from the ground-up how to incorporate the use of various technologies notoriously difficult to find.
-
Starting A Data Engineering Project Series
News RSS Feeds
-
5 Data Sources for Data Engineering Projects
Lastly, the most readily available data source would be data scraped from the internet. To be slightly less vague, I have outlined a project that web-scrapes new online articles every ten minutes to provide all the latest news curated into one place. This project utilizes a wide variety of relevant data engineering tools, which makes it a great project example. The author of this project is Damian Kliś, and he outlines his model architecture below:
-
Can You Recommend Good Data Engineering Projects
Here is my project that got me a few interviews so far: https://github.com/damklis/DataEngineeringProject
openwisp-monitoring
-
New OpenWISP Monitoring feature for OpenWRT/modem-manager: Mobile Signal Charts
Pull request: https://github.com/openwisp/openwisp-monitoring/pull/294.
What are some alternatives?
blinkist-scraper - 📚 Python tool to download book summaries and audio from Blinkist.com, and generate some pretty output
Grafana - The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
synapse-s3-storage-provider - Synapse storage provider to fetch and store media in Amazon S3
ansible-openwisp2 - Ansible role that installs and upgrades OpenWISP.
yaetos - Write data & AI pipelines in (SQL, Spark, Pandas) and deploy to the cloud, simplified
wazuh-ruleset - Wazuh - Ruleset
amazon-s3-find-and-forget - Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
pgflux - Python script to send PostgreSQL monitoring telemetry to InfluxDB
Zillow-Data-Engineering
vnet-manager - Virtual network manager - Manages containers and VMs to create a virtual network setup
openverse-catalog - Identifies and collects data on cc-licensed content across web crawl data and public apis.
Sentry - Developer-first error tracking and performance monitoring