openverse-catalog
DataEngineeringProject
openverse-catalog | DataEngineeringProject | |
---|---|---|
7 | 5 | |
54 | 985 | |
- | - | |
1.8 | 0.0 | |
about 1 year ago | over 1 year ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
openverse-catalog
- where can I find royalty free stock photos for designs?
-
Are there any record pools for non commercial use?
copyright free music ? https://openverse.org
-
Any other Mr. Nightmare-style horror Youtubers here?
You can check openverse.org. They have different royalty-free assets and those are quite often not polished and look very ordinary and real.
-
In Over My Head
Like with any other issue, I kind of look at it at large and think either "This seems do-able" or "Pass", this one was in the first category: openverse-catalog. I saw that I just had to add a string to some header and thought maybe this is something I can actually do. Maybe it was, I won't be able to find out because I could not get the project to run.
-
Hacktoberfest Recap
Issue, Pull Request, Blog Post
-
Hacktoberfest Week 2
We're already halfway through October! This week, I focused on finishing up my second issue that I had started working on last week in the Wordpress Openverse Catalog repository.
-
Hacktoberfest Week 1
This is my first Hacktoberfest! I was able to work on two issues this week, one for Seneca's Telescope project and one for Wordpress Openverse Catalog. Finding the issues were a bit challenging since there were so many people and repos participating, but I remembered a piece of advice that my open source professor mentioned, which was to pick a good enough issue, rather than a perfect issue.
DataEngineeringProject
- What are your favourite GitHub repos that shows how data engineering should be done?
- Is it me or are beginner-friendly ETL pipeline guides that explain from the ground-up how to incorporate the use of various technologies notoriously difficult to find.
-
Starting A Data Engineering Project Series
News RSS Feeds
-
5 Data Sources for Data Engineering Projects
Lastly, the most readily available data source would be data scraped from the internet. To be slightly less vague, I have outlined a project that web-scrapes new online articles every ten minutes to provide all the latest news curated into one place. This project utilizes a wide variety of relevant data engineering tools, which makes it a great project example. The author of this project is Damian Kliś, and he outlines his model architecture below:
-
Can You Recommend Good Data Engineering Projects
Here is my project that got me a few interviews so far: https://github.com/damklis/DataEngineeringProject
What are some alternatives?
pytest-recording - A pytest plugin that allows recording network interactions via VCR.py
blinkist-scraper - 📚 Python tool to download book summaries and audio from Blinkist.com, and generate some pretty output
telescope - A tool for tracking blogs in orbit around Seneca's open source involvement
synapse-s3-storage-provider - Synapse storage provider to fetch and store media in Amazon S3
openverse-api - The Openverse API allows programmatic access to search for CC-licensed and public domain digital media.
yaetos - Write data & AI pipelines in (SQL, Spark, Pandas) and deploy to the cloud, simplified
Airflow - Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
amazon-s3-find-and-forget - Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Atari-Space-Invaders - An inspiration of the original Atari Space Invaders game built in pygame :space_invader: :video_game: [Moved to: https://github.com/Mayank0255/Space-Invaders]
Zillow-Data-Engineering
office-ui-fabric-react - Fluent UI web represents a collection of utilities, React components, and web components for building web applications.
openwisp-monitoring - Network monitoring system written in Python and Django, designed to be extensible, programmable, scalable and easy to use by end users: once the system is configured, monitoring checks, alerts and metric collection happens automatically.