openverse-catalog
Airflow
openverse-catalog | Airflow | |
---|---|---|
7 | 169 | |
54 | 34,485 | |
- | 1.1% | |
1.8 | 10.0 | |
about 1 year ago | 7 days ago | |
Python | Python | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
openverse-catalog
- where can I find royalty free stock photos for designs?
-
Are there any record pools for non commercial use?
copyright free music ? https://openverse.org
-
Any other Mr. Nightmare-style horror Youtubers here?
You can check openverse.org. They have different royalty-free assets and those are quite often not polished and look very ordinary and real.
-
In Over My Head
Like with any other issue, I kind of look at it at large and think either "This seems do-able" or "Pass", this one was in the first category: openverse-catalog. I saw that I just had to add a string to some header and thought maybe this is something I can actually do. Maybe it was, I won't be able to find out because I could not get the project to run.
-
Hacktoberfest Recap
Issue, Pull Request, Blog Post
-
Hacktoberfest Week 2
We're already halfway through October! This week, I focused on finishing up my second issue that I had started working on last week in the Wordpress Openverse Catalog repository.
-
Hacktoberfest Week 1
This is my first Hacktoberfest! I was able to work on two issues this week, one for Seneca's Telescope project and one for Wordpress Openverse Catalog. Finding the issues were a bit challenging since there were so many people and repos participating, but I remembered a piece of advice that my open source professor mentioned, which was to pick a good enough issue, rather than a perfect issue.
Airflow
-
Building in Public: Leveraging Tublian's AI Copilot for My Open Source Contributions
Contributing to Apache Airflow's open-source project immersed me in collaborative coding. Experienced maintainers rigorously reviewed my contributions, providing constructive feedback. This ongoing dialogue refined the codebase and honed my understanding of best practices.
-
Navigating Week Two: Insights and Experiences from My Tublian Internship Journey
In week Two, I contributed to the Apache Airflow repository.
-
Airflow VS quix-streams - a user suggested alternative
2 projects | 7 Dec 2023
-
Best ETL Tools And Why To Choose
Apache Airflow is an open-source platform to programmatically author, schedule, and monitor workflows. The platform features a web-based user interface and a command-line interface for managing and triggering workflows.
-
Simplifying Data Transformation in Redshift: An Approach with DBT and Airflow
Airflow is the most widely used and well-known tool for orchestrating data workflows. It allows for efficient pipeline construction, scheduling, and monitoring.
-
Share Your favorite python related software!
AIRFLOW This is more of a library in my opinion, but Airflow has become an essential tool for scheduling in my work. All our ML training pipelines are ordered and scheduled with Airflow and it works seamlessly. The dashboard provided is also fantastic!
-
Ask HN: What is the correct way to deal with pipelines?
I agree there are many options in this space. Two others to consider:
- https://airflow.apache.org/
- https://github.com/spotify/luigi
There are also many Kubernetes based options out there. For the specific use case you specified, you might even consider a plain old Makefile and incrond if you expect these all to run on a single host and be triggered by a new file showing up in a directory…
- "Você veio protestar para ter acesso ao código fonte da urnas. O que é o código fonte?" "Não sei" 🤡
- Cómo construir tu propia data platform. From zero to hero.
-
Is it impossible to contribute to open source as a data engineer?
You can try and contribute some new connectors/operators for workflow managers like Airflow or Airbyte
What are some alternatives?
DataEngineeringProject - Example end to end data engineering project.
Kedro - Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
pytest-recording - A pytest plugin that allows recording network interactions via VCR.py
dagster - An orchestration platform for the development, production, and observation of data assets.
telescope - A tool for tracking blogs in orbit around Seneca's open source involvement
n8n - Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.
openverse-api - The Openverse API allows programmatic access to search for CC-licensed and public domain digital media.
luigi - Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Atari-Space-Invaders - An inspiration of the original Atari Space Invaders game built in pygame :space_invader: :video_game: [Moved to: https://github.com/Mayank0255/Space-Invaders]
Apache Spark - Apache Spark - A unified analytics engine for large-scale data processing
office-ui-fabric-react - Fluent UI web represents a collection of utilities, React components, and web components for building web applications.
Dask - Parallel computing with task scheduling