typhoon-orchestrator
jmespath.py
typhoon-orchestrator | jmespath.py | |
---|---|---|
14 | 30 | |
29 | 2,081 | |
- | 1.5% | |
0.0 | 0.0 | |
over 1 year ago | about 2 months ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
typhoon-orchestrator
- After Airflow. Where next for DE?
- New OSS Orchestrator - Where should we go next?
-
Airflow's Problem
I have my own opinion on Airflow's pain points and created Typhoon Orchestrator (https://github.com/typhoon-data-org/typhoon-orchestrator) to solve them. It doesn't have many stars yet but I've used it to create some pipelines for medium sized companies in a few days, and they've been running for over a year without issues.
In particular I transpile to Airflow code (can also deploy to Lambda) because I think it's still the most robust and well supported "runtime", I just don't think the developer experience is that good.
-
Data Engineering for very small businesses. Any experiences?
Typhoon Orchestrator This is a framework that I designed to help fix some of the pain points of Airflow so that I could build test and deploy pipelines faster. You could skip this step but if you want more info check here.
-
CSV data library to database
I am also collaborating on an open source tool called Typhoon Orchestrator (repo). It aims to make composing airflow data pipelines simple and quite quick. Putting pipeline steps together like lego.
-
Recommendations for simple ETL (Postgres to Snowflake)
The project (https://github.com/typhoon-data-org/typhoon-orchestrator) doesn't have many stars yet but I have deployed it on a medium sized hotel chain for several data sources with a similar use case to yours and it's been working for over a year with no intervention. If you decide to pursue this option I'd be willing to provide provide some support free of charge (feel free to PM me).
-
Impress your friends! Make a serverless bot that sends daily jokes to a Telegram Group
Typhoon Orchestrator is a great way to deploy ETL workflow on AWS Lambda. In this tutorial we intend to show how easy to use and versatile it is by deploying code to Lambda that gets a random joke from https://jokeapi.dev once a day and sends it to your telegram group.
-
My Thirty Years of Dodging Repetitive Work with Automation Tools
I think there's space for an open source library that can help with what you described. We originally created https://github.com/typhoon-data-org/typhoon-orchestrator to orchestrate ETL workflows, which would be a superset of the use cases you described. Our next goal is to allow deployment to AWS lambda which can be a good compromise between getting locked in with SAAS and hosting your own infrastructure.
Also check out Zappa's scheduled tasks that have a similar goal and inspired our library.
- Airflow, you complete me! Compose YAML DAGs for Airflow with auto-complete with Typhoon (Open Source).
- Use Airflow? Composable elegant YAML DAGS that transpile to Airflow. Zero risk and no migration.
jmespath.py
-
Automating Nightly Local Database Refreshes from Azure Blob Storage with Docker
The Azure CLI lets us write queries to filter the results of the az storage blob list command. The queries are written in JMESPath, which is a query language for JSON. In this case, we are filtering the results to only include blobs that end with the .bacpac extension and then selecting the first one as ordered by the lastModified property. If there are no blobs found, the script exits with a failure code. If we find a blob, we download it to the local path specified by the localPath variable.
-
What's New in Python 3.12
For JSON there is the `jmespath` library which might help.
https://github.com/jmespath/jmespath.py
-
jq 1.7 Released
I love jq, but I also use JMESPath (especially with AWS CLI), yq (bundled with tomlq and xq as well), and dasel [2]. I also wish hclq [3] wasn't so dead!
[0]: https://jmespath.org/
-
Announcing serde-query 0.2.0
Probably writing the query side of things is a lot of the fun here, but there is actually a spec (and a complying Rust impl) you can hook into for this JQ-like querying: https://jmespath.org/ ( https://github.com/jmespath/jmespath.rs ).
- JMESPath
-
Spring Boot logging with Loki, Promtail, and Grafana (Loki stack)
Thanks to custom variables that use labels, we can create various filters for the dashboard. You can look up my configuration of variables and extend it with an analogy way for your own needs. At the top, I marked the filter with detected pods in selected namespace. In the lower part, you can see a preview of all labels that are associated with a single log line. Most labels are meta information that Promtail adds during scraping targets. This part of the Promtail configuration provides it. In this section, I also marked a few labels that not comes out-of-the box e.g. leavel , class , thread . We added these labels using the Promtail json stage. You need to know that Promtail processes scraped logs in a pipeline. A pipeline is comprised of a set of stages. json stage is a parsing stage that reads the log line as JSON and accepts JMESPath expressions to extract data.
- Who is JSON?
-
jmespath.py VS jertl - a user suggested alternative
2 projects | 31 Oct 2022
-
YAML value retrieval
The tester on the official website for JMESPath (what json_query is doing) has been useful to me: https://jmespath.org/
-
I've built a PathDict, a library that makes it easy to work with dicts!
Interesting. How does this compared to Jmespath? Not saying Jmespath is superior, just wondering whether you were aware of it.
What are some alternatives?
JokeAPI - REST API that serves uniformly and well formatted jokes in JSON, XML, YAML or plain text format that also offers a great variety of filtering methods
jq - Command-line JSON processor [Moved to: https://github.com/jqlang/jq]
Mage - 🧙 The modern replacement for Airflow. Mage is an open-source data pipeline tool for transforming and integrating data. https://github.com/mage-ai/mage-ai
jq - Command-line JSON processor
astro - Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow. [Moved to: https://github.com/astronomer/astro-sdk]
yq - yq is a portable command-line YAML, JSON, XML, CSV, TOML and properties processor
astro-sdk - Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
jfq - JSONata on the command line
pachyderm - Data-Centric Pipelines and Data Versioning
jello - CLI tool to filter JSON and JSON Lines data with Python syntax. (Similar to jq)
getting-started - This repository is a getting started guide to Singer.
yq - Command-line YAML, XML, TOML processor - jq wrapper for YAML/XML/TOML documents