covalent
kestra
covalent | kestra | |
---|---|---|
4 | 32 | |
692 | 6,428 | |
2.5% | 8.7% | |
8.6 | 9.9 | |
7 days ago | 4 days ago | |
Python | Java | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
covalent
-
Remote execution of code
Pretty interesting request, if SSH is not used, i would try using something like dask which uses tcp to connect and execute assuming your workers are in another machine.I also think something like covalent can be used to extend your own custom plugin in their ecosystem to connect how you want. We have a very custom private plugin written on top of covalent's to have a custom protocol to connect our central on-prem GPU machines to our local laptops that is rpc based, mostly for high performance as well as some mandate security from where the GPU machines are. Once done it is pretty much something like
-
Prefect alternatives meant for Slurm (HPC)
Does anyone here have any suggestions of alternatives tailored for Slurm on HPC? I know Covalent is one option, but I'm curious about others as well. Ideally the platform should be Pythonic, have a GUI, and be reasonably active/well-maintained.
- Show HN: Covalent – distributed computing for ML, HPC and Quantum (open source)
-
Your strategies for offloading computation
Came across this new tool exactly for this - https://github.com/AgnostiqHQ/covalent
kestra
-
A High-Performance, Java-Based Orchestration Platform
Kestra's communication is asynchronous and based on a queuing mechanism. It leverages the Micronaut framework and offers two runners: one that uses a database (JDBC) for both the message queue and resource storage, and another that uses Kafka as the message queue and Elasticsearch as the resource storage. The platform is fully extensible and plugin-based, providing a rich set of plugins for various workflow tasks, triggers, and data storage options. For those interested, the GitHub repository is available here: https://github.com/kestra-io/kestra
- Kestra is an open-source data orchestration platform for complex workflows
- YAML-based data orchestrator
- Kestra
-
Introduction to Kestra, the open source data orchestration and scheduling platform
For everyone wondering https://github.com/kestra-io/kestra/discussions/468
-
Snowflake data pipeline with Kestra
If you need any guidance with your Snowflake deployment, our experts at Kestra would love to hear from you. Let us know if you would like us to add more plugins to the list. Or start building your custom Kestra plugin today and send it our way. We always welcome contributions!
-
Airflow's Problem
But I totally agree that a large static dag is not appropriate in the actual data world with data mesh and domain responsibility.
[0] https://github.com/kestra-io/kestra
-
Ask HN: Open-source with Kafka as dependencies, is this a instant turn off?
- We have plans to add another option that will replace both dependencies with jdbc (https://github.com/kestra-io/kestra/pull/368), is theses dependencies more comfortable for you?
-
ELT vs ETL: Why not both?
With Kestra's innate flexibility, and many integrations, you are not locked into the choice of one ingestion method or the other. Complex workflows can be developed, whether in parallel or sequentially, to deliver both ELT and ETL processes. Simple descriptive yaml is used to connect plugins, and to create flows. Because workflows created in Kestra are represented visually, and issues can be seen in relation to individual tasks, there is no need to fear complexity. Trouble can be traced to its source in an instant, allowing you to try new things and come up with a new solution without fear. Give it a try, and let us know what you come up with!
-
Debezium Change Data Capture without Kafka Connect
Kestra is an orchestration and scheduling platform that is designed to simplify the building, running, scheduling, and monitoring of complex data pipelines. Data pipelines can be built in real-time, no matter how complex the workflow, and can connect to multiple resources as needed (including Debezium).
What are some alternatives?
SmartSim - SmartSim Infrastructure Library.
conductor - Conductor is a microservices orchestration engine.
cadence-python - Python framework for Cadence Workflow Service
zeebe - Distributed Workflow Engine for Microservices Orchestration
mlnotify - 🔔 No need to keep checking your training - just one import line and you'll know the second it's done.
kogito-runtimes - This repository is a fork of apache/incubator-kie-kogito-runtimes. Please use upstream repository for development.
dagster - An orchestration platform for the development, production, and observation of data assets.
debezium - Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
bytehub - ByteHub: making feature stores simple
akhq - Kafka GUI for Apache Kafka to manage topics, topics data, consumers group, schema registry, connect and more...
streetdensityai - This YoloV5 based model is fit to detect people and different types of land vehicles, and displaying their density on a fitted map, according to their coordinates and detected labels.
flyte - Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.