Dataplane
amplify
Dataplane | amplify | |
---|---|---|
1 | 3 | |
184 | 10 | |
1.1% | - | |
8.3 | 7.5 | |
4 months ago | 9 months ago | |
Go | Go | |
Business Source License 1.1 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Dataplane
-
Airflow VS dataplane - a user suggested alternative
2 projects | 3 May 2022
Dataplane is an Airflow inspired data platform to automate, schedule and design data pipelines and workflows written in Golang.
amplify
-
Jupyter Lab Extension to run your GPU-heavy stuff (for free for now) on somebody's else server without blocking yours
When using Jupyter Lab and running GPU-heavy notebooks are you annoyed that your computer is not usable for anything else? I made an extension which allows you to run complex AI inference, training,... remotely on decentralized servers [see bacalhau.org]. This allows you to work on multiple GPU-heavy notebooks in parallel. For now Bacalhau is free, so this is a really cool way to run GPU stuff.
- Introducing Bacalhau Amplify: a tool/service that aims to automatically enrich, enhance, and explain your data
-
[P] CleanVision: Audit your Image Data for better Computer Vision
Nice! Thanks for this. Will try this in a project I'm working on: https://github.com/bacalhau-project/amplify/issues/26
What are some alternatives?
Airflow - Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
rtdl - rtdl makes it easy to build and maintain a real-time data lake
dagu - Yet another cron alternative with a Web UI, but with much more capabilities. It aims to solve greater problems.
cleanvision - Automatically find issues in image datasets and practice data-centric computer vision.
transfer - Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift) in real-time.
cleanvision-examples - Notebooks demonstrating example applications of the cleanvision library
data-engineering-wiki - The best place to learn data engineering. Built and maintained by the data engineering community.
deailab - Decentralised AI Jupyter Lab Extension
JDR - Job Dependency Runner
memphis - Memphis.dev is a highly scalable and effortless data streaming platform
dagster - An orchestration platform for the development, production, and observation of data assets.
starthinker - Reference framework for building data workflows provided by Google. Accelerates authentication, logging, scheduling, and deployment of solutions using GCP. To borrow a tagline.. "The framework for professionals with deadlines."