nimbo
Ray
nimbo | Ray | |
---|---|---|
5 | 42 | |
123 | 31,101 | |
- | 1.6% | |
8.8 | 10.0 | |
over 2 years ago | 6 days ago | |
Python | Python | |
GNU General Public License v3.0 only | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
nimbo
-
Show HN: SpotML – Managed ML Training on Cheap AWS/GCP Spot Instances
You should really mention / give attribution / emphasize more that this is a fork of https://spotty.cloud and you took a lot from https://github.com/nimbo-sh/nimbo as well.
-
Show HN: Nimbo – Run jobs (and notebooks) on AWS with a single command
Hey everyone,
I (Miguel) am an ML PhD from the University of Edinburgh and Juozas is a Software Engineer also from Edinburgh.
Together we developed Nimbo, a dead-simple CLI that wraps the AWS CLI, allowing you to run code on AWS as if you were running it locally. You can find the source code here (https://github.com/nimbo-sh/nimbo) and the docs here (https://docs.nimbo.sh).
We decided to build this because we were frustrated with how cumbersome using AWS was, and we just wanted to be able to run jobs on AWS as easily as we run them locally. At the same time, we wanted to make use of the cheap spot instances (on Nimbo, this is a single parameter). All in all, we didn't like the current user experience of working with AWS, and we believed it was possible to vastly improve it.
For this reason, we also provide many useful commands to make it faster and easier to work with AWS, such as launching notebooks on EC2, easily checking prices, logging onto an instance, or syncing data to/from S3 (you can see some useful commands at https://docs.nimbo.sh/useful-commands).
Unlike other similar services, we are solely client-side, meaning that the code runs on your EC2 instances and data is stored in your S3 buckets (we don't have a server; all the infrastructure orchestration happens in the Nimbo package). We are also open contribution, meaning that all the source code is publicly available on our GitHub, and we welcome community contribution.
We have tons of ideas for Nimbo, like adding docker support, and providing instances with preloaded datasets like ImageNet, so that you don't have to download and store it yourself - you simply spin the instance, and the dataset is available at /datasets. We are currently working on adding GCP support, so that you can use AWS or GCP with the same config file.
We are happy to receive any feedback and suggestions you have.
-
[P] Nimbo: Run jobs on AWS with a single command
My friend and I just launched Nimbo, a dead-simple CLI that wraps AWS CLI, allowing you to run code on AWS as if you were running it locally. GitHub: https://github.com/nimbo-sh/nimbo. Docs: https://docs.nimbo.sh.
Ray
-
Open Source Advent Fun Wraps Up!
22. Ray | Github | tutorial
-
Fine-Tuning Llama-2: A Comprehensive Case Study for Tailoring Custom Models
Training times for GSM8k are mentioned here: https://github.com/ray-project/ray/tree/master/doc/source/te...
- Ray – an open source project for scaling AI workloads
-
Methods to keep agents inside grid world.
Here's a reference from RLlib that points to docs and an example, and here's one from one of my projects that includes all my own implementations
-
TransformerXL + PPO Baseline + MemoryGym
RLlib
- Is dynamic action masking possible in Rllib?
-
AWS re:Invent 2022 Recap | Data & Analytics services
⦿ AWS Glue Data Quality - Automatic data quality rule recommendations based on your data AWS Glue for Ray - Data integration with Ray (ray.io), a popular new open-source compute framework that helps you scale Python workloads
-
Think about it for a second
https://ray.io (just dropping the link)
-
Elixir Livebook now as a desktop app
I've wondered whether it's easier to add data analyst stuff to Elixir that Python seems to have, or add features to Python that Erlang (and by extension Elixir) provides out of the box.
By what I can see, if you want multiprocessing on Python in an easier way (let's say running async), you have to use something like ray core[0], then if you want multiple machines you need redis(?). Elixir/Erlang supports this out of the box.
Explorer[1] is an interesting approach, where it uses Rust via Rustler (Elixir library to call Rust code) and uses Polars as its dataframe library. I think Rustler needs to be reworked for this usecase, as it can be slow to return data. I made initial improvements which drastically improves encoding (https://github.com/elixir-nx/explorer/pull/282 and https://github.com/elixir-nx/explorer/pull/286, tldr 20+ seconds down to 3).
[0] https://github.com/ray-project/ray
-
Learn various techniques to reduce data processing time by using multiprocessing, joblib, and tqdm concurrent
Adding these for anyone who had a similar question about Ray vs dask 1, 2, 3
What are some alternatives?
criu-image-streamer - Enables streaming of images to and from CRIU during checkpoint/restore with low overhead
optuna - A hyperparameter optimization framework
dbt-spark - dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
nimbo-examples
Faust - Python Stream Processing
gevent - Coroutine-based concurrency library for Python
stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
SCOOP (Scalable COncurrent Operations in Python) - SCOOP (Scalable COncurrent Operations in Python)
Thespian Actor Library - Python Actor concurrency library
Dask - Parallel computing with task scheduling
django-celery - Old Celery integration project for Django