transformers vs Ray

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX. (by huggingface)

Source Code

huggingface.co


Ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. (by ray-project)

Source Code

ray.io

Docs


transformers	Project	Ray
174	Mentions	42
124,557	Stars	30,988
2.7%	Growth	3.1%
10.0	Activity	10.0
5 days ago	Latest Commit	4 days ago
Python	Language	Python
Apache License 2.0	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
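
The two derived metrics above can be illustrated with a small sketch. The exact formulas used by the site are not given here, so both functions are assumptions: `monthly_growth` is a plain percentage change, and `activity_score` is a hypothetical percentile rank scaled to 0-10, chosen only because it reproduces the "9.0 means top 10%" reading.

```python
from bisect import bisect_left
from typing import Iterable


def monthly_growth(stars_now: int, stars_month_ago: int) -> float:
    """Month-over-month growth in stars, as a percentage (assumed formula)."""
    return 100.0 * (stars_now - stars_month_ago) / stars_month_ago


def activity_score(raw: float, all_raw_scores: Iterable[float]) -> float:
    """Hypothetical activity score: share of tracked projects ranked below, scaled to 0-10."""
    ranked = sorted(all_raw_scores)
    # Number of tracked projects whose raw score is strictly lower than this one.
    below = bisect_left(ranked, raw)
    return round(10.0 * below / len(ranked), 1)


if __name__ == "__main__":
    print(monthly_growth(1100, 1000))
    print(activity_score(90, list(range(100))))
```

How the raw score itself would weight recent commits more heavily than older ones is left out, since the page does not describe it.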

transformers

Posts with mentions or reviews of transformers. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-21.

Lossless Acceleration of LLM via Adaptive N-Gram Parallel Decoding
3 projects | news.ycombinator.com | 21 Apr 2024

The HuggingFace transformers library already has support for a similar method called prompt lookup decoding that uses the existing context to generate an ngram model: https://github.com/huggingface/transformers/issues/27722
I don't think it would be that hard to switch it out for a pretrained ngram model.
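
To make the idea in this comment concrete, here is a minimal sketch of the candidate-proposal step behind prompt lookup decoding: match the last few tokens against the earlier context and propose whatever followed that match as draft tokens. This is not the transformers implementation; the function name `propose_from_prompt` and its parameters are hypothetical, and the verification step (checking the drafts against the real model) is omitted.

```python
from typing import List


def propose_from_prompt(
    tokens: List[int], ngram_size: int = 3, num_candidates: int = 5
) -> List[int]:
    """Propose draft tokens by finding the trailing n-gram earlier in the context."""
    # Try the longest n-gram first, then fall back to shorter ones.
    for n in range(min(ngram_size, len(tokens) - 1), 0, -1):
        tail = tokens[-n:]
        # Scan right to left so the most recent earlier match wins;
        # the upper bound excludes the trailing n-gram itself.
        for start in range(len(tokens) - n - 1, -1, -1):
            if tokens[start:start + n] == tail:
                # The tokens that followed the earlier occurrence become the draft.
                return tokens[start + n:start + n + num_candidates]
    return []


if __name__ == "__main__":
    context = [1, 2, 3, 4, 5, 1, 2, 3]
    print(propose_from_prompt(context, ngram_size=3, num_candidates=2))
```

Swapping in a pretrained n-gram model, as the comment suggests, would amount to replacing the context scan with a lookup into that model while keeping the same draft-then-verify loop.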
AI enthusiasm #6 - Finetune any LLM you want💡
2 projects | dev.to | 16 Apr 2024

Most of this tutorial is based on Hugging Face course about Transformers and on Niels Rogge's Transformers tutorials: make sure to check their work and give them a star on GitHub, if you please ❤️
Schedule-Free Learning – A New Way to Train
3 projects | news.ycombinator.com | 6 Apr 2024

* Superconvergence + LR range finder + Fast AI's Ranger21 optimizer was the go-to optimizer for CNNs, and worked fabulously well, but on transformers, the learning rate range finder said 1e-3 was the best, whilst 1e-5 was better. However, the 1 cycle learning rate stuck. https://github.com/huggingface/transformers/issues/16013
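
For readers unfamiliar with the "1 cycle" schedule mentioned in this comment, a rough sketch follows: the learning rate warms up linearly to a peak and then anneals down to a much smaller value. The warmup fraction, divisors, and cosine annealing are assumptions for illustration, not the commenter's settings; only the 1e-5 peak is taken from the comment.

```python
import math


def one_cycle_lr(
    step: int,
    total_steps: int,
    max_lr: float = 1e-5,           # peak value, taken from the comment above
    pct_warmup: float = 0.3,        # assumed fraction of steps spent warming up
    div_factor: float = 25.0,       # assumed: initial_lr = max_lr / div_factor
    final_div_factor: float = 1e4,  # assumed: min_lr = initial_lr / final_div_factor
) -> float:
    """Learning rate at `step` for a linear-warmup, cosine-anneal one-cycle schedule."""
    initial_lr = max_lr / div_factor
    min_lr = initial_lr / final_div_factor
    warmup_steps = max(1, int(pct_warmup * total_steps))
    if step < warmup_steps:
        # Linear ramp from initial_lr up to max_lr.
        frac = step / warmup_steps
        return initial_lr + frac * (max_lr - initial_lr)
    # Cosine anneal from max_lr down to min_lr over the remaining steps.
    frac = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return min_lr + 0.5 * (max_lr - min_lr) * (1.0 + math.cos(math.pi * frac))


if __name__ == "__main__":
    for s in (0, 30, 65, 100):
        print(s, one_cycle_lr(s, total_steps=100))
```

A learning rate range finder would normally be used to pick `max_lr`; the comment's point is that on transformers its suggestion (1e-3) was too high.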
Gemma doesn't suck anymore – 8 bug fixes
3 projects | news.ycombinator.com | 11 Mar 2024

Thanks! :) I'm pushing them into transformers, pytorch-gemma and collabing with the Gemma team to resolve all the issues :)
The RoPE fix should already be in transformers 4.38.2: https://github.com/huggingface/transformers/pull/29285
My main PR for transformers which fixes most of the issues (some still left): https://github.com/huggingface/transformers/pull/29402
HuggingFace Transformers: Qwen2
1 project | news.ycombinator.com | 11 Jan 2024
HuggingFace Transformers Release v4.36: Mixtral, Llava/BakLlava, SeamlessM4T v2
1 project | news.ycombinator.com | 13 Dec 2023
HuggingFace: Support for the Mixtral Moe
1 project | news.ycombinator.com | 11 Dec 2023
Paris-Based Startup and OpenAI Competitor Mistral AI Valued at $2B
4 projects | news.ycombinator.com | 10 Dec 2023

If you want to tinker with the architecture Hugging Face has a FOSS implementation in transformers: https://github.com/huggingface/transformers/blob/main/src/tr...
If you want to reproduce the training pipeline, you couldn't do that even if you wanted to because you don't have access to thousands of A100s.
Fail to reproduce the same evaluation metrics score during inference.
1 project | /r/LocalLLaMA | 10 Dec 2023

I am aware that using mixed precision reduces the stability of the weights and that there will be some inconsistency, but I didn't expect it to be this much. I have attached the graph of evaluation metrics. If someone can give me some insight into this issue, that would be great.
[D] What is a good way to maintain code readability and code quality while scaling up complexity in libraries like Hugging Face?
3 projects | /r/MachineLearning | 10 Dec 2023

In transformers, they tried really hard to have a single function or method to deal with both self and cross attention mechanisms, masking, positional and relative encodings, interpolation etc. While it allows a user to use the same function/method for any model, it has led to severe parameter bloat. Just compare the original implementation of llama by FAIR with the implementation by HF to get an idea.

Ray

Posts with mentions or reviews of Ray. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-05.

Open Source Advent Fun Wraps Up!
10 projects | dev.to | 5 Jan 2024

22. Ray | Github | tutorial
Fine-Tuning Llama-2: A Comprehensive Case Study for Tailoring Custom Models
1 project | news.ycombinator.com | 11 Aug 2023

Training times for GSM8k are mentioned here: https://github.com/ray-project/ray/tree/master/doc/source/te...
Ray – an open source project for scaling AI workloads
1 project | news.ycombinator.com | 11 Aug 2023
Methods to keep agents inside grid world.
1 project | /r/reinforcementlearning | 30 Jun 2023

Here's a reference from RLlib that points to docs and an example, and here's one from one of my projects that includes all my own implementations
TransformerXL + PPO Baseline + MemoryGym
10 projects | /r/reinforcementlearning | 15 Feb 2023

RLlib
Is dynamic action masking possible in Rllib?
1 project | /r/reinforcementlearning | 23 Jan 2023
AWS re:Invent 2022 Recap | Data & Analytics services
1 project | dev.to | 3 Jan 2023

⦿ AWS Glue Data Quality - Automatic data quality rule recommendations based on your data
⦿ AWS Glue for Ray - Data integration with Ray (ray.io), a popular new open-source compute framework that helps you scale Python workloads
Think about it for a second
1 project | /r/mathmemes | 19 Oct 2022

https://ray.io (just dropping the link)
Elixir Livebook now as a desktop app
12 projects | news.ycombinator.com | 2 Aug 2022

I've wondered whether it's easier to add data analyst stuff to Elixir that Python seems to have, or add features to Python that Erlang (and by extension Elixir) provides out of the box.
From what I can see, if you want multiprocessing on Python in an easier way (let's say running async), you have to use something like ray core[0], then if you want multiple machines you need redis(?). Elixir/Erlang supports this out of the box.
Explorer[1] is an interesting approach, where it uses Rust via Rustler (Elixir library to call Rust code) and uses Polars as its dataframe library. I think Rustler needs to be reworked for this use case, as it can be slow to return data. I made initial improvements which drastically improve encoding (https://github.com/elixir-nx/explorer/pull/282 and https://github.com/elixir-nx/explorer/pull/286, tldr 20+ seconds down to 3).
[0] https://github.com/ray-project/ray
Learn various techniques to reduce data processing time by using multiprocessing, joblib, and tqdm concurrent
1 project | /r/Python | 13 Jul 2022

Adding these for anyone who had a similar question about Ray vs dask 1, 2, 3

What are some alternatives?

When comparing transformers and Ray you can also consider the following projects:

fairseq - Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

optuna - A hyperparameter optimization framework

sentence-transformers - Multilingual Sentence & Image Embeddings with BERT

stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

llama - Inference code for Llama models

Faust - Python Stream Processing

transformer-pytorch - Transformer: PyTorch Implementation of "Attention Is All You Need"

gevent - Coroutine-based concurrency library for Python

text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

huggingface_hub - The official Python client for the Huggingface Hub.

SCOOP (Scalable COncurrent Operations in Python) - SCOOP (Scalable COncurrent Operations in Python)
