triton
Development repository for the Triton language and compiler (by ROCm)
bitsandbytes-rocm
8-bit CUDA functions for PyTorch, ported to HIP for use in AMD GPUs (by agrocylo)
Metric | triton | bitsandbytes-rocm
---|---|---
Mentions | 2 | 2
Stars | 77 | 36
Growth | - | -
Activity | 9.6 | 3.6
Last commit | 2 days ago | about 1 year ago
Language | C++ | Python
License | MIT License | MIT License
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
triton
Posts with mentions or reviews of triton.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-06-01.
-
How to use AMD GPU?
    cd ..
    git clone https://github.com/ROCmSoftwarePlatform/triton.git -b release/pytorch_2.0
    cd triton/python
    pip3 install cmake
    pip3 install -e .
-
Run Stable-Diffusion locally with a AMD GPU (7900XT) on Windows 11
Someone I know returned their 4080 (he said it had a horrible coil whine), and yesterday his new 7900XTX came in and he did some testing. He can't use xformers, and he did not have the SDP optimization on (in other words, no optimizations). Using 5.5.0 beta on Docker (that hurts a bit too), he was getting about 16 it/s at 512sq and roughly 5.25 it/s at 768sq. I had him try with the SDP optimization, but Docker is new to him and for some reason I saw no gains or losses when it was used (as if Docker ignored it). His next test will be for training (which is why he got the card, and why I will as well). Another thing that hurts is no Triton, but here is what he told me yesterday: "regarding the 7900 XTX. Inference is fine, around 16 it/s. I couldn't get the training to work, mostly because of what I assume is a bug with the ROCm fork of Triton that's currently in development ( https://github.com/ROCmSoftwarePlatform/triton )."
bitsandbytes-rocm
Posts with mentions or reviews of bitsandbytes-rocm.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-06-01.
-
How to use AMD GPU?
    pip uninstall bitsandbytes
    cd ..
    git clone https://github.com/agrocylo/bitsandbytes-rocm.git
    cd bitsandbytes-rocm
    make hip
    python setup.py install
-
Does anyone have a guide on how to get 4bit working on Arch Linux?
Step 11 use: bitsandbytes-rocm
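After running the install commands quoted in the posts above, it can help to confirm that both packages are visible to the current Python environment. The following is a minimal sketch, not part of either project: it assumes the packages install under the top-level module names `triton` and `bitsandbytes`, and the helper name `find_installed` is hypothetical. It only looks the modules up rather than importing them, which avoids running any import-time setup code.

```python
import importlib.util


def find_installed(module_names):
    """Map each top-level module name to True if Python can locate it."""
    # find_spec returns None when a top-level module cannot be found,
    # and does not execute the module itself.
    return {
        name: importlib.util.find_spec(name) is not None
        for name in module_names
    }


if __name__ == "__main__":
    # Module names are an assumption; adjust them if your install differs.
    for name, found in find_installed(["triton", "bitsandbytes"]).items():
        print(f"{name}: {'found' if found else 'missing'}")
```

A "missing" result usually means the install step ran against a different Python environment than the one executing the check.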
What are some alternatives?
When comparing triton and bitsandbytes-rocm you can also consider the following projects:
stable-diffusion-webui-amdgpu - Stable Diffusion web UI
GPTQ-for-LLaMa - 4 bits quantization of LLaMA using GPTQ