llama-mps vs bitsandbytes-win-prebuilt

| | llama-mps | bitsandbytes-win-prebuilt |
|---|---|---|
| Mentions | 4 | 4 |
| Stars | 83 | 76 |
| Growth | - | - |
| Activity | 3.8 | 10.0 |
| Last commit | 9 months ago | over 1 year ago |
| Language | Python | - |
| License | GNU General Public License v3.0 only | - |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed; recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is among the top 10% of the most actively developed projects that we are tracking.
llama-mps
llama.cpp now officially supports GPU acceleration

There are currently at least three ways to run LLaMA on an M1 Mac with GPU acceleration:
- mlc-llm (pre-built, but only one model has been ported so far)
- tinygrad (very memory-efficient, but not easy to integrate into other projects)
- llama-mps (the original LLaMA codebase plus LLaMA-Adapter support)
LLaMA-7B in Pure C++ with full Apple Silicon support

There is also a GPU-accelerated fork of the original repo:
https://github.com/remixer-dec/llama-mps
Llama-CPU: Fork of Facebook's LLaMA model to run on CPU
[D] Tutorial: Run LLaMA on 8gb vram on windows (thanks to bitsandbytes 8bit quantization)

I tried to port the llama-cpu version to a GPU-accelerated MPS version for Macs. It runs, but the outputs are not as good as expected and it often emits "-1" tokens. Any help and contributions on fixing it are welcome!
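As a starting point for debugging that symptom, here is a minimal, hypothetical sketch (not taken from the llama-mps codebase) showing standard PyTorch MPS device selection and a sampler that guards against non-finite logits. torch.multinomial has been reported to return invalid indices when fed NaN probabilities, so non-finite fp16 logits are one plausible source of the "-1" tokens.

```python
import torch

# Prefer Apple's Metal (MPS) backend when available, else fall back to CPU.
device = torch.device("mps" if torch.backends.mps.is_available() else "cpu")

def safe_sample(logits: torch.Tensor, temperature: float = 0.8) -> int:
    """Sample a token id while guarding against non-finite logits.

    Hypothetical guard: fp16 kernels on some MPS builds can emit NaN/inf
    values, and torch.multinomial over a NaN probability vector can return
    invalid indices. Sampling is done on CPU for portability.
    """
    logits = torch.nan_to_num(logits.float().cpu(), nan=-1e9, posinf=1e9, neginf=-1e9)
    probs = torch.softmax(logits / temperature, dim=-1)
    return int(torch.multinomial(probs, num_samples=1).item())

# Toy usage with LLaMA's 32000-token vocabulary.
logits = torch.randn(32000, device=device)
print(safe_sample(logits))
```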
bitsandbytes-win-prebuilt
bitsandbytes now for Windows (8-bit CUDA functions for PyTorch)

There used to be a compiled version from https://github.com/DeXtmL/bitsandbytes-win-prebuilt, but now I see there is a new version (from last week) at https://github.com/acpopescu/bitsandbytes/releases, which looks like it may become the start of Windows support in the official repo.
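For context, here is a minimal sketch of what the 8-bit kernels enable, assuming the Hugging Face transformers integration with bitsandbytes installed; the model id and prompt are placeholders.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder model id; any decoder-only checkpoint works the same way.
model_id = "huggyllama/llama-7b"

tokenizer = AutoTokenizer.from_pretrained(model_id)
# load_in_8bit routes the linear layers through bitsandbytes' int8 kernels,
# roughly halving memory versus fp16 (which is how 7B fits in 8 GB of VRAM).
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    load_in_8bit=True,
    device_map="auto",
)

inputs = tokenizer("Hello, my name is", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0]))
```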
[D] Tutorial: Run LLaMA on 8gb vram on windows (thanks to bitsandbytes 8bit quantization)

Put libbitsandbytes_cuda116.dll in C:\Users\xxx\miniconda3\envs\textgen\lib\site-packages\bitsandbytes\
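A quick way to confirm the DLL landed in the right package directory, whatever your environment path is, is a small check like this sketch:

```python
import importlib.util
import os

# Locate the installed bitsandbytes package without importing it
# (the import itself can fail if the DLL is missing).
spec = importlib.util.find_spec("bitsandbytes")
pkg_dir = os.path.dirname(spec.origin)
dll = os.path.join(pkg_dir, "libbitsandbytes_cuda116.dll")
print("package dir:", pkg_dir)
print("DLL present:", os.path.exists(dll))
```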
Running Pygmalion 6b with 8GB of VRAM

Download these 2 DLL files from here, then move them into "installer_files\env\lib\site-packages\bitsandbytes\" under your oobabooga root folder (where you extracted the one-click installer).
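The copy step can also be scripted; this is a hedged sketch where every path is a placeholder for your own install, and the two filenames assume the CPU and CUDA 11.6 builds shipped by the prebuilt repo.

```python
import shutil
from pathlib import Path

# Placeholder paths: adjust the oobabooga root and download location
# to match your own setup.
oobabooga_root = Path(r"C:\oobabooga")
dest = oobabooga_root / "installer_files" / "env" / "lib" / "site-packages" / "bitsandbytes"

# Assumed filenames for the two prebuilt DLLs.
for name in ("libbitsandbytes_cpu.dll", "libbitsandbytes_cuda116.dll"):
    shutil.copy2(Path.home() / "Downloads" / name, dest / name)
    print("copied", name, "->", dest)
```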
Has anyone gotten the models to load via 8-bit for windows?!?!?
What are some alternatives?
- llama - Inference code for Llama models
- text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
- llama-cpu - Fork of Facebook's LLaMA model to run on CPU
- awesome-ml - Curated list of useful LLM / Analytics / Datascience resources
- bitsandbytes - 8-bit CUDA functions for PyTorch
- LLaMA_MPS - Run LLaMA inference on Apple Silicon GPUs.
- one-click-installers - Simplified installers for oobabooga/text-generation-webui.
- tinygrad - You like pytorch? You like micrograd? You love tinygrad! ❤️
- llama-dl - High-speed download of LLaMA, Facebook's 65B parameter GPT model [UnavailableForLegalReasons - Repository access blocked]