Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
GPTQ-for-LLaMa-API Alternatives
Similar projects and alternatives to GPTQ-for-LLaMa-API
-
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a better GPTQ-for-LLaMa-API alternative or higher similarity.
GPTQ-for-LLaMa-API reviews and mentions
Posts with mentions or reviews of GPTQ-for-LLaMa-API.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-05-25.
- Alternative ways for running models locally and hosting APIs
-
Can someone explain why there isn't a good interface for the oobabooga api in langchain?
oobabooga has to support way too many models, so making the whole thing unnecessarily complicated. If you have some development experience, maybe you would build your own API in a few lines of Python code. It's not hard if you build from scratch and learn along the way. I have built some example repositories for hosting GPTQ-related models. You can have a look at them. https://github.com/mzbac/GPTQ-for-LLaMa-API https://github.com/mzbac/gptq-cuda-api
-
Looking to selfhost Llama on remote server, could use some help
I ran this https://github.com/mzbac/GPTQ-for-LLaMa-API for my home server. It should be easy enough to create a Dockerfile and make it hostable via Docker.
-
How do I load a gptq LLaMA model (Vicuna) in .safetensors format?
If you have some experience with Python, you can take a look at my repo. It only has the minimal logic of how to load a GPTQ model and serve it as an API. https://github.com/mzbac/GPTQ-for-LLaMa-API
-
Just create a repository to show how to serve GPTQ model via an API
Hopefully, it will make it easier for any developer who wants to build some integration with their app. https://github.com/mzbac/GPTQ-for-LLaMa-API
-
A note from our sponsor - InfluxDB
www.influxdata.com | 3 May 2024
Stats
Basic GPTQ-for-LLaMa-API repo stats
5
40
4.7
12 months ago
mzbac/GPTQ-for-LLaMa-API is an open source project licensed under Apache License 2.0 which is an OSI approved license.
The primary programming language of GPTQ-for-LLaMa-API is Python.
Popular Comparisons
Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com