Our great sponsors
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
https://github.com/oobabooga/text-g eneration-webui/issues/147#issuecom ment-1454798725
With flexgen I believe it should be possible to run on a typical high end system. They have run a 175B parameter model with it. See here: https://github.com/FMInference/FlexGen
See here for full details: https://github.com/oobabooga/text-generation-webui/issues/147