SaaSHub helps you find the best software and product alternatives Learn more →
mpt-30B-inference Alternatives
Similar projects and alternatives to mpt-30B-inference
-
exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
-
llm-rp
Discontinued ✨ Your Custom Offline Role Play with LLM and Stable Diffusion on Mac and Linux (for now) 🧙♂️
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
mpt-30B-inference reviews and mentions
- New open-source model with 8k context runs on CPU, outperforms GPT-3
- MPT 30B inference code using CPU
-
[D] Is there an efficient way to make inferences with open-source LLM?
4-bit. I've used this implementation: https://github.com/abacaj/mpt-30B-inference/tree/main
-
A note from our sponsor - SaaSHub
www.saashub.com | 22 May 2024
Stats
abacaj/mpt-30B-inference is an open source project licensed under MIT License which is an OSI approved license.
The primary programming language of mpt-30B-inference is Python.
Sponsored