llama.onnx
llama2.openvino
llama.onnx | llama2.openvino | |
---|---|---|
2 | 3 | |
324 | 43 | |
- | - | |
7.3 | 7.9 | |
10 months ago | 2 months ago | |
Python | Python | |
GNU General Public License v3.0 only | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
llama.onnx
-
Qnap TS-264
You can find LLM models in the onnx format here: https://github.com/tpoisonooo/llama.onnx
-
Langchain question and answer without openai
You also need a LLM to do this. Please check this out to pick one up from the llama family. Other works like llama.onnx, alpaca-native and llama model on hugging face are also worth checking.
llama2.openvino
-
Optimum Intel OpenVino Performance
Code adapted from https://github.com/OpenVINO-dev-contest/llama2.openvino
- Qnap TS-264
- Intel arc gpu price drop - inexpensive llama.cpp opencl inference accelerator?
What are some alternatives?
llama.cpp - LLM inference in C/C++
openvino - OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Chinese-LLaMA-Alpaca - 中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
stable_diffusion_arc - Stable Difussion inference on Intel Arc dGPUs
fastT5 - ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
openvino_notebooks - 📚 Jupyter notebook tutorials for OpenVINO™
motorhead - 🧠 Motorhead is a memory and information retrieval server for LLMs.
tiny_llm_finetuner - LLM finetuning on Intel XPUs - LoRA on intel discrete GPUs
AST-1 - Join the movement led by IZX.ai to create the world's best open-source LLM.
LocalAI - :robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.