llama-dfdx
web-llm
llama-dfdx | web-llm | |
---|---|---|
2 | 43 | |
94 | 9,822 | |
- | 9.6% | |
7.3 | 9.1 | |
10 months ago | 3 days ago | |
Rust | TypeScript | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
llama-dfdx
-
rustformers/llm: Run inference for Large Language Models on CPU, with Rust 🦀🚀🦙
Not a maintainer, but dfdx can run llama with CUDA!
-
A brief history of LLaMA models
There's a rust deep learning library called dfdx that just setup llama: https://github.com/coreylowman/llama-dfdx
web-llm
-
Show HN: I built a free in-browser Llama 3 chatbot powered by WebGPU
Looks like it uses this: https://github.com/mlc-ai/web-llm
- What stack would you recommend to build a LLM app in React without a backend?
-
When LLM doesn’t fit into memory, how to make it work?
So I was playing with MLC webllm locally. I got my mistral 7B model installed and quantised. Converted it using mlc lib to metal package for Apple chips. Now it takes only 3.5GB of memory
-
Show HN: Ollama for Linux – Run LLMs on Linux with GPU Acceleration
Maybe they're talking about https://github.com/mlc-ai/mlc-llm which is used for web-llm (https://github.com/mlc-ai/web-llm)? Seems to be using TVM.
- Local embeddings model for javascript
-
this makes deploying AI language models so much easier
Link to github for those who want to know about MLC straight from them. Web demo is cool but takes a long time to load first time. https://github.com/mlc-ai/web-llm
-
April 2023
web-llm: Bringing large-language models and chat to web browsers. (https://github.com/mlc-ai/web-llm)
- Running a small model on a phone?
-
Weekly Megathread - 14 May 2023
WebLLM - https://mlc.ai/web-llm/
- WebLLM - Bringing LLMs based chatbot to your web browser
What are some alternatives?
llm - An ecosystem of Rust libraries for working with large language models
chainlit - Build Conversational AI in minutes ⚡️
LLaMA_MPS - Run LLaMA inference on Apple Silicon GPUs.
mlc-llm - Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
dalai - The simplest way to run LLaMA on your local machine
gpt4all - gpt4all: run open-source LLMs anywhere
wonnx - A WebGPU-accelerated ONNX inference run-time written 100% in Rust, ready for native and the web
StableLM - StableLM: Stability AI Language Models
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
FastChat - An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
peft - 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
duckdb-wasm - WebAssembly version of DuckDB