StableLM
web-llm
| | StableLM | web-llm |
|---|---|---|
| Mentions | 43 | 43 |
| Stars | 15,853 | 9,822 |
| Growth | 0.2% | 9.6% |
| Activity | 5.0 | 9.1 |
| Last commit | about 1 month ago | 4 days ago |
| Language | Jupyter Notebook | TypeScript |
| License | Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
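As a rough illustration of how these two metrics can be read, the sketch below computes month-over-month growth from two star counts and converts an activity score into a "top N%" figure. The linear mapping from activity to percentile and the 0-10 scale are assumptions extrapolated from the single example above (9.0 corresponds to the top 10%); the site's actual formula is not given here. The function names and the sample star counts are hypothetical.

```python
def month_over_month_growth(previous_stars: int, current_stars: int) -> float:
    """Percentage change in stars between two monthly snapshots."""
    if previous_stars <= 0:
        raise ValueError("previous_stars must be positive")
    return 100.0 * (current_stars - previous_stars) / previous_stars


def activity_to_top_percent(activity: float) -> float:
    """Map an activity score on a 0-10 scale to a 'top N%' figure.

    ASSUMPTION: the scale is linear in percentile, extrapolated from the
    one documented data point (activity 9.0 -> top 10%).
    """
    if not 0.0 <= activity <= 10.0:
        raise ValueError("activity is expected to lie between 0 and 10")
    return round((10.0 - activity) * 10.0, 1)


if __name__ == "__main__":
    # Hypothetical star counts, chosen only to illustrate the formula.
    print(month_over_month_growth(1000, 1096))
    # Activity scores taken from the comparison table.
    for name, activity in [("StableLM", 5.0), ("web-llm", 9.1)]:
        print(name, activity_to_top_percent(activity))
```

Under this assumed mapping, a higher activity score always means a smaller "top N%" bracket, which matches how the two projects in the table compare.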
StableLM
- The Era of 1-bit LLMs: ternary parameters for cost-effective computing
  https://github.com/Stability-AI/StableLM?tab=readme-ov-file#...
- Stable LM 3B: Bringing Sustainable, High-Performance LMs to Smart Devices
  https://mistral.ai/news/announcing-mistral-7b/
  Looking at the 3B results (here https://github.com/Stability-AI/StableLM#stablelm-alpha-v2 ?), it looks like Mistral (which outperforms Llama-2 13B) is far more powerful.
- FreeWilly 1 and 2, two new open-access LLMs
  Does this mean Stability gave up on StableLM?
  I notice that the repo hasn’t been updated since April, and a question asking for an update has been ignored for at least a month: https://github.com/Stability-AI/StableLM/issues/83
- In five years, there will be no programmers left, believes Stability AI CEO
  I'm not "ignoring" StableLM, if anything it's the impetus for my post. The alpha models were so bad and unusable that it seems they may have simply abandoned the project. It's clear they basically didn't know what they were doing, which is silly for a company of their size and specialization.
- Losing the plot
  1) StableLM released 3B and 7B checkpoints trained on 800B tokens with a 4096 context size, but they perform very poorly on various benchmarks, and fine-tuning is discouraged with such a weak base model.
- UAE's Technology Innovation Institute Launches Open-Source "Falcon 40B" Large Language Model for Research & Commercial Utilization
  It is the best open-source model currently available. Falcon-40B outperforms LLaMA, StableLM, RedPajama, MPT, etc. See the OpenLLM Leaderboard.
- GPT API query
- Google "We Have No Moat, And Neither Does OpenAI"
- New to StableLM--is it possible to use this locally to fine-tune on a small subset of documents yet?
  Someone shared this link on another recent post
- [N] Stability AI releases StableVicuna: the world's first open source chatbot trained via RLHF
  Github: https://github.com/Stability-AI/StableLM
web-llm
- Show HN: I built a free in-browser Llama 3 chatbot powered by WebGPU
  Looks like it uses this: https://github.com/mlc-ai/web-llm
- What stack would you recommend to build a LLM app in React without a backend?
- When LLM doesn’t fit into memory, how to make it work?
  So I was playing with MLC WebLLM locally. I got my Mistral 7B model installed and quantised, then converted it with the MLC library to a Metal package for Apple chips. Now it takes only 3.5GB of memory.
- Show HN: Ollama for Linux – Run LLMs on Linux with GPU Acceleration
  Maybe they're talking about https://github.com/mlc-ai/mlc-llm which is used for web-llm (https://github.com/mlc-ai/web-llm)? Seems to be using TVM.
- Local embeddings model for javascript
- this makes deploying AI language models so much easier
  Link to GitHub for those who want to know about MLC straight from them. The web demo is cool but takes a long time to load the first time. https://github.com/mlc-ai/web-llm
- April 2023
  web-llm: Bringing large-language models and chat to web browsers. (https://github.com/mlc-ai/web-llm)
- Running a small model on a phone?
- Weekly Megathread - 14 May 2023
  WebLLM - https://mlc.ai/web-llm/
- WebLLM - Bringing LLMs based chatbot to your web browser
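
The 3.5GB figure quoted above for a quantised Mistral 7B is consistent with simple arithmetic: weight memory is roughly the parameter count times the bits per weight, divided by eight. The sketch below assumes a nominal 7 billion parameters and 4-bit quantisation (the comment does not state the bit width), and it ignores the KV cache, activations, and quantisation metadata, so real usage will be somewhat higher. The function name is hypothetical.

```python
def weight_memory_gb(num_params: int, bits_per_weight: float) -> float:
    """Approximate memory needed for model weights, in decimal gigabytes."""
    if num_params < 0 or bits_per_weight <= 0:
        raise ValueError("num_params must be >= 0 and bits_per_weight > 0")
    total_bytes = num_params * bits_per_weight / 8  # 8 bits per byte
    return total_bytes / 1e9


if __name__ == "__main__":
    # ASSUMPTION: nominal parameter count for a "7B" model.
    nominal_7b = 7_000_000_000
    # Compare half precision, 8-bit, and the assumed 4-bit quantisation.
    for bits in (16, 8, 4):
        print(f"{bits}-bit: {weight_memory_gb(nominal_7b, bits):.1f} GB")
```

The same estimate suggests why quantisation matters for in-browser and on-device use: at 16 bits per weight the same nominal model would need about four times the memory of the assumed 4-bit version.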
What are some alternatives?
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
chainlit - Build Conversational AI in minutes ⚡️
lm-evaluation-harness - A framework for few-shot evaluation of language models.
mlc-llm - Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
llama.cpp - LLM inference in C/C++
gpt4all - gpt4all: run open-source LLMs anywhere
ggml - Tensor library for machine learning
FastChat - An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Open-Assistant - OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
duckdb-wasm - WebAssembly version of DuckDB
alpaca_lora_4bit
triton - Development repository for the Triton language and compiler