FlexGen
Open-Assistant
FlexGen | Open-Assistant | |
---|---|---|
39 | 329 | |
9,007 | 36,647 | |
0.8% | 0.3% | |
3.0 | 8.3 | |
15 days ago | 9 days ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
FlexGen
- Run 70B LLM Inference on a Single 4GB GPU with This New Technique
- Colorful Custom RTX 4060 Ti GPU Clocks Outed, 8 GB VRAM Confirmed
-
Local Alternatives of ChatGPT and Midjourney
LLaMA, Pythia, RWKV, Flan-T5 (self-hosted), FlexGen
- FlexGen: Running large language models on a single GPU
-
Show HN: Finetune LLaMA-7B on commodity GPUs using your own text
> With no real knowledge of LLM and only recently started to understand what LLM terms mean, such as 'model, inference, LLM model, intruction set, fine tuning' whatelse do you think is required to make a took like yours?
This was mee a few weeks ago. I got interested in all this when FlexGen (https://github.com/FMInference/FlexGen) was announced, which allowed to run inference using OPT model on consumer hardware. I'm an avid user of Stable Diffusion, and I wanted to see if I can have an SD equivalent of ChatGPT.
Not understanding the details of hyperparameters or terminology, I basically asked ChatGPT to explain to me what these things are:
Explain to someone who is a software engineer with limited knowledge of ML terms or linear algebra, what is "feed forward" and "self-attention" in the context of ML and large language models. Provide examples when possible.
- Could this new flexgen be used in place of GPTq? or is this different?
- OpenAI is expensive
Open-Assistant
-
Best open source AI chatbot alternative?
For open assistant, the code: https://github.com/LAION-AI/Open-Assistant/tree/main/inference
-
GPT-4 Turbo for free with no sign up, and most importantly no Bing
Is this being used to collect chat results for synthetic data and/or training like https://github.com/LAION-AI/Open-Assistant did? I believe they gave away GPT-4 api calls via a text interface and absorbed the cost to later build a dataset of chats.
-
OpenAI now sends email threats?!
https://open-assistant.io seems to have the same guardrails, as ChatGPT. Tried it on several prompts and it wouldn't comply.
- ChatGPT-Antworten nach Schulnoten bewerten
-
Chat GPT Alternatives?
Open-Assistant [https://open-assistant.io/]
-
What are the best AI tools you've ACTUALLY used?
Open Assistant by LAION AI on GitHub
-
Keep Artificial Intelligence Free, protect it from monopolies: please sign this petition
To add to this if you want something for free or at least close to free, contribute to OpenSource projects like https://open-assistant.io/
-
If I had to get someone from total zero to ChatGPT power user
Also, there are fairly useful alternatives like GPT4ALL and Open Assistant that you can run locally.
-
Compiling a Comprehensive List of Publicly Usable LLM Q&A Services - Need Your Input!
https://open-assistant.io - oasst-sft-6-llama-30b
- Proposal for a Crowd-Sourced AI Feedback System
What are some alternatives?
llama - Inference code for Llama models
KoboldAI-Client
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
text-generation-inference - Large Language Model Text Generation Inference
llama.cpp - LLM inference in C/C++
whisper.cpp - Port of OpenAI's Whisper model in C/C++
DeepSpeed - DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
gpt4all - gpt4all: run open-source LLMs anywhere
audiolm-pytorch - Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
stanford_alpaca - Code and documentation to train Stanford's Alpaca models, and generate the data.