visual-chatgpt
| | visual-chatgpt | minimal-llama |
|---|---|---|
| Mentions | 50 | 4 |
| Stars | 31,684 | 456 |
| Growth | - | - |
| Activity | 8.9 | 8.5 |
| Last commit | about 1 year ago | 7 months ago |
| Language | Python | Python |
| License | MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
visual-chatgpt
-
Subtler Flex
For prompting, the GitHub repo I refer to the most is actually one from Microsoft: https://github.com/Microsoft/Visual-ChatGPT
-
OpenDILab Awesome Paper Collection: RL with Human Feedback (1)
Found relevant code at https://github.com/microsoft/visual-chatgpt + all code implementations here
-
[R] Low-code LLM: Visual Programming over LLMs - Yuzhe Cai et al , Microsoft Research Asia 2023
GitHub: https://github.com/microsoft/visual-chatgpt/tree/main/LowCodeLLM will soon be available!
-
It's been a week and there's already front ends for auto GPT. Is anyone else having a hard time keeping up with the take off? Are we seeing what exponential looks like?
It's not so much the size of the LLMs, it's what people are able to do with them in novel ways, e.g. https://github.com/microsoft/visual-chatgpt/tree/main/TaskMatrix.AI
- Taskmatrix.ai
- Introducing JARVIS : the new Microsoft's autonomous AI powered by HuggingGPT and ChatGPT.
-
How to make ChatGPT run tasks
There is a Python library, LangChain, which Microsoft used to give it access to image editing tools. You can write your own tools for it very easily; I've been having fun with it. I gave it internet search tools and it worked great at using current events in its responses.
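The idea of handing a model a set of tools can be sketched without any framework. The following is a minimal illustration in plain Python, not LangChain's actual API: the registry, the `register_tool` decorator, the `tool_name: input` output format, and the stubbed search function are all hypothetical names chosen for this example.

```python
from typing import Callable, Dict

# Hypothetical tool registry: each tool has a name, a description the model
# would see in its prompt, and a plain Python function to run.
TOOLS: Dict[str, dict] = {}


def register_tool(name: str, description: str):
    """Decorator that adds a function to the registry under `name`."""
    def wrap(func: Callable[[str], str]) -> Callable[[str], str]:
        TOOLS[name] = {"description": description, "func": func}
        return func
    return wrap


@register_tool("search", "Look up current events for a query.")
def fake_search(query: str) -> str:
    # Stand-in for a real web search call (assumption: no network access here).
    return f"(stub) top result for: {query}"


@register_tool("upper", "Convert text to upper case.")
def upper(text: str) -> str:
    return text.upper()


def tool_prompt() -> str:
    """Describe the available tools, one per line, for inclusion in a prompt."""
    return "\n".join(f"{n}: {t['description']}" for n, t in sorted(TOOLS.items()))


def run_action(model_output: str) -> str:
    """Parse a 'tool_name: input' line from the model and run that tool."""
    name, _, arg = model_output.partition(":")
    tool = TOOLS.get(name.strip())
    if tool is None:
        return f"unknown tool: {name.strip()}"
    return tool["func"](arg.strip())
```

In a real setup the string passed to `run_action` would come from the language model, and the tool's return value would be fed back into the next prompt; here both ends are left as stubs.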
-
An AI researcher who has been warning about the technology for over 20 years says we should 'shut it all down,' and issue an 'indefinite and worldwide' ban.
Something like this: https://github.com/microsoft/visual-chatgpt/tree/main/TaskMatrix.AI could enable a far simpler AI to start making copies of itself.
-
HuggingGPT: Solving AI Tasks with ChatGPT and Its Friends in HuggingFace
Reminds me of VisualChatGPT (https://github.com/microsoft/visual-chatgpt), which also uses an LLM to decide what vision models to run.
-
ChatGPT is Inform8: interactive fiction with dual narration and action planes (both playable), dialogues, inner thoughts and the ability to argue about narration to change the course of action.
The best Microsoft API or the best API overall? It really depends on what your goal is. https://github.com/huggingface is great if you want to dive right in, but https://github.com/microsoft/visual-chatgpt might be good as well (though it is not easy to get working locally).
minimal-llama
- Show HN: Finetune LLaMA-7B on commodity GPUs using your own text
-
Visual ChatGPT
I can't edit my comment now, but it's 30B that needs 18GB of VRAM.
LLaMA-13B, which is at GPT-3 175B level, only needs 10GB of VRAM with GPTQ 4-bit quantization.
>do you think there's anything left to trim? like weight pruning, or LoRA, or I dunno, some kind of Huffman coding scheme that lets you mix 4-bit, 2-bit and 1-bit quantizations?
Absolutely. The GPTQ paper claims negligible output quality loss with 3-bit quantization. The GPTQ-for-LLaMA repo supports 3-bit quantization and inference. So this extra 25% savings is already possible.
As of right now, GPTQ-for-LLaMa is using a VRAM-hungry attention method. Flash attention will reduce the requirements for 7B to 4GB and possibly fit 30B with a 2048 context window into 16GB, all before stacking 3-bit.
Pruning is a possibility but I'm not aware of anyone working on it yet.
LoRA has already been implemented. See https://github.com/zphang/minimal-llama#peft-fine-tuning-wit...
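The arithmetic behind the 4-bit and 3-bit figures in this comment can be roughed out as follows. This is a back-of-the-envelope sketch, not a measurement: it counts weight storage only, and it assumes nominal parameter counts of 7, 13, and 30 billion. The gap between these numbers and the VRAM figures quoted above would come from runtime overhead such as activations and attention buffers, which the sketch does not model.

```python
def weight_memory_gb(n_params: float, bits_per_weight: int) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


def relative_savings(from_bits: int, to_bits: int) -> float:
    """Fraction of weight memory saved by moving to a narrower bit width."""
    return 1 - to_bits / from_bits


# Nominal parameter counts (assumption: the real models differ slightly).
NOMINAL_PARAMS = {"7B": 7e9, "13B": 13e9, "30B": 30e9}

if __name__ == "__main__":
    for name, n_params in NOMINAL_PARAMS.items():
        four_bit = weight_memory_gb(n_params, 4)
        three_bit = weight_memory_gb(n_params, 3)
        print(f"{name}: {four_bit:.2f} GB at 4-bit, {three_bit:.2f} GB at 3-bit")
    print(f"4-bit to 3-bit savings: {relative_savings(4, 3):.0%}")
```

Under these assumptions, going from 4-bit to 3-bit shrinks the weights by a quarter, which matches the "extra 25% savings" mentioned in the comment.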
What are some alternatives?
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
FlexGen - Running large language models on a single GPU for throughput-oriented scenarios.
langchain - ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain]
whisper.cpp - Port of OpenAI's Whisper model in C/C++
roomGPT - Upload a photo of your room to generate your dream room with AI.
simple-llm-finetuner - Simple UI for LLM Model Finetuning
JARVIS - JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
alpaca-lora - Instruct-tune LLaMA on consumer hardware
ControlNet - Let us control diffusion models!
GPTQ-for-LLaMa - 4 bits quantization of LLaMA using GPTQ
pybroker - Algorithmic Trading in Python with Machine Learning