AutoGPT vs MiniGPT-4

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters. (by Significant-Gravitas)

AI gpt-4 openai Python Artificial intelligence autonomous-agents

Source Code

agpt.co

Suggest alternative

Edit details

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/) (by Vision-CAIR)

Suggest topics

Source Code

minigpt-4.github.io

Suggest alternative

Edit details

SurveyJS - Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App

With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

surveyjs.io

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

AutoGPT		MiniGPT-4
	Project
180	Mentions	37
161,405	Stars	24,899
0.7%	Growth	0.8%
9.9	Activity	9.1
4 days ago	Latest Commit	9 days ago
JavaScript	Language	Python
MIT License	License	BSD 3-clause "New" or "Revised" License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

AutoGPT

Posts with mentions or reviews of AutoGPT. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-06.

Accessible AI for Everyone
1 project | news.ycombinator.com | 8 Jan 2024
AGI has, in some sense, been achieved: Tell me why I am wrong
2 projects | /r/singularity | 6 Dec 2023

Define agency. Does AutoGPT or BabyAGI fit the definition?
The Emergence of Autonomous Agents
2 projects | dev.to | 24 Nov 2023

This leap is evident in projects like BabyAGI and AutoGPT, showcasing how such agents can prioritize and execute tasks based on a pre-defined objective and the results of previous actions, such as sales prospecting or ordering pizza.
An experimental open-source attempt to make GPT-4 autonomous
1 project | news.ycombinator.com | 29 Oct 2023
[Long read] Deep dive into AutoGPT: A comprehensive and in-depth step-by-step guide to how it works
1 project | dev.to | 24 Oct 2023

A system and a user message are constructed from the task given by the user in code and passed to the LLM as input.
1000 Member Celebration and FAQ
1 project | /r/AI_Agents | 22 Oct 2023

A: How much do you know? If you can easily read code (in this example Python, but this will still benefit anyone who can read code), you should check out Auto-GPT. If you are looking to explore different options, check out this doc on AI Agents.
Agents: An Open-source Framework for Autonomous Language Agents - AIWaves Inc 2023
2 projects | /r/LocalLLaMA | 17 Sep 2023

Also I think most agents I have seen have implemented some form of long-short term memory. Why does it say autogpt doesnt support it? https://github.com/Significant-Gravitas/Auto-GPT/tree/master/autogpts/autogpt/autogpt/memory
MetaGPT: The Next Evolution or Just More Hype?
2 projects | /r/ChatGPTPro | 5 Sep 2023

In my newest experiment, I try out MetaGPT, which is supposed to be better than AutoGPT according to MetaGPT's paper.
List of Awesome AI Agents like AutoGPT and BabyAGI / Many open-source Agents with code included!
6 projects | /r/singularity | 12 Aug 2023

In my opinion the most interesting Agents: Auto-GPT Github: https://github.com/Significant-Gravitas/Auto-GPT BabyAGI Github: https://github.com/yoheinakajima/babyagi Voyager Github: https://github.com/MineDojo/Voyager / Paper: https://arxiv.org/abs/2305.16291 I would also add: ChemCrow: Augmenting large-language models with chemistry tools Github: https://github.com/ur-whitelab/chemcrow-public/ Paper: https://arxiv.org/abs/2304.05376
We've released Auto-GPT v0.4.5!
1 project | /r/AutoGPT | 22 Jul 2023

Check out the new Re-Arch README and ARCHITECTURE_NOTES.

MiniGPT-4

Posts with mentions or reviews of MiniGPT-4. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-07-19.

"Building Machines That Learn and Think Like People", 7 Years Later
1 project | news.ycombinator.com | 15 Oct 2023

I just think the tech has been out for so long it's not as big of a deal. Mini-Gpt4 has been out for 6 months! Of course the descriptions aren't exactly gpt-4 grade, but with mistral 7b being used as the language model instead of llama 7b, the reasoning ability will improve noticeably.
[1] https://github.com/Vision-CAIR/MiniGPT-4
Minigpt4 Inference on CPU
2 projects | news.ycombinator.com | 19 Jul 2023
Multimodal LLM for infographics images
1 project | /r/LocalLLaMA | 10 Jul 2023

Isn't there only two open multimodal LLMs, LLaVA and mini-gpt4?
Ai trained on photos
3 projects | /r/LocalLLaMA | 12 Jun 2023

For LLM visual instruction, you can use LLaVA, LaVIN, or MiniGPT-4.
CLIP and DeepDanbooru Alternatives For Prompt Generation [Relevant Self-Promotion]
7 projects | /r/StableDiffusion | 4 Jun 2023
Looking for a pre trained food recognition model
4 projects | /r/LocalLLaMA | 30 May 2023

Please read the rules before posting. If you want a model for visual instruction, use LLaVA, LaVIN, or MiniGPT-4.
Minigpt-4 (Vicuna 13B + images)
1 project | /r/LocalLLaMA | 29 May 2023
Upload a photo of your meal and get roasted by ChatGPT
1 project | /r/ChatGPT | 25 May 2023

So we use MiniGPT-4 for image parsing, and yep it does return a pretty detailed (albeit not always accurate) description of the photo. You can actually play around with it on Huggingface here.

1 project | /r/OpenAI | 24 May 2023

We use MiniGPT-4 first to interpret the image and then pass the results onto GPT-4. Hopefully, once GPT-4 makes its multi-modal functionality available, we can do it all in one request.
Give some love to multi modal models trained on censored llama based models
1 project | /r/LocalLLaMA | 15 May 2023

But I would like to bring up that there are some multi models(llava, miniGPT-4) that are built based on censored llama based models like vicuna. I tried several multi modal models like llava, minigpt4 and blip2. Llava has very good captioning and question answering abilities and it is also much faster than the others(basically real time), though it has some hallucination issue.

What are some alternatives?

When comparing AutoGPT and MiniGPT-4 you can also consider the following projects:

langchain - ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain]

LLaVA - [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

gpt4all - gpt4all: run open-source LLMs anywhere

FastChat - An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

llama.cpp - LLM inference in C/C++

stable-diffusion-webui-wd14-tagger - Labeling extension for Automatic1111's Web UI

Auto-Vicuna

BooruDatasetTagManager

JARVIS - JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

bark - 🔊 Text-Prompted Generative Audio Model

SuperAGI - <⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

mini-agi - MiniAGI is a simple general-purpose autonomous agent based on the OpenAI API.