SaaSHub helps you find the best software and product alternatives Learn more →
MM-REACT Alternatives
Similar projects and alternatives to MM-REACT
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
hiertext
The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and paragraph level annotations.
-
viper
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning" (by cvlab-columbia)
MM-REACT reviews and mentions
-
OpenAI CEO Sam Altman: We are not and won't for some time [train GPT-5]
It's possible to take a text only model and ground it with images. examples are blip-2, fromage, prismer, palm-e. now assuming gpt-4 vision isn't just some variant of mm-react(ie what you're describing), that's what's happening here. https://github.com/microsoft/MM-REACT
images can be tokenized. so what happens usually is that extra parameters are added to a frozen model and those parameters are trained on an image embedding to text embedding task.
- MM-ReAct: Prompting ChatGPT for Multimodal Reasoning and Action
- MM-ReAct: Prompting ChatGPT for Multimodal Reasoning and Action (Microsoft Research)
-
Microsoft AI Proposes MM-REACT: A System Paradigm that Combines ChatGPT and Vision Experts for Advanced Multimodal Reasoning and ActionMicrosoft AI Researchers Propose
Quick Read: https://www.marktechpost.com/2023/03/24/microsoft-ai-proposes-mm-react-a-system-paradigm-that-combines-chatgpt-and-vision-experts-for-advanced-multimodal-reasoning-and-actionmicrosoft-ai-researchers-propose/ Paper: https://arxiv.org/abs/2303.11381 Project: https://multimodal-react.github.io/ Github: https://github.com/microsoft/MM-REACT
-
[D] I just realised: GPT-4 with image input can interpret any computer screen, any userinterface and any combination of them.
Now i'm wondering if they're just doing something like this -https://github.com/microsoft/MM-REACT
- MM-React: Multimodal Reasoning and Action with ChatGPT
-
[R] MM-ReAct: Prompting ChatGPT for Multimodal Reasoning and Action
Found relevant code at https://multimodal-react.github.io/ + all code implementations here
- [R] MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
-
A note from our sponsor - SaaSHub
www.saashub.com | 6 May 2024
Stats
microsoft/MM-REACT is an open source project licensed under MIT License which is an OSI approved license.
The primary programming language of MM-REACT is Python.
Popular Comparisons
Sponsored