visual-question-answering

Open-source projects categorized as visual-question-answering

Top 6 visual-question-answering Open-Source Projects

  • BLIP

    PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

  • Project mention: MetaCLIP – Meta AI Research | news.ycombinator.com | 2023-10-26

    I suggest trying BLIP for this. I've had really good results from that.

    https://github.com/salesforce/BLIP

    I built a tiny Python CLI wrapper for it to make it easier to try: https://github.com/simonw/blip-caption

  • OFA

    Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • flamingo-pytorch

    Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

  • UPop

    [ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.

  • Project mention: Show HN: Compress vision-language and unimodal AI models by structured pruning | news.ycombinator.com | 2023-07-31
  • WSDMCup2023

    Toloka Visual Question Answering Challenge at WSDM Cup 2023

  • vqa-plugin

    Perform visual question answering on your images

  • Project mention: Plugin for Building and Managing Plugins! | dev.to | 2024-02-09

    Week 2: ❓Visual Question Answering

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

visual-question-answering related posts

  • Is there a website where you can upload a photo and get the description in a paragraph?

    1 project | /r/AskTechnology | 3 Jul 2023
  • Stable Diffusion v2-1-unCLIP model released

    2 projects | /r/StableDiffusion | 26 Mar 2023
  • GPT-4 shows emergent Theory of Mind on par with an adult. It scored in the 85+ percentile for a lot of major college exams. It can also do taxes and create functional websites from a simple drawing

    1 project | /r/artificial | 15 Mar 2023
  • meme

    3 projects | /r/StableDiffusion | 27 Nov 2022
  • Object Recognition for Photo Metadata

    1 project | news.ycombinator.com | 15 Oct 2022
  • Stable-diffusion in Nix

    8 projects | /r/NixOS | 11 Sep 2022
  • I have a problem with the "interrogate" function of Automatic1111's fork. Can someone help me?

    1 project | /r/StableDiffusion | 12 Sep 2022
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 16 May 2024
    Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source visual-question-answering projects? This list will help you:

Project Stars
1 BLIP 4,302
2 OFA 2,337
3 flamingo-pytorch 1,134
4 UPop 82
5 WSDMCup2023 29
6 vqa-plugin 13

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com