multimodal VS DallEval

Compare multimodal vs DallEval and see what are their differences.

multimodal

A collection of multimodal datasets, and visual features for VQA and captionning in pytorch. Just run "pip install multimodal" (by cdancette)

DallEval

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023) (by j-min)
Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
multimodal DallEval
1 1
70 133
- -
0.0 3.6
about 2 years ago 5 months ago
Python Jupyter Notebook
MIT License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

multimodal

Posts with mentions or reviews of multimodal. We have used some of these posts to build our list of alternatives and similar projects.

DallEval

Posts with mentions or reviews of DallEval. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-06-21.

What are some alternatives?

When comparing multimodal and DallEval you can also consider the following projects:

math - The MATH Dataset (NeurIPS 2021)

DALL-E - PyTorch package for the discrete VAE used for DALL·E.

LAVIS - LAVIS - A One-stop Library for Language-Vision Intelligence

dalle-mini - DALL·E Mini - Generate images from a text prompt

label-studio - Label Studio is a multi-type data labeling and annotation tool with standardized output format

robo-vln - Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

Awesome-Prompt-Engineering - This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc

ru-dalle - Generate images from texts. In Russian

pytorch-metric-learning - The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

ALPRO - Align and Prompt: Video-and-Language Pre-training with Entity Prompts

conceptual-12m - Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.