multimodal
DallEval
Our great sponsors
multimodal | DallEval | |
---|---|---|
1 | 1 | |
70 | 133 | |
- | - | |
0.0 | 3.6 | |
about 2 years ago | 5 months ago | |
Python | Jupyter Notebook | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
multimodal
-
[P] multimodal: a library for VQA / vision and language research
Hi everyone, I am currently building a library for vision & language research: https://github.com/cdancette/multimodal
DallEval
-
[N] [D] Openai, who runs DALLE-2 alleged threatened creator of DALLE-Mini
There are also other users of the DALL-E name: Sberbank's ruDALL-E or Kakao Brain's minDALL-E, or how about the benchmark DALL-Eval?
What are some alternatives?
math - The MATH Dataset (NeurIPS 2021)
DALL-E - PyTorch package for the discrete VAE used for DALL·E.
LAVIS - LAVIS - A One-stop Library for Language-Vision Intelligence
dalle-mini - DALL·E Mini - Generate images from a text prompt
label-studio - Label Studio is a multi-type data labeling and annotation tool with standardized output format
robo-vln - Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"
Awesome-Prompt-Engineering - This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
ru-dalle - Generate images from texts. In Russian
pytorch-metric-learning - The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
ALPRO - Align and Prompt: Video-and-Language Pre-training with Entity Prompts
conceptual-12m - Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.