CogView VS DALLE-mtf

Compare CogView vs DALLE-mtf and see what are their differences.

CogView

Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers". (by THUDM)
Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
CogView DALLE-mtf
16 41
1,593 435
1.8% 0.0%
4.2 0.0
7 months ago about 2 years ago
Python Python
Apache License 2.0 MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

CogView

Posts with mentions or reviews of CogView. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-06-18.

DALLE-mtf

Posts with mentions or reviews of DALLE-mtf. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-19.

What are some alternatives?

When comparing CogView and DALLE-mtf you can also consider the following projects:

SwinIR - SwinIR: Image Restoration Using Swin Transformer (official repository)

VQGAN-CLIP - Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

CogView2 - official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"

CLIP-Guided-Diffusion - Just playing with getting CLIP Guided Diffusion running locally, rather than having to use colab.

storyteller - Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech

dalle-mini - DALL·E Mini - Generate images from a text prompt

DialogRPT - EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"

big-sleep - A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun

gpt-3 - GPT-3: Language Models are Few-Shot Learners

DALLE-pytorch - Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

dalle-2-preview

MultiModalStory-demo - FairyTailor: Multimodal Generative Framework for Storytelling