New text-to-image network from Google beats DALL-E

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • imagen-pytorch

    Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

  • It seems like lucidrains is currently working on an implementation [1] of it.

    I would love it.

    [1] https://github.com/lucidrains/imagen-pytorch

  • latent-diffusion

    High-Resolution Image Synthesis with Latent Diffusion Models

  • The latent-diffusion [1] model I've been playing with is not terrible at drawing legible text, but it is generally awful at drawing the text you actually want (cf. [2]), and it often draws text when you don't want any.

    [1] https://github.com/CompVis/latent-diffusion.git

  • DALLE2-pytorch

    Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

  • People are working on reproducing the models; see here: https://github.com/lucidrains/DALLE2-pytorch

    It's often not worth decentralizing the computation of the trained model, but it's not hard to get donated cycles, and groups are working on it. Don't fret that Google isn't releasing the API/code. They released the paper, and that's all you need.

  • DALLE-pytorch

    Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

  • majesty-diffusion

    Majesty Diffusion by @Dango233(@Dango233max) and @apolinario (@multimodalart)

  • Check https://github.com/multimodalart/majesty-diffusion

    There is a Google Colab workbook that you can try and run for free :)

    These are the image-text pairs behind it: https://laion.ai/laion-400-open-dataset/

  • text-to-text-transfer-transformer

    Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

  • T5 was open-sourced on release (up to 11B params): https://github.com/google-research/text-to-text-transfer-tra...

    It is also available via Hugging Face transformers.

    Unclear if 11B is the T5-XXL mentioned in the paper, however.

  • dalle-mini

    DALL·E Mini - Generate images from a text prompt

  • Check out the open-source alternative, dalle-mini: https://huggingface.co/spaces/dalle-mini/dalle-mini

  • tortoise-tts

    A multi-voice TTS system trained with an emphasis on quality

  • Some cutting-edge stuff is still being made by talented hackers using nothing but a rig of 8x 3090s: https://github.com/neonbjb/tortoise-tts

    Other funding models are possible as well. In the grand scheme of things, the price of these models is small enough.

  • DeepCreamPy

    deeppomf's DeepCreamPy + some updates (by Deepshift)

  • 1. DeepCreamPy draws over hentai censor bars if you tell it where the bar is: https://github.com/gguilt/DeepCreamPy

    2. hentAI automates the process: https://github.com/natethegreate/hent-AI

    3. [NSFW] You should look at this person on Twitter: https://twitter.com/nate_of_hent_ai

    4. [NSFW] PornHub released vintage porn videos upscaled to 4K with AI a while back. They called it the "Remastured Project": https://www.pornhub.com/art/remastured

    5. [NSFW] This project shows the limit of AI projects without big-tech or corporate support. It generates images of female genitalia that don't exist in the real world. The project is "This Vagina Does Not Exist": https://thisvaginadoesnotexist.com/about.html

  • hent-AI

    Automation of censor bar detection

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.
