Our great sponsors
-
big-sleep
A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Yeah unfortunately OpenAI has only released the weaker resnets and vision transformers they trained.
Some brilliant folks (Ryan Murdock [@advadnoun], Phil Wang [@lucidrains]) have tried to replicate their results with projects like big-sleep [0] with decent results, but even with this improved VAE we're still a ways from DALL-E quality results.
If anyone would like to play with the model check out either the Google Colab [1] (if you wanna run it on Google's cloud) or my site [2] (if you want a simplified UI).
[0]: https://github.com/lucidrains/big-sleep/
[1]: https://colab.research.google.com/drive/1MEWKbm-driRNF8PrU7o...
[2]: https://dank.xyz
The repository linked is just a part of the entire model so it can't be used as is.
That said there is a completely implementation made by lucidrains[1] with some results, the only missing component now is the dataset.
[1]: https://github.com/lucidrains/DALLE-pytorch
Related posts
- Is creating a StableDiffusion-inspired model feasible for my Master's thesis?
- TEDx talk on how to prepare for a career in vfx with the rapid changes caused by AI / machine learning
- Any good ai art websites that work with pokemon?
- Thoughts on AI image generators from text
- Explore generative art with me