DALLE2-pytorch
DALLE2-pytorch | dalle-2-preview | |
---|---|---|
65 | 61 | |
11,205 | 1,044 | |
- | 0.0% | |
4.3 | 1.8 | |
8 months ago | over 2 years ago | |
Python | ||
MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DALLE2-pytorch
-
One year ago I got access to closed beta DALL-E 2.
I was showing people Dalle2 last year and telling them how much of an impact an open source solution was going to have on, well, everything to do with art and design. (At the time Stable Diffusion had not released, not even the leak, and all hopes was on https://github.com/lucidrains/DALLE2-pytorch)
- [Machinelearning] [D] Quelqu'un travaille-t-il sur l'open-sourcing de Dall-E 2 ?
-
AMA (Emad here hello)
Stable diffusion is the model, MJ will use a variant and DALL-E is the old version (we have our own implementation from our distinguished fellow Lucidrains here: https://github.com/lucidrains/DALLE2-pytorch)
-
An impressionist painting of an floating raccoon god, 4k, digital painting, trending on artstation
Sadly I don't think so. From what I understand the architecture is fixed to 1024x1024 pictures.
- I asked AI to turn P&R characters into muppets..
-
Comparison of AI text-to-image generators
The code is open source, the model is not I believe. https://github.com/lucidrains/DALLE2-pytorch
- Protests erupt outside of DALL-E offices after pricing implementation, press photograph
-
$15 for 115 “generation increments” Very expensive Beta pricing announcement. Dissapointed
Phil Wang has been fairly prolific at creating open source implementations of these text to image models. For example, here is the dalle-2 repo https://github.com/lucidrains/DALLE2-pytorch
-
DALL·E Now Available in Beta
There's already an open-source implementation of DALL-E 2 (https://github.com/lucidrains/DALLE2-pytorch) and a pretrained model for it should be released within this year.
Also true for Google's Imagen, which should be even better than DALLE-2 (and faster) https://github.com/lucidrains/imagen-pytorch.
This is possible because the original research papers behind both DALLE-2 and Imagen were publicly released.
-
would love to know what portion of this prompt is not allowed
The paper describing the model is public and has been implemented here, but that's not the hard part. The model likely requires months of compute and dozens of gigabytes of VRAM to train and run, likely costing several hundred thousand dollars.
dalle-2-preview
-
Microsoft-backed OpenAI to let users customize ChatGPT | Reuters
We believe that many decisions about our defaults and hard bounds should be made collectively, and while practical implementation is a challenge, we aim to include as many perspectives as possible. As a starting point, we’ve sought external input on our technology in the form of red teaming. We also recently began soliciting public input on AI in education (one particularly important context in which our technology is being deployed).
- OpenAI AI not available for Algeria, gotta love Algeria
-
The argument against the use of datasets seems ultimately insincere and pointless
From this OpenAI document:
-
Dalle-2 is > 1,000x as dollar efficient as hiring a human illustrator.
It's also of note that you can't sell a game using this method, as Dalle-2's terms of service prevent use in commercial projects. It's hard to justify rate of return considering you can only ever give it away for free, and even in that case there are some uncertain legal elements regarding copyright and the images that are used to train the dataset.
-
It's pretty obvious where dalle-2 gets some of their training data from! Anyone else had the Getty Images watermark? Prompt was "man in a suit standing in a fountain with his hair on fire."
On their GitHub https://github.com/openai/dalle-2-preview/blob/main/system-card.md I can only see references to v1.
-
“Pinterest” for Dalle-2 images and prompts
"b) Exploration of the bolded part of OpenAI's comment "Each generated image includes a signature in the lower right corner, with the goal of indicating when DALL·E 2 helped generate a certain image." (source)." (source link: https://github.com/openai/dalle-2-preview/blob/main/system-c...)
I feel the DALL-E 2 watermark signature could be a seed or something.
- I’m an outsider to digital art and have a couple questions about A.I created art.
-
The AI Art Apocalypse
DALL-E's docs for example mention it can output whole copyrighted logos and characters[1] and understands it's possible to generate human faces that are bear the likeness of those in the training data. We've also seen people recently critique Stable Diffusion's output for attempting to recreate artists' signatures that came from the commercial trained data.
That said by a certain point the kinks will be ironed out and likely skirt around such issues by only incorporating/manipulating just enough to be considered fair use and creative transformation.
[1] "The model can generate known entities including trademarked logos and copyrighted characters." https://github.com/openai/dalle-2-preview/blob/main/system-c...
- Trabalhei no projeto Dall-e, me pergunte qualquer coisa (AMA)
-
Official Dalle server: Why “furry art” is a banned phrase
Some types of content were purposely excluded from the training dataset(s) (source).
What are some alternatives?
dalle-mini - DALL·E Mini - Generate images from a text prompt
disco-diffusion
DALL-E - PyTorch package for the discrete VAE used for DALL·E.
DALLE-pytorch - Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
latent-diffusion - High-Resolution Image Synthesis with Latent Diffusion Models
glide-text2im - GLIDE: a diffusion-based text-conditional image synthesis model
imagen-pytorch - Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
CLIP - CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image