gpt-3
Our great sponsors
gpt-3 | dalle-2-preview | |
---|---|---|
39 | 61 | |
9,406 | 1,049 | |
- | 0.0% | |
3.5 | 1.8 | |
over 3 years ago | almost 2 years ago | |
- | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
gpt-3
-
Can ChatGPT improve my L2 grammar?
Are generative AI models useful for learning a language, and if so which languages? Over 90% of ChatGPT's training data was in English. The remaining 10% of data was split unevenly between 100+ languages. This suggests that the quality of the outputs will vary from language to language.
-
GPT4 Can’t Ace MIT
I have doubts it was extensively trained on German data. Who knows about GPT4, but GPT3 is ~92% of English and ~1.5% of German, which means it saw more "die, motherfucker, die" than on "die Mutter".
(https://github.com/openai/gpt-3/blob/master/dataset_statisti...)
- Necesito ayuda.
-
[R] PaLM 2 Technical Report
Catalan was 0.018 % of GPT-3's training corpus. https://github.com/openai/gpt-3/blob/master/dataset_statistics/languages_by_word_count.csv.
- I'm seriously concerned that if I lost ChatGPT-4 I would be handicapped
- The responses I got from bard after asking why 100 times… he was pissed 😂
-
BharatGPT: India's Own ChatGPT
>Certainly it is pleasing that they are not just doing Hindi, but some of these languages must be represented online by a very small corpus of text indeed. I wonder how effectively an LLM can be trained on such a small training set for any given language?
as long as it's not the main language it doesn't really matter. Besides English(92.6%), the biggest language by representation (word count) is taken up by french at 1.8%. Most of the languages GPT-3 knows are sitting at <0.2% representation.
https://github.com/openai/gpt-3/blob/master/dataset_statisti...
Competence in the main language will bleed into the rest.
- GPT-4 gets a B on Scott Aaronson's quantum computing final exam
-
[D] Dumb question: is GPT3 model open-sourced?
And from skimming their GH page, it seems it'd be costly to host as well
- ChatGPT and the Daily Question Thread, re-evaluated with GPT-4.
dalle-2-preview
-
Microsoft-backed OpenAI to let users customize ChatGPT | Reuters
We believe that many decisions about our defaults and hard bounds should be made collectively, and while practical implementation is a challenge, we aim to include as many perspectives as possible. As a starting point, we’ve sought external input on our technology in the form of red teaming. We also recently began soliciting public input on AI in education (one particularly important context in which our technology is being deployed).
- OpenAI AI not available for Algeria, gotta love Algeria
-
The argument against the use of datasets seems ultimately insincere and pointless
From this OpenAI document:
-
Dalle-2 is > 1,000x as dollar efficient as hiring a human illustrator.
It's also of note that you can't sell a game using this method, as Dalle-2's terms of service prevent use in commercial projects. It's hard to justify rate of return considering you can only ever give it away for free, and even in that case there are some uncertain legal elements regarding copyright and the images that are used to train the dataset.
-
It's pretty obvious where dalle-2 gets some of their training data from! Anyone else had the Getty Images watermark? Prompt was "man in a suit standing in a fountain with his hair on fire."
On their GitHub https://github.com/openai/dalle-2-preview/blob/main/system-card.md I can only see references to v1.
-
“Pinterest” for Dalle-2 images and prompts
"b) Exploration of the bolded part of OpenAI's comment "Each generated image includes a signature in the lower right corner, with the goal of indicating when DALL·E 2 helped generate a certain image." (source)." (source link: https://github.com/openai/dalle-2-preview/blob/main/system-c...)
I feel the DALL-E 2 watermark signature could be a seed or something.
- I’m an outsider to digital art and have a couple questions about A.I created art.
-
The AI Art Apocalypse
DALL-E's docs for example mention it can output whole copyrighted logos and characters[1] and understands it's possible to generate human faces that are bear the likeness of those in the training data. We've also seen people recently critique Stable Diffusion's output for attempting to recreate artists' signatures that came from the commercial trained data.
That said by a certain point the kinks will be ironed out and likely skirt around such issues by only incorporating/manipulating just enough to be considered fair use and creative transformation.
[1] "The model can generate known entities including trademarked logos and copyrighted characters." https://github.com/openai/dalle-2-preview/blob/main/system-c...
- Trabalhei no projeto Dall-e, me pergunte qualquer coisa (AMA)
-
Official Dalle server: Why “furry art” is a banned phrase
Some types of content were purposely excluded from the training dataset(s) (source).
What are some alternatives?
dalle-mini - DALL·E Mini - Generate images from a text prompt
DALL-E - PyTorch package for the discrete VAE used for DALL·E.
DALLE-mtf - Open-AI's DALL-E for large scale training in mesh-tensorflow.
latent-diffusion - High-Resolution Image Synthesis with Latent Diffusion Models
stylegan2-pytorch - Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement
DALLE2-pytorch - Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
v-diffusion-pytorch - v objective diffusion inference code for PyTorch.
disco-diffusion
tensorrtx - Implementation of popular deep learning networks with TensorRT network definition API
glide-text2im - GLIDE: a diffusion-based text-conditional image synthesis model