DALLE-mtf vs gpt-3

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

DALLE-mtf		gpt-3
	Project
41	Mentions	41
435	Stars	9,406
0.0%	Growth	-
0.0	Activity	3.5
about 2 years ago	Latest Commit	over 3 years ago
Python	Language
MIT License	License	-

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

DALLE-mtf

Posts with mentions or reviews of DALLE-mtf. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-19.

How Open is Generative AI? Part 2
8 projects | dev.to | 19 Dec 2023

This vision is in line with EleutherAI, a non-profit organization founded in July 2020 by a group of researchers. Driven by the perceived opacity and the challenge of reproducibility in AI, their goal was to create leading open-source language models.
The open source learning curve for AI researchers
1 project | news.ycombinator.com | 20 Jul 2023
EleutherAI: Empowering Open-Source Artificial Intelligence Research
1 project | news.ycombinator.com | 11 Jul 2023
Seeking advice on fine-tuning Pythia for semantic search in a non-English language
1 project | /r/learnmachinelearning | 23 May 2023

My current idea is to utilize the EleutherAI pythia (Databricks Dolly). I would like to know whether translating the Dolly-15k dataset into the desired language using state-of-the-art translation techniques like DeepL would be a viable approach to fine-tune the Pythia base model. I want to use this model for semantic search, so perfection is not a necessity.
Does anyone want to collaborate to make anti-capitalist AI?
1 project | /r/antiwork | 17 May 2023

There are open source AI efforts, like EleutherAI. Needless to say, they are lagging behind big players, but it's better than nothing.
ChatGPT is bonkers.
1 project | /r/Praise_AI_Overlords | 21 Apr 2023

The new GPT 3.5 isn't aware what are GPT-3.5 or davinci-002 (repeatable) and claimed that it was designed by EleutherAI and has only 6 bil parameters (wasn't been able to repeat but didn't really try).
My teacher has falsely accused me of using ChatGPT to use an assignment.
1 project | /r/ChatGPT | 18 Apr 2023

Hi, my name is Stella Biderman and I run EleutherAI, the one of the foremost non-profit research institutes in the world that trains and studies large language models. I have been involved with the majority of models to hold the title “largest open source GPT model in the world” and have dabbled in exploring using plagiarism detection tools to identify code written by GPT-J.
dolly-v2-12b
3 projects | /r/LocalLLM | 13 Apr 2023

dolly-v2-12bis a 12 billion parameter causal language model created by Databricks that is derived from EleutherAI’s Pythia-12b and fine-tuned on a ~15K record instruction corpus generated by Databricks employees and released under a permissive license (CC-BY-SA)
Futurism: "The Company Behind Stable Diffusion Appears to Be At Risk of Going Under"
7 projects | /r/StableDiffusion | 7 Apr 2023

It is true that Emad needs to find an appropriate business model. The good news is that the hype is still undergoing. I'm sure that Emad can grab another round of liquidity injection. He got plenty of resources. Remember he is also from the finance industry. He got https://www.eleuther.ai/ which can supply a secured, in-house custom LLM equivalent to bloombergGPT.
How can AI be used to protect against exploitative use of other AI?
1 project | /r/WorkReform | 1 Apr 2023

By promoting fully open-source AI, i.e. making datasets, models, methodology and codebases freely available and transparent. What OpenAI claimed to be aiming for, basically.

gpt-3

Posts with mentions or reviews of gpt-3. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-29.

GPT4.5 or GPT5 being tested on LMSYS?
3 projects | news.ycombinator.com | 29 Apr 2024

>I wasn't talking about "state of the art LLMs," I am aware that commercial offerings are much better trained in Spanish. This was a thought experiment based on comments from people testing GPT-3.5 with Swahili.
A thought experiment from other people comments on another language. So...No. Fabricating failure modes from their constructed ideas about how LLMs work seems to be a frustratingly common occurrence in these kinds of discussions.
>Frustratingly, just few months ago I read a paper describing how LLMs excessively rely on English-language representations of ideas, but now I can't find it.
Most LLMs are trained on English overwhelmingly. GPT-3 had a 92.6% English dataset. https://github.com/openai/gpt-3/blob/master/dataset_statisti...
That the models are as proficient as they are is evidence enough of knowledge transfer clearly happening. https://arxiv.org/abs/2108.13349. If you trained a model on the Catalan tokens GPT-3 was trained on alone, you'd just get a GPT-2 level gibberish model at best.
anyway. These are some interesting papers
How do languages influence each other? Studying cross-lingual data sharing during LLM fine-tuning - https://arxiv.org/pdf/2305.13286
Teaching Llama a New Language Through Cross-Lingual Knowledge Transfer - https://arxiv.org/abs/2404.04042
Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment - https://arxiv.org/abs/2305.05940
It's not like there is perfect transfer but the idea that there's none at all seemed so ridiculous to me (and why i asked the first question). Models would be utterly useless in multilingual settings if that were really the case.
What are LLMs? An intro into AI, models, tokens, parameters, weights, quantization and more
4 projects | dev.to | 28 Apr 2024

Large models: Everything above 10B of parameters. This is where Llama 3, Llama 2, Mistral 8x22B, GPT 3, and most likely GPT 4 sit.
Can ChatGPT improve my L2 grammar?
1 project | /r/AIinLanguageEducation | 4 Dec 2023

Are generative AI models useful for learning a language, and if so which languages? Over 90% of ChatGPT's training data was in English. The remaining 10% of data was split unevenly between 100+ languages. This suggests that the quality of the outputs will vary from language to language.
GPT4 Can’t Ace MIT
1 project | news.ycombinator.com | 18 Jun 2023

I have doubts it was extensively trained on German data. Who knows about GPT4, but GPT3 is ~92% of English and ~1.5% of German, which means it saw more "die, motherfucker, die" than on "die Mutter".
(https://github.com/openai/gpt-3/blob/master/dataset_statisti...)
Necesito ayuda.
1 project | /r/devsarg | 28 May 2023
[R] PaLM 2 Technical Report
1 project | /r/MachineLearning | 10 May 2023

Catalan was 0.018 % of GPT-3's training corpus. https://github.com/openai/gpt-3/blob/master/dataset_statistics/languages_by_word_count.csv.
I'm seriously concerned that if I lost ChatGPT-4 I would be handicapped
1 project | /r/ChatGPT | 25 Apr 2023
The responses I got from bard after asking why 100 times… he was pissed 😂
1 project | /r/ChatGPT | 15 Apr 2023
BharatGPT: India's Own ChatGPT
1 project | news.ycombinator.com | 13 Apr 2023

>Certainly it is pleasing that they are not just doing Hindi, but some of these languages must be represented online by a very small corpus of text indeed. I wonder how effectively an LLM can be trained on such a small training set for any given language?
as long as it's not the main language it doesn't really matter. Besides English(92.6%), the biggest language by representation (word count) is taken up by french at 1.8%. Most of the languages GPT-3 knows are sitting at <0.2% representation.
https://github.com/openai/gpt-3/blob/master/dataset_statisti...
Competence in the main language will bleed into the rest.
GPT-4 gets a B on Scott Aaronson's quantum computing final exam
1 project | /r/Physics | 12 Apr 2023

What are some alternatives?

When comparing DALLE-mtf and gpt-3 you can also consider the following projects:

VQGAN-CLIP - Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

dalle-mini - DALL·E Mini - Generate images from a text prompt

CLIP-Guided-Diffusion - Just playing with getting CLIP Guided Diffusion running locally, rather than having to use colab.

DALL-E - PyTorch package for the discrete VAE used for DALL·E.

stylegan2-pytorch - Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement

big-sleep - A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun

v-diffusion-pytorch - v objective diffusion inference code for PyTorch.

DALLE-pytorch - Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

dalle-2-preview

tensorrtx - Implementation of popular deep learning networks with TensorRT network definition API