Paella
blended-latent-diffusion
Paella | blended-latent-diffusion | |
---|---|---|
6 | 1 | |
729 | 514 | |
- | - | |
4.7 | 4.5 | |
7 months ago | 5 months ago | |
Jupyter Notebook | Jupyter Notebook | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Paella
- Like Diffusion but Faster: The Paella Model for Fast Image Generation
-
Paella: CNN based Fast Text-Conditional Discrete Denoising on Vector-Quantized Latent Spaces
Code at Github: https://github.com/dome272/Paella
- Paella, a novel text-to-image model requiring less than 10 steps to sample high-fidelity images
- Paella: a blazing fast diffuser (even works on CPU only, 8GB system RAM)
-
How large is a machine learning algo?
The "code" to run the training or to generate images is actually very tiny. Paella, for instance is a few hundred lines of code, that's a complete working training and image generation code.
blended-latent-diffusion
-
Blended Latent Diffusion's code has been released.
The research paper https://arxiv.org/abs/2206.02779 has finally released their code https://github.com/omriav/blended-latent-diffusion after I asked about it 2 days ago in an issue.
What are some alternatives?
Wuerstchen - Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models
Kandinsky-2 - Kandinsky 2 — multilingual text2image latent diffusion model
Diffusion-Models-Papers-Survey-Taxonomy - Diffusion model papers, survey, and taxonomy
Rerender_A_Video - [SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
glami-1m - The largest multilingual image-text classification dataset. It contains fashion products.
DiffusionFastForward - DiffusionFastForward: a free course and experimental framework for diffusion-based generative models
MultiDiffusion - Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)
paper-implementations - Attempts to implement various deep learning, computer vision papers.
daam - Diffusion attentive attribution maps for interpreting Stable Diffusion.
min-3-flow - A multistage text to image framework. Built from a inference-reduced set of min-dalle, glid-3-xl, and SwinIR.
WhereIsAI - AI company, product, and tool collection.
Frido - Research code for paper "Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis"