Our great sponsors
-
DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
Waifu2x-Extension-GUI
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.
There's already an open-source implementation of DALL-E 2 (https://github.com/lucidrains/DALLE2-pytorch) and a pretrained model for it should be released within this year.
Also true for Google's Imagen, which should be even better than DALLE-2 (and faster) https://github.com/lucidrains/imagen-pytorch.
This is possible because the original research papers behind both DALLE-2 and Imagen were publicly released.
For those who want to try DALL.E but do not have access yet, this is good play site: https://www.craiyon.com/
Additionally, it's also open-sourced on GitHub and can be self-hosted, with easy instructions to do so: https://github.com/kuprel/min-dalle
> Facebook released over 100 pages of notes a few months ago detailing their training process for a model that is similar in size. Does anyone have a link?
Is https://github.com/facebookresearch/metaseq/blob/main/projec... what you're referring to?
Here are a couple I've used recently:
Majestic diffusion - https://github.com/multimodalart/majesty-diffusion
Centipede diffusion - https://colab.research.google.com/github/Zalring/Centipede_D...
Things have moved on a considerable amount since waifu2x
Try https://github.com/n00mkrad/cupscale
There's already an open-source implementation of DALL-E 2 (https://github.com/lucidrains/DALLE2-pytorch) and a pretrained model for it should be released within this year.
Also true for Google's Imagen, which should be even better than DALLE-2 (and faster) https://github.com/lucidrains/imagen-pytorch.
This is possible because the original research papers behind both DALLE-2 and Imagen were publicly released.
My bad! Yes Waifu2x is just a single algorithm, you are right.
The confusion originates from the fact that I was using a GUI project for Waifu2x called "Waifu2x Extension GUI" (https://github.com/AaronFeng753/Waifu2x-Extension-GUI) which other than Waifu2x also supports other algorithms like Real-ESRGAN, Real-CUGAN, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.
So as you said Cupscale is surely more advanced than Waifu2x (the single algorithm), but do you think it's also better than Waifu2x Extension GUI?
Related posts
- Google's StyleDrop can transfer style from a single image
- One year ago I got access to closed beta DALL-E 2.
- Besides Gaming - for what can be a 4080 useful?
- Is creating a StableDiffusion-inspired model feasible for my Master's thesis?
- TEDx talk on how to prepare for a career in vfx with the rapid changes caused by AI / machine learning