imagen-pytorch
holbert
imagen-pytorch | holbert | |
---|---|---|
47 | 3 | |
7,787 | 161 | |
- | - | |
6.8 | 0.0 | |
about 1 month ago | over 1 year ago | |
Python | Haskell | |
MIT License | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
imagen-pytorch
-
Google's StyleDrop can transfer style from a single image
If google doesnt, someone like lucidrains probably would implement it, just like he did for imagen and muse.
- Create a Stable diffusion neural network from scratch.
-
Google just announced an Even better diffusion process.
lucidrains/imagen-pytorch: Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch (github.com)
- Karlo, the first large scale open source DALL-E 2 replication is here
-
training imagen
Hi Can someone guide me a little, as to how i can use LAION dataset to train my imagen model? like how i can download the data, and in which format it should be fed to https://github.com/lucidrains/imagen-pytorch code?
-
If everyone in this sub make a donation of $10 then we can train truly open stable diffusion.
If we were to put money into training something, I'd hope we use a better model, like Imagen.
- AI Content Generation, Part 1: Machine Learning Basics
-
DALL-E 2 is switching to a credits system (50 generations for free at first, 15 free per month)
I've been messing around with this open-source implementation. You can get a pretty good idea of the model size by just copying the parameters from the paper.
-
Protests erupt outside of DALL-E offices after pricing implementation, press photograph
I'm waiting on this implementation/training of imagen: https://github.com/lucidrains/imagen-pytorch
-
Show HN: Food Does Not Exist
I'm honestly surprised that they trained a StyleGAN. Recently, the Imagen architecture has been show to be both easier in structure, easier to train, and even faster to produce good results. Combined with the "Elucidating" paper by NVIDIA's Tero Karras you can train a 256px Imagen to tolerable quality within an hour on a RTX 3090.
Here's a PyTorch implementation by the LAION people:
https://github.com/lucidrains/imagen-pytorch
And here's 2 images I sampled after training it for some hours, like 2 hours base model + 4 hours upscaler:
https://imgur.com/a/46EZsJo
holbert
-
Hacker News top posts: May 26, 2022
Holbert: An Interactive Theorem Prover\ (0 comments)
- Holbert: An Interactive Theorem Prover
-
Formalising Mathematics: An Introduction
> Both tactics and proof terms are already quite old (for CS concepts, that is) and there hasn't been any real competition, so I imagine in the medium-term we'll just see refinements of them.
So it's really more of an issue of presentation? The techniques are fine? (I'm a professional programmer but an amateur logician, I really don't know what the big kids do.)
> I can't imagine anyone wanting to read latex source code over tactics/proof term code. Unless you're talking about rendered latex?
Yeah, you would generally only be looking at LaTeX source to debug your tools.
> But that's not something people can realistically work with.
I don't understand. I rarely work with it, but I was under the impression that it's pretty standard for writing math and science papers? Are there no WYSIWYG tools for working with rendered LaTeX? How do people work with it now, I guess is what I'm asking.
> Graphical proof assistants exist, but nobody uses them.
I just did a quick search and found two but they seem obscure:
https://en.wikipedia.org/wiki/Jape_(software)
https://github.com/liamoc/holbert
I guess the question I have is why does no one use them? Is it just inertia? I mean this is a thread about promoting the use of Lean et. al., so even the non-graphical, well-known tools are still kind of a niche, no?
Are graphical proof assistants only good for students and teaching, not "heavy lifting"?
In any event, I still feel that we can do better on the presentation side of things. (That's not controversial is it? The Lean folks are working on it?) I want to understand what kinds of software would help mathematicians.
> In a sense, tactics are a very weak form of this. Instead of just describing a proof as its structured in the system, they allow a proof author to also describe some of their intent or intuition. It's definitely why some people prefer tactics-based proofs.
That's pretty cool. :)
> I can't even imagine would formalizing something so subjective would even look beyond this. I'm not sure if it's even possible.
The Turing Machine is itself a formalization of a subjective process, eh?
If we get to the point where the machines can "read our minds" then it will be really easy. :) Heck, mathematicians can just watch videos of each other's mental imagery!
In the meantime, externalizing and formalizing these subjective intuitive processes with the machinery we've got seems like a fun and useful challenge, eh?
What are some alternatives?
dalle-mini - DALL·E Mini - Generate images from a text prompt
trepplein - Lean type-checker written in Scala.
DALLE2-pytorch - Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
DALLE-pytorch - Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
latent-diffusion - High-Resolution Image Synthesis with Latent Diffusion Models
DeepCreamPy - deeppomf's DeepCreamPy + some updates
CogVideo - Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"
tortoise-tts - A multi-voice TTS system trained with an emphasis on quality
hent-AI - Automation of censor bar detection
min-dalle - min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch
Waifu2x-Extension-GUI - Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.
stylegan2 - StyleGAN2 - Official TensorFlow Implementation