SpecVQGAN
pollinations
SpecVQGAN | pollinations | |
---|---|---|
2 | 32 | |
318 | 197 | |
- | 5.6% | |
2.2 | 9.2 | |
11 months ago | 11 days ago | |
Jupyter Notebook | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
SpecVQGAN
-
Text-to-Audio Generation Using Instruction Tuned LLM and Latent Diffusion Model
Excellent. Some of the theory here goes back to Oct/2021 and beyond [1].
The riffusion.com [2] guys made this practical. Also, my video of high-level overview and examples [3].
1. SpecVQGAN: https://github.com/v-iashin/SpecVQGAN
2. Riffusion: ://www.riffusion.com/
3. Riffusion high-level overview: https://youtu.be/olkLVGcvib8
- "Taming Visually Guided Sound Generation". Quickly generate audio matching a given video. Code includes a Google Colab.
pollinations
-
Netflix Queen Elizabeth generated by Chat GPT
It literally says pollinations.ai in the bottom right
-
IT CAN MAKE IMAGES
It's not making the images, pollinations.ai is. I have tested it, and if you go to https://image.pollinations.ai/prompt/%7Bdescription%7D, and replace the word 'description' with anything else, it generates a different image.
- I run a free Stable Diffusion bot. I have fun trying to prevent people from overloading it with porn. This time I added (hairy gorilla:1.2) to the prompt when a mature word is detected.
-
Immersive text based adventure prompt to explore the imaginary internet of an alternate universe
I tweaked highly that one markdown prompt and added specific instructions for the fictional content. I also implemented the pollinations.ai prompt inside this one so that it also generates the images that are on the site.
-
Consistent, HIGH QUALITY image generator using pollinations.ai for many use cases (Prompt in comments)
I would not have known about pollinations.ai had I not read the OP!
-
Anime girls go burrrr
I found a couple around the same time. I don't remember which was his but I think the best was https://pollinations.ai/
-
Did you know you can get ChatGPT to generate images with Stable Diffusion?
The link is to the https://pollinations.ai/ API which will generate an image and return it in response to web requests.
- Pollinations.ai
-
I used AI to make Infant Annihilator Album covers
I used Midjourney AI as it's the easiest to set up and the most accurate. However if you want something that doesn't censor the prompts, you can use https://pollinations.ai/
- What AI tools are you using?
What are some alternatives?
poolformer - PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
dalle-playground - A playground to generate images from any text prompt using Stable Diffusion (past: using DALL-E Mini)
vid2cleantxt - Python API & command-line tool to easily transcribe speech-based video files into clean text
tubesync - Syncs YouTube channels and playlists to a locally hosted media server
MoViNet-pytorch - MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition;
stable-diffusion-webui-colab - stable diffusion webui colab
ru-dalle - Generate images from texts. In Russian
dalle-2-preview
awesome-python-applications - 💿 Free software that works great, and also happens to be open-source Python.
CogVideo - Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"
BMT - Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
sysidentpy - A Python Package For System Identification Using NARMAX Models