Stable-Diffusion
audiocraft
Stable-Diffusion | audiocraft | |
---|---|---|
30 | 37 | |
1,760 | 19,746 | |
- | 2.2% | |
9.8 | 8.3 | |
6 days ago | 11 days ago | |
Jupyter Notebook | Python | |
GNU General Public License v3.0 only | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stable-Diffusion
- Scalable Load Balancing Having Cloud GPU Service Salad Tutorial With Whisper Transcriber Gradio APP
- FLaNK AI-April 22, 2024
-
OneTrainer Fine Tuning vs Kohya SS DreamBooth & Huge Research of OneTrainer’s Masked Training
So stay subscribed and open notification bells to not miss : https://www.youtube.com/SECourses
-
Finding Best Training Hyper Parameters / Configuration Is Neither Cheap Nor Easy
You can use A6000 GPU on MassedCompute with our template for only 31 cents per hour. Follow instructions here (still WIP) : https://github.com/FurkanGozukara/Stable-Diffusion/blob/main/Tutorials/OneTrainer-Master-SD-1_5-SDXL-Windows-Cloud-Tutorial.md
-
Compared Effect Of Image Captioning For SDXL Fine-tuning / DreamBooth Training for a Single Person, 10.3 GB VRAM via OneTrainer
The tutorial will be on our channel : https://www.youtube.com/SECourses
-
A New Gold Tutorial For RunPod & Linux Users : How To Use Storage Network Volume In RunPod & Latest Version Of Automatic1111
Patreon exclusive posts index
- SUPIR Full Tutorial + 1 Click 12GB VRAM Windows & RunPod / Linux Installer + Batch Upscale + Comparison With Magnific
-
Beware When Buying M2 NVMe SSDs: Netac NV7000, Kioxia Exceria Plus G2, Kingston and Sandisk Compared
Used Writing Speed & Cache Testing Python Script ⤵️ https://github.com/FurkanGozukara/Stable-Diffusion/blob/main/CustomPythonScripts/gen_file.py
- Viral Paper Tested MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
-
56 Stable Diffusion And Related Generative AI Tutorials Organized List
Our 1,200+ Stars GitHub Stable Diffusion and other tutorials repo ⤵️ https://github.com/FurkanGozukara/Stable-Diffusion
audiocraft
- [N] MusicGen - Meta's response to Google's MusicLM for text-to-music is freely available for non-commercial usage
-
Open Source Libraries
facebookresearch/audiocraft/MUSICGEN: Music Generation
- Audiocraft: a library for audio processing and generation with deep learning.
- Audiocraft is a library for audio processing and generation with deep learning
-
Meta Open Sources AudioCraft: Generative AI for Audio
https://github.com/facebookresearch/audiocraft/blob/main/LIC...
-
This is not an infinite zoom.
I asked Audiocraft to make me a "chill hip hop beat", I used framesync.xyz to make keyframes for A1111 Deforum extension. Unfortunately, I don't have the settings file anymore, but it was pretty much just a 26s clip at 15fps (440 frames) with a single prompt "a surreal painting by Magritte" and the usual negative prompt magic voodoo. Then, for every clip I used the last frame of the previous clip as init frame. I render at 512x512 and then use ESRGAN4x to upscale to 2048x2048
-
[Frostveil Series] A monk channeling its inner Ønd
However, the music was 100% AI-generated by MusicGen.
Music was entirely generated by AI using MusicGen. Video was generated using PhotoVibrance.
- Try Meta's new MusicGen text-to-audio generator here, free, up to 30 seconds in length. | Text Prompt: Van Halen Style Catchy Electric Guitar Melody Hook for intro of song with distortion
-
I connected my Roland Digital Piano to GPT and MusicGen...
If you want to know more about MusicGen, https://github.com/facebookresearch/audiocraft
What are some alternatives?
sd-dynamic-thresholding - Dynamic Thresholding (CFG Scale Fix) for Stable Diffusion (StableSwarmUI, ComfyUI, and Auto WebUI)
llama - Inference code for Llama models
Fooocus - Focus on prompting and generating
jukebox - Code for the paper "Jukebox: A Generative Model for Music"
multidiffusion-upscaler-for-automatic1111 - Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
audiocraft-infinity-webui
SUPIR - SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild
gpt-producer
caption-upsampling - This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.
tortoise-tts - A multi-voice TTS system trained with an emphasis on quality
CushyStudio - 🛋 The AI and Generative Art platform for everyone
stable-diffusion-webui - Stable Diffusion web UI