kohya-trainer
Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning (by Linaqruf)
stable-diffusion-webui-dataset-tag-editor
Extension to edit dataset captions for SD web UI by AUTOMATIC1111 (by toshiaki1729)
Metric | kohya-trainer | stable-diffusion-webui-dataset-tag-editor
---|---|---
Mentions | 36 | 7
Stars | 1,772 | 621
Growth | - | -
Activity | 8.3 | 5.3
Last commit | about 2 months ago | 5 months ago
Language | Jupyter Notebook | Python
License | Apache License 2.0 | MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
kohya-trainer
Posts with mentions or reviews of kohya-trainer.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-08-04.
- Best method for training lora with sdxl
  This longer colab notebook: I did use this one (or one of the slight derivatives of it) and got a safetensors file out of it, but the LoRA didn't work at all. I'd use it and increase its weight, but I would just see no effect.
- Question on SD Finetuning
- Requesting Help: Stable Diffusion with Dreambooth via Automatic1111
  It isn't what you are asking for (sry), but I struggled with this thing for way too long until I found out about the Kohya Trainer (https://github.com/Linaqruf/kohya-trainer). So much easier, with a lot of videos by the various YT folks. Standalone WebUI that just works. Life is good here!
- Do you need a PhD in AI for AI opportunities?
  It seems that he is a Stable Diffusion model creator. In that space, it's less about knowing the code and more about experimenting with what happens during training. The Stable Diffusion community has a repertoire of fine-tuning tools that are accessible to someone who has no idea about the code behind them, no different from using an application like kohya.
- Am I some kind of idiot? I cant for the life of me get Lora training to work on colab or runpod.
  Have you tried out one of the colabs from https://github.com/Linaqruf/kohya-trainer ? The colabs themselves are pretty long, but you just have to read each step and then usually push the button to run that cell, then move on to the next one.
- [Stable Diffusion] Stable Diffusion on Google Colab always freezes!
  https://github.com/linaqruf/kohya-trainer
- Lora training steps with large batch sizes?
  There are a lot of variables that affect what kind of settings to use, but afaik the best way to find the right step count for what you're training is still just to save multiple epochs and then run an x/y/z plot comparison. If you can't do that locally because of your 4 GB card, you could try using LoRA colabs that include inference capabilities.
- Colab Troubles (Addendum)
  You seem to be a little confused. You won't find an ipynb of a model. You would reference a model via a content portal like Hugging Face. If your model is hosted there, you don't have to download it to your computer or gdrive first. You just reference it with the Hugging Face-style reference, i.e. runwayml/stable-diffusion-v1-5. Some colabs will also let you reference a URL to pull down the model. Example: https://github.com/Linaqruf/kohya-trainer/blob/main/kohya-LoRA-dreambooth.ipynb. In that case, you can get the direct URL to a checkpoint, for example at Civitai. If you're decent at messing around with code, you can deconstruct that code block to use in a different colab. As for gdrive, it's only a couple of dollars to get 100 GB.
- PNG info not copied from images generated through Kohya.
- Is Colab going to start banning people who use it for Stable Diffusion????
  Try this colab to train a LoRA; it can also generate images without the UI.
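The "Colab Troubles (Addendum)" reply above distinguishes between a Hugging Face-style reference such as runwayml/stable-diffusion-v1-5 and a direct URL to a checkpoint. The following is a minimal sketch of how a notebook cell might tell those cases apart before deciding how to fetch a model. The function name `classify_model_reference`, the return labels, and the file-extension list are assumptions made for illustration; this is not code from kohya-trainer or any of the colabs mentioned.

```python
import re
from urllib.parse import urlparse

# Hypothetical helper for illustration only; not part of kohya-trainer.
# A hub-style reference looks like "owner/name" with exactly one slash.
_HUB_ID = re.compile(r"^[A-Za-z0-9][\w.-]*/[A-Za-z0-9][\w.-]*$")

# Assumed checkpoint file extensions; extend as needed.
_CHECKPOINT_SUFFIXES = (".ckpt", ".safetensors")


def classify_model_reference(ref: str) -> str:
    """Return "url", "hub_id", or "local_path" for a model reference string."""
    ref = ref.strip()
    parsed = urlparse(ref)
    # A direct download link, e.g. to a checkpoint file.
    if parsed.scheme in ("http", "https") and parsed.netloc:
        return "url"
    # Anything that ends in a checkpoint extension is treated as a file path,
    # so "models/model.ckpt" is not mistaken for an "owner/name" reference.
    if ref.lower().endswith(_CHECKPOINT_SUFFIXES):
        return "local_path"
    if _HUB_ID.match(ref):
        return "hub_id"
    return "local_path"


if __name__ == "__main__":
    for example in (
        "runwayml/stable-diffusion-v1-5",
        "https://example.com/checkpoints/model.safetensors",
        "/content/drive/MyDrive/model.ckpt",
    ):
        print(example, "->", classify_model_reference(example))
```

A hub-style reference can then be handed to whatever loader the notebook uses, while a URL would need a download step first; both of those steps are left out here because they depend on the specific colab.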
stable-diffusion-webui-dataset-tag-editor
Posts with mentions or reviews of stable-diffusion-webui-dataset-tag-editor.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-05-05.
- Using hydrus for managing tags of training data
  There are few tools for mass-tagging data, each with its own problems.
  * stable-diffusion-webui-dataset-tag-editor has good features, but it also has bugs that make it nearly unusable. It is also resource-heavy, as it runs in the webUI with Stable Diffusion, and Stable Diffusion always has models loaded.
  * BooruDatasetTagManager lacks many useful features.
- What program to use for mass editing tags for training images?
  I tried stable-diffusion-webui-dataset-tag-editor, but it has a bug where it gets confused and sometimes swaps tags from one image to another, ruining everything.
- Experiment AI Anime w/ C-Net 1.1 + GroundingDINO + SAM + MFR (workflow)
  Use WD 1.4 tagger (https://github.com/toriato/stable-diffusion-webui-wd14-tagger) to extract prompt words from each frame (threshold 0.65), then use the dataset tag editor (https://github.com/toshiaki1729/stable-diffusion-webui-dataset-tag-editor) for batch editing, mainly:
- Civitai should enforce a replicability check
  If you haven't come across them yet, these two guides (this and this) are good reads, along with this one for info about learning rates. Beyond what those guides cover, two changes gave me a large increase in LoRA quality: better captioning, and resizing all the images to have about the same number of pixels as the resolution being trained.

  For captioning, I have a text file with the types of tags I know I'll have to hit: subject (solo, 1girl, 1boy, those early tags), what kind of framing (portrait, closeup, full body, etc.), where the character is looking (looking up, looking to the side, looking at viewer, etc.), what the perspective of the viewer is (from above, from below, pov, etc.), and common clothing tags for the character. So I have that off to the side, and then I load up this extension for webui. It has a bit of a learning curve, but I point it at the pictures I've gotten, get it to interrogate with all the models it offers except blip, and set the confidence threshold to 0.10 so it's spitting out lots of tags. After it interrogates all the pictures, I use the database feature to remove the duplicate tags, and then I save the database so it creates all the text files. Then I go to "edit caption of selected image" and select an image to caption from the left. At that point, on the right, the top box should be full of tags and the bottom one should be empty. I look at the checklist from my text file and start hitting all the areas I need to, which doesn't take long. Then I look up at the top box and read from left to right, top to bottom, one tag at a time, and if it's a relevant tag, I type it in the bottom box.
- embed txt tags
  I have been using this: https://github.com/toshiaki1729/stable-diffusion-webui-dataset-tag-editor to get tags on some random images (not for a dataset, just for ease of browsing personal photos and such). Unfortunately, it exports a txt file and doesn't know how to do XMP or tag embedding. Does anyone know of a way to embed the exported txt file into the image keywords/categories/whatever the format supports, or a quick way to convert it to an XMP sidecar file? Not necessarily related to AI generation, but it is related to AI usage. Hopefully someone knows the answer or can point me to where to find it.
- Automatic1111 extensions. What're your must-haves?
  Dataset Tag Editor is perfect for editing large datasets and their caption files. It's been around for a couple months and I only found out about it the other day. I could have saved so much time manually editing hundreds of caption files....
- Questions About Improving Embeddings/Hypernetwork Results
  There is one extension I use however: https://github.com/toshiaki1729/stable-diffusion-webui-dataset-tag-editor
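The captioning workflow described in the "Civitai should enforce a replicability check" excerpt above (interrogate with several models at a low confidence threshold, remove duplicate tags, then write one caption per image) can be sketched in plain Python. This is only an illustration of the idea under stated assumptions: the function names `merge_tags` and `to_caption`, the dictionary-of-scores input shape, and the sample scores are all hypothetical, and none of it reflects how the Dataset Tag Editor extension is actually implemented.

```python
from typing import Dict, Iterable, List


def merge_tags(
    interrogator_outputs: Iterable[Dict[str, float]],
    threshold: float = 0.10,
) -> List[str]:
    """Merge tag scores from several interrogators into one deduplicated list.

    Each input dict maps a tag to a confidence score (an assumed format).
    Tags below the threshold are dropped; duplicates are removed while
    keeping the first occurrence.
    """
    seen = set()
    merged: List[str] = []
    for output in interrogator_outputs:
        # Visit each interrogator's tags from most to least confident.
        for tag, score in sorted(output.items(), key=lambda item: -item[1]):
            normalized = tag.strip().lower()
            if score >= threshold and normalized not in seen:
                seen.add(normalized)
                merged.append(normalized)
    return merged


def to_caption(tags: Iterable[str]) -> str:
    """Join tags into a comma-separated caption line for a .txt caption file."""
    return ", ".join(tags)


if __name__ == "__main__":
    # Made-up scores from two hypothetical interrogators for one image.
    outputs = [
        {"1girl": 0.98, "solo": 0.95, "hat": 0.05},
        {"Solo": 0.90, "looking at viewer": 0.40},
    ]
    print(to_caption(merge_tags(outputs)))
```

The low threshold deliberately produces many candidate tags; as the excerpt describes, the manual pass against a checklist is what decides which of them end up in the final caption.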
What are some alternatives?
When comparing kohya-trainer and stable-diffusion-webui-dataset-tag-editor you can also consider the following projects:
lora - Using Low-rank adaptation to quickly fine-tune diffusion models.
BooruDatasetTagManager
sd_dreambooth_extension
sd-webui-additional-networks
stable-diffusion-webui-wd14-tagger - Labeling extension for Automatic1111's Web UI
stable-diffusion-webui-colab - stable diffusion webui colab
stable-diffusion-webui-depthmap-script - High Resolution Depth Maps for Stable Diffusion WebUI
fast-stable-diffusion - fast-stable-diffusion + DreamBooth
stable-diffusion-webui - Stable Diffusion web UI
EveryDream-trainer - General fine tuning for Stable Diffusion
sd-webui-image-sequence-toolkit - Extension for AUTOMATIC1111's WebUI