My Experience with Training Real-Person Models: A Summary

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

fast-stable-diffusion

239 7,316 8.6 Python

fast-stable-diffusion + DreamBooth

Thanks to the development of the community over the past few months, I quickly learned that Dreambooth was a great algorithm (or model) for training faces. I started with https://github.com/TheLastBen/fast-stable-diffusion, the first available library I found on GitHub, but my graphics card was too small and could only train and run on Colab. As expected, it failed miserably, and I wasn't sure why. Now it seems that the captions I wrote were too poor (I'm not very good at English, and I used ChatGPT to write this post), and I didn't know what to upload for the regularized image.

Dreambooth-Stable-Diffusion

100 3,166 6.8 Jupyter Notebook

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles. (by JoePenna)

I quickly turned to the second library, https://github.com/JoePenna/Dreambooth-Stable-Diffusion, because its readme was very encouraging, and its results were the best. Unfortunately, to use it on Colab, you need to sign up for Colab Pro to use advanced GPUs (at least 24GB of VRAM), and training a model requires at least 14 compute units. As a poor Chinese person, I could only buy Colab Pro from a proxy. The results from JoePenna/Dreambooth-Stable-Diffusion were fantastic, and the preparation was straightforward, requiring only <=20 512*512 photos without writing captions. I used it to create many beautiful photos.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
EveryDream2trainer

48 746 9.2 Python

Then I started thinking, was there a better way? So I searched on Google for a long time, read many posts, and learned that only text reversal, Dreambooth, and EveryDream had good results on real people, but Lora didn't work. Then I tried Dreambooth again, but it was always a disaster, always! I followed the instructions carefully, but it just didn't work for me, so I had to give up. Then I turned to EveryDream2.0 https://github.com/victorchall/EveryDream2trainer, which actually worked reasonably well, but...there was a high probability of showing my front teeth with an open mouth.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Construyendo un asistente genAI de WhatsApp con Amazon Bedrock

2 projects | dev.to | 4 May 2024
Thorn in a HaizeStack Long-Context Jailbreak Test

1 project | news.ycombinator.com | 4 May 2024
DialOp: Decision-Oriented Dialogue for Human-AI Collaboration

1 project | news.ycombinator.com | 4 May 2024
Should I add CLA to my Open-source project?

2 projects | news.ycombinator.com | 4 May 2024
Text Prompt to SVG

1 project | news.ycombinator.com | 4 May 2024