My Experience with Training Real-Person Models: A Summary

This page summarizes the projects mentioned and recommended in the original post on /r/StableDiffusion

Judoscale - Save 47% on cloud hosting with autoscaling that just works
Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
judoscale.com
featured
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai
featured
  1. fast-stable-diffusion

    fast-stable-diffusion + DreamBooth

    Thanks to the development of the community over the past few months, I quickly learned that Dreambooth was a great algorithm (or model) for training faces. I started with https://github.com/TheLastBen/fast-stable-diffusion, the first available library I found on GitHub, but my graphics card was too small and could only train and run on Colab. As expected, it failed miserably, and I wasn't sure why. Now it seems that the captions I wrote were too poor (I'm not very good at English, and I used ChatGPT to write this post), and I didn't know what to upload for the regularized image.

  2. Judoscale

    Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.

    Judoscale logo
  3. Dreambooth-Stable-Diffusion

    Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles. (by JoePenna)

    I quickly turned to the second library, https://github.com/JoePenna/Dreambooth-Stable-Diffusion, because its readme was very encouraging, and its results were the best. Unfortunately, to use it on Colab, you need to sign up for Colab Pro to use advanced GPUs (at least 24GB of VRAM), and training a model requires at least 14 compute units. As a poor Chinese person, I could only buy Colab Pro from a proxy. The results from JoePenna/Dreambooth-Stable-Diffusion were fantastic, and the preparation was straightforward, requiring only <=20 512*512 photos without writing captions. I used it to create many beautiful photos.

  4. EveryDream2trainer

    Then I started thinking, was there a better way? So I searched on Google for a long time, read many posts, and learned that only text reversal, Dreambooth, and EveryDream had good results on real people, but Lora didn't work. Then I tried Dreambooth again, but it was always a disaster, always! I followed the instructions carefully, but it just didn't work for me, so I had to give up. Then I turned to EveryDream2.0 https://github.com/victorchall/EveryDream2trainer, which actually worked reasonably well, but...there was a high probability of showing my front teeth with an open mouth.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Python's new t-strings – Dave Peck

    7 projects | news.ycombinator.com | 21 Apr 2025
  • Show HN: Dia, an open-weights TTS model for generating realistic dialogue

    6 projects | news.ycombinator.com | 21 Apr 2025
  • Gemma 3 QAT Models: Bringing AI to Consumer GPUs

    6 projects | news.ycombinator.com | 20 Apr 2025
  • 11 Practical Ways I Use AI Agents Without Losing My Authenticity

    7 projects | dev.to | 21 Apr 2025
  • How to Install and Use Instagram OSINT Tool on Kali Linux

    1 project | dev.to | 21 Apr 2025

Did you know that Python is
the 2nd most popular programming language
based on number of references?