Useful utilities that will help when trying to make stuff

This page summarizes the projects mentioned and recommended in the original post on /r/StableDiffusion

  • img-txt_viewer

    Display an image and text file side-by-side for easy manual caption editing.

  • I've included a portable exe file in the release section that should fix that sort of issue.

  • rembg

    Rembg is a tool to remove image backgrounds.

  • 4.) rembg -- backgrounds be gone. If there isn't an extension in Automatic1111 for this yet, there should be (I haven't checked recently). Same deal as midas, you can point it at a folder and zap the backgrounds off of all your images. Useful in combo with midas and imagemagick if, for instance, you want images of an object on a white background (Stable Diffusion training via LoRAs/Dreambooth may not benefit, but other things like GANs prefer that sort of training image). Also useful if you want to "compose" a scene and you have images of disparate objects/people you want in that scene.

  • FFmpeg

    Mirror of https://git.ffmpeg.org/ffmpeg.git

  • 1.) ffmpeg -- this is the bee's knees for video encoding/decoding, turning a bunch of pictures into a video, resizing, format conversion, going video -> image sequence, etc. It has hardware acceleration via Nvidia GPUs built-in, too, so you can use your video card for speeding up a lot of stuff like encoding.

  • DIS

    This is the repo for our new project Highly Accurate Dichotomous Image Segmentation

  • Rembg only adds to my frustration. It's a half-assed solution providing sub-par results and acting as a band-aid for the fundamental absence of any transparency handling in diffusion models. DIS is better, but it's a pain to set up and still often needs retouching. Trying to remove backgrounds post-generation will always be an uphill battle because the information just isn't there.
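
The ffmpeg and rembg workflows described in the comments above can be sketched as a few small Python helpers that only build the command lines. This is a rough sketch, not anything from the original post: the function names, file names, and the frames/%05d.png numbering pattern are made up for illustration, and it assumes the ffmpeg, rembg, and ImageMagick 7 (magick) executables are installed and on PATH. On ImageMagick 6 the executable is usually convert instead. Passing codec="h264_nvenc" is one common way to ask for Nvidia hardware encoding, but it only works if your ffmpeg build includes that encoder.

```python
import shutil
import subprocess


def frames_to_video(pattern, out_path, fps=24, codec="libx264"):
    """ffmpeg command: numbered image sequence -> video."""
    return ["ffmpeg", "-framerate", str(fps), "-i", pattern,
            "-c:v", codec, "-pix_fmt", "yuv420p", out_path]


def video_to_frames(video_path, pattern):
    """ffmpeg command: video -> numbered image sequence."""
    return ["ffmpeg", "-i", video_path, pattern]


def remove_backgrounds(src_dir, dst_dir):
    """rembg command: process every image in src_dir into dst_dir."""
    return ["rembg", "p", src_dir, dst_dir]


def flatten_on_white(src_png, dst_png):
    """ImageMagick 7 command: put a transparent PNG on a white background."""
    return ["magick", src_png, "-background", "white", "-flatten", dst_png]


def run(cmd):
    """Run cmd if its executable is on PATH; return True if it exited cleanly."""
    if shutil.which(cmd[0]) is None:
        print("not installed:", cmd[0])
        return False
    return subprocess.run(cmd).returncode == 0


# Hypothetical usage (paths are placeholders):
#   run(video_to_frames("clip.mp4", "frames/%05d.png"))
#   run(remove_backgrounds("frames", "cutouts"))
#   run(flatten_on_white("cutouts/00001.png", "white/00001.png"))
#   run(frames_to_video("frames/%05d.png", "out.mp4", fps=30))
```

Keeping the command construction separate from run() makes it easy to print and check a command before pointing it at a whole folder of images.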
