Our great sponsors
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
I've included a portable exe file in the release section that should fix that that sort of issue. Direct link
4.) rembg -- backgrounds be gone. If there isn't an extension in Automatic1111 for this yet, there should be (I haven't checked recently). Same deal as midas, you can point it at a folder and zap the backgrounds off of all your images. Useful in combo with midas and imagemagick if, for instance, you want images of an object on a white background (Stable Diffusion training via LoRAs/Dreambooth may not benefit, but other things like GANs prefer that sort of training image). Useful if you want to "compose" a scene and you have images of dispirate objects/people you want in that scene.
1.) ffmpeg -- this is the bee's knees for video encoding/decoding, turning a bunch of pictures into a video, resizing, format conversion, going video -> image sequence, etc. It has hardware acceleration via Nvidia GPUs built-in, too, so you can use your video card for speeding up a lot of stuff like encoding.
Rembg only adds to my frustration. It's a half-assed solution providing sub-par results and acting as a band-aid for the fundamental absence of any transparency handling in diffusion models. DIS is better, but it's a pain to set up and still often needs retouching. Trying to remove backgrounds post-generation will always be an uphill battle because the information just isn't there.