Useful utilities that will help when trying to make stuff

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

img-txt_viewer

4 33 9.3 Python

Display an image and text file side-by-side for easy manual caption editing.

I've included a portable exe file in the release section that should fix that that sort of issue. Direct link

rembg

52 14,437 8.2 Python

Rembg is a tool to remove images background

4.) rembg -- backgrounds be gone. If there isn't an extension in Automatic1111 for this yet, there should be (I haven't checked recently). Same deal as midas, you can point it at a folder and zap the backgrounds off of all your images. Useful in combo with midas and imagemagick if, for instance, you want images of an object on a white background (Stable Diffusion training via LoRAs/Dreambooth may not benefit, but other things like GANs prefer that sort of training image). Useful if you want to "compose" a scene and you have images of dispirate objects/people you want in that scene.

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
FFmpeg

485 42,374 10.0 C

Mirror of https://git.ffmpeg.org/ffmpeg.git

1.) ffmpeg -- this is the bee's knees for video encoding/decoding, turning a bunch of pictures into a video, resizing, format conversion, going video -> image sequence, etc. It has hardware acceleration via Nvidia GPUs built-in, too, so you can use your video card for speeding up a lot of stuff like encoding.

DIS

12 1,965 4.8 Jupyter Notebook

This is the repo for our new project Highly Accurate Dichotomous Image Segmentation

Rembg only adds to my frustration. It's a half-assed solution providing sub-par results and acting as a band-aid for the fundamental absence of any transparency handling in diffusion models. DIS is better, but it's a pain to set up and still often needs retouching. Trying to remove backgrounds post-generation will always be an uphill battle because the information just isn't there.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project