Stable Diffusion Public Release – Stability.ai

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • stable-diffusion

    A latent text-to-image diffusion model

  • I think the answer is yes, but setup is a bit complicated. I would test this myself, but I don't have an NVIDIA card with at least 10GB of VRAM.

    One time:

    1. Have "conda" installed.

    2. Clone https://github.com/CompVis/stable-diffusion

    3. `conda env create -f environment.yaml`

    4. Activate the conda environment with `conda activate ldm`

    5. Download weights from https://huggingface.co/CompVis/stable-diffusion-v-1-4-origin... (requires registration).

    6. `mkdir -p models/ldm/stable-diffusion-v1/`

    7. `ln -s <path/to/model.ckpt> models/ldm/stable-diffusion-v1/model.ckpt` (you can download the other versions of the model, such as v1-1, v1-2, and v1-3, and symlink them instead if you prefer).

    To run:

    1. Activate the conda environment with `conda activate ldm` (unless you are still in a shell where it is already active).

    2. `python scripts/txt2img.py --prompt "a photograph of an astronaut riding a horse" --plms`.

    Also, there is a safety filter in the code that blacks out images flagged as NSFW or otherwise expected to be offensive (presumably also including things like swastikas, gore, etc.). It is trivial to disable by editing the source if you want.
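The symlink step and the run step from the comment above can be sketched in Python. This is a minimal illustration, not part of the CompVis repository: the helper names `link_checkpoint`, `build_txt2img_command`, and `run_txt2img` are hypothetical, and the script path and flags are taken only from the command quoted in the comment. It assumes the `ldm` conda environment is already active, so that `sys.executable` points at that environment's interpreter.

```python
import subprocess
import sys
from pathlib import Path

# Directory and file name the comment above expects the weights under.
CKPT_DIR = Path("models") / "ldm" / "stable-diffusion-v1"
CKPT_NAME = "model.ckpt"


def link_checkpoint(downloaded_ckpt, repo_root="."):
    """Create models/ldm/stable-diffusion-v1/model.ckpt as a symlink (hypothetical helper)."""
    target_dir = Path(repo_root) / CKPT_DIR
    target_dir.mkdir(parents=True, exist_ok=True)  # same effect as `mkdir -p`
    link = target_dir / CKPT_NAME
    # Replace an existing symlink (for example, to switch model versions),
    # but never delete a regular file: symlink_to() raises in that case.
    if link.is_symlink():
        link.unlink()
    link.symlink_to(Path(downloaded_ckpt).resolve())
    return link


def build_txt2img_command(prompt, use_plms=True):
    """Build the argument list for the txt2img invocation quoted above."""
    cmd = [sys.executable, "scripts/txt2img.py", "--prompt", prompt]
    if use_plms:
        cmd.append("--plms")
    return cmd


def run_txt2img(prompt, repo_root="."):
    """Run the script from the repository root; needs the weights and a suitable GPU."""
    return subprocess.run(build_txt2img_command(prompt), cwd=repo_root, check=True)
```

For example, `link_checkpoint("/path/to/downloaded.ckpt")` followed by `run_txt2img("a photograph of an astronaut riding a horse")`, both called from the cloned repository root. Passing the prompt as a separate list element avoids shell quoting problems with prompts that contain quotes or spaces.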

  • stable-diffusion

    Optimized Stable Diffusion modified to run on lower GPU VRAM (by basujindal)

  • There's a fork that's a bit slower but more memory efficient, https://github.com/basujindal/stable-diffusion, and it runs on 6GB of VRAM too.

  • diffusers

    🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

  • Yes, you can: https://github.com/huggingface/diffusers/releases/tag/v0.2.3 (probably the easiest way)
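A minimal sketch of what the diffusers route might look like. Several things here are assumptions rather than facts from the thread: the model id `CompVis/stable-diffusion-v1-4`, the need for a Hugging Face access token (`use_auth_token=True`), and the shape of the pipeline output. In the 0.2.x releases the call returned a dict with a `"sample"` key, while later releases expose an `.images` attribute, so the hypothetical helper `first_image` accepts either form. The import is deferred so the module loads even where diffusers is not installed.

```python
def load_pipeline(model_id="CompVis/stable-diffusion-v1-4", device="cuda"):
    """Load the pipeline; assumes diffusers is installed and the license was accepted."""
    from diffusers import StableDiffusionPipeline  # deferred: heavy optional dependency

    pipe = StableDiffusionPipeline.from_pretrained(model_id, use_auth_token=True)
    return pipe.to(device)


def first_image(result):
    """Return the first generated image from either output style."""
    if hasattr(result, "images"):   # newer releases: output object with .images
        return result.images[0]
    return result["sample"][0]      # 0.2.x releases: dict with a "sample" list


def generate(pipe, prompt):
    """Run one prompt through an already loaded pipeline."""
    return first_image(pipe(prompt))
```

Typical use would be `image = generate(load_pipeline(), "a photograph of an astronaut riding a horse")` followed by `image.save("astronaut.png")`, since the pipeline returns PIL images. Newer diffusers releases have changed some argument names, so check the documentation for the version you install.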

  • invisible-watermark

    python library for invisible image watermark (blind image watermark)

  • > How could anyone reliably, conclusively determine that an image was generated using a locally-run tool?

    https://github.com/ShieldMnt/invisible-watermark

    (still ways around that, though)
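As a rough sketch of how the linked library is used, based on the usage shown in its README: `WatermarkEncoder` embeds a byte string into a BGR image array, and `WatermarkDecoder` recovers it, but only if the decoder is told the payload length in bits up front. The helper names below and the `dwtDct` default are illustrative choices, and the images are assumed to be NumPy arrays such as those returned by `cv2.imread`. Imports are deferred so the length helper works without the library installed.

```python
def watermark_bit_length(text):
    """Number of bits the decoder must be told to expect for this payload."""
    return len(text.encode("utf-8")) * 8


def embed_watermark(bgr_image, text, method="dwtDct"):
    """Return a copy of the image with `text` embedded as an invisible watermark."""
    from imwatermark import WatermarkEncoder  # pip package: invisible-watermark

    encoder = WatermarkEncoder()
    encoder.set_watermark("bytes", text.encode("utf-8"))
    return encoder.encode(bgr_image, method)


def read_watermark(bgr_image, n_bits, method="dwtDct"):
    """Try to recover an n_bits-long payload; the result is garbage if none is present."""
    from imwatermark import WatermarkDecoder

    decoder = WatermarkDecoder("bytes", n_bits)
    payload = decoder.decode(bgr_image, method)
    return payload.decode("utf-8", errors="replace")
```

A round trip would be `read_watermark(embed_watermark(img, "test"), watermark_bit_length("test"))`. Because detection depends on knowing the payload length and on the pixel data surviving intact, this is a hint rather than proof of origin, which matches the caveat in the comment above.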

  • CogVideo

    Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"

  • Check out https://github.com/THUDM/CogVideo - progress is being made on coherent video generation.

    Characters and dialogue are effectively solved; just look at GPT-3.

    The entity behind Stable Diffusion is also supporting generative music, so let's see what comes out of that: https://www.harmonai.org/

    We are currently far away from generating a production-quality movie with AI, but I don't think it's going to be nearly as long as a lifetime. In my opinion, we'll have high-quality AI shorts within the decade.

  • stable-diffusion-ui

    Discontinued. Easiest 1-click way to install and use Stable Diffusion on your computer. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image. [Moved to: https://github.com/easydiffusion/easydiffusion]

  • You can try https://github.com/cmdr2/stable-diffusion-ui. It installs Stable Diffusion on your local computer and provides a simple browser-based UI for playing with it. No need to mess with conda and other environment settings.
