Open-source rival for OpenAI’s DALL-E runs on your graphics card

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • stable-diffusion

    A latent text-to-image diffusion model

  • My hunch is that this is the result of the following: https://github.com/CompVis/stable-diffusion#weights

    > 515k steps at resolution 512x512 on "laion-improved-aesthetics" (a subset of laion2B-en, filtered to images with an original size >= 512x512, estimated aesthetics score > 5.0

    https://github.com/LAION-AI/laion-datasets/blob/main/laion-a... for more details.

    What's remarkable is this: https://github.com/LAION-AI/laion-datasets/blob/main/laion-a...

    That aesthetic predictor was apparently trained on only 4000 images. If my thinking is correct, imagine the impact those 4000 ratings have had on all of the output of this model.

    You can see samples (some NSFW) of different images from the original training set in different rating buckets here, to get an idea of what was included or not in those training steps. http://3080.rom1504.fr/aesthetic/aesthetic_viz.html

  • laion-datasets

    Description and pointers of laion datasets

  • dalle-mini

    DALL·E Mini - Generate images from a text prompt

  • You can run craiyon / dalle-mini on a card with 8GB of VRAM if you decrease batch size to 1 and skip the CLIP step. Takes about 7 sec to generate an image on a 3070.

    I started with https://github.com/borisdayma/dalle-mini/blob/main/tools/inf... and pared it down.
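
The dataset filter quoted in the stable-diffusion comment above (original size >= 512x512, estimated aesthetics score > 5.0) can be illustrated with a minimal sketch. This is not the LAION or CompVis pipeline: the `ImageRecord` type, its field names, and the sample records are hypothetical, and reading "original size >= 512x512" as "both width and height are at least 512" is an assumption.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class ImageRecord:
    """Hypothetical metadata row; field names are illustrative only."""
    url: str
    width: int               # original image width in pixels
    height: int              # original image height in pixels
    aesthetic_score: float   # score from some aesthetic predictor


def passes_filter(rec: ImageRecord, min_side: int = 512, min_score: float = 5.0) -> bool:
    # Assumption: both sides must be at least `min_side`, and the score
    # must be strictly greater than `min_score`, as in the quoted text.
    big_enough = rec.width >= min_side and rec.height >= min_side
    return big_enough and rec.aesthetic_score > min_score


def filter_records(records: Iterable[ImageRecord]) -> List[ImageRecord]:
    # Keep only the records that satisfy both conditions.
    return [r for r in records if passes_filter(r)]


# Made-up sample data, for illustration only.
sample = [
    ImageRecord("a.jpg", 512, 512, 5.1),    # kept
    ImageRecord("b.jpg", 512, 512, 5.0),    # dropped: score is not > 5.0
    ImageRecord("c.jpg", 511, 800, 6.0),    # dropped: width below 512
    ImageRecord("d.jpg", 1024, 768, 7.2),   # kept
]
kept_urls = [r.url for r in filter_records(sample)]
```

Because the score threshold decides which images are kept, any bias in the predictor that produces `aesthetic_score` carries straight through to the training subset, which is the commenter's point about the 4000 rated images.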
