Database of 16,000 Artists Used to Train Midjourney AI Goes Viral

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

open_clip

28 8,452 8.2 Jupyter Notebook

An open source implementation of CLIP.

It is a misconception that Adobe's models have not been trained on copyrighted work. Nobody should be repeating their marketing claims.
Adobe has not shown how they train the text encoders in Firefly, or what images were used for the text-based conditioning (i.e. "text to image") part of their image generation model. They are almost certainly using CLIP or T5, which are trained on LAION2b, an image dataset with the very problems they are trying to address, C4 (a text dataset similarly encumbered) and similar.
I welcome anyone who works at Adobe to simply answer this question of how they trained the text encoders for text conditioning and put it to rest. There is absolutely nothing sensitive about the issue, unless it exposes them in a lie.
So no chance. I think it's a big fat lie. They'd have to have made some other scientific breakthrough, which they didn't.
Using information from https://openai.com/research/clip and https://github.com/mlfoundations/open_clip, it's possible to investigate the likelihood that using just their stock image dataset, can they make a working text encoder?
It's certainly not impossible, but it's impracticable. On 248m images (roughly the size of Adobe Stock), CLIP gets 37% on ImageNet, and on the 2000m from LAION, it performs 71-80%. And even with 2000m images, CLIP is substantially worse performing than the approach that Imagen uses for "text comprehension," which relies on essentially many billions more images and text tokens.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Is Nicholas Renotte a good guide for a person who knows nothing about ML?

1 project | /r/learnmachinelearning | 27 Jun 2023
Generate Image from Vector Embedding

1 project | /r/StableDiffusion | 6 Jun 2023
What's up in the Python community? – April 2023

3 projects | news.ycombinator.com | 28 Apr 2023
Low accuracy on my CNN model.

1 project | /r/MLQuestions | 13 Apr 2023
Looking for OpenAI CLIP alternative

1 project | /r/StableDiffusion | 21 Feb 2023

Database of 16,000 Artists Used to Train Midjourney AI Goes Viral

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Deep Learning Pytorch Computer Vision language-model multi-modal-learning
Post date: 7 Jan 2024

open_clip

InfluxDB

Related posts

Is Nicholas Renotte a good guide for a person who knows nothing about ML?

Generate Image from Vector Embedding

What's up in the Python community? – April 2023

Low accuracy on my CNN model.

Looking for OpenAI CLIP alternative

Database of 16,000 Artists Used to Train Midjourney AI Goes Viral

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Deep Learning Pytorch Computer Vision language-model multi-modal-learning Post date: 7 Jan 2024

open_clip

InfluxDB

Related posts

Is Nicholas Renotte a good guide for a person who knows nothing about ML?

Generate Image from Vector Embedding

What's up in the Python community? – April 2023

Low accuracy on my CNN model.

Looking for OpenAI CLIP alternative

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Deep Learning Pytorch Computer Vision language-model multi-modal-learning
Post date: 7 Jan 2024