Our great sponsors
-
img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
Lmao, no it doesn't. As we can see, their downloader uses very obscure "no ai" headers (which can be disabled, so its useless). They only claim it respects "robots.txt" because the google crawler respects it, if a site changes their robots.txt rules they don't remove it from their dataset, that is not "respecting". https://github.com/rom1504/img2dataset
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
- AI used photographer’s photos for training, then slapped him with an invoice
- An AI Scraping Tool Is Overwhelming Websites with Traffic
- Please make this tool “opt-in” by default
- Img2dataset: Turns large sets of image URLs to an image dataset
- Stability AI plans to let artists opt out of Stable Diffusion 3 image training