Ask HN: Any way to ban GPT-4 and others from harvesting my content and data?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • nginx-ultimate-bad-bot-blocker

    Nginx Block Bad Bots, Spam Referrer Blocker, Vulnerability Scanners, User-Agents, Malware, Adware, Ransomware, Malicious Sites, with anti-DDOS, Wordpress Theme Detector Blocking and Fail2Ban Jail for Repeat Offenders

  • Use nginx anti-bot[1] or product such as Fastly or Cloudflare anti-bot feature, which blocks content scrapping and only allows specific bots using rules or AI. Another option is to block VPN, cloud and hosting companies' ASNs. There is no benefit for you allowing someone to scrap your content apart from Bing and Google. The rest of the bots can die, or you will lose your content to scrappers. So finally, put it behind a paywall if possible. Of course, the paywall only works if you have good content or buying ads to promote it. Good luck, mate.

    [1]https://github.com/mitchellkrogza/nginx-ultimate-bad-bot-blo...

  • Use nginx anti-bot[1] or product such as Fastly or Cloudflare anti-bot feature, which blocks content scrapping and only allows specific bots using rules or AI. Another option is to block VPN, cloud and hosting companies' ASNs. There is no benefit for you allowing someone to scrap your content apart from Bing and Google. The rest of the bots can die, or you will lose your content to scrappers. So finally, put it behind a paywall if possible. Of course, the paywall only works if you have good content or buying ads to promote it. Good luck, mate.

    [1]https://github.com/mitchellkrogza/nginx-ultimate-bad-bot-blo...

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts