Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Browser-agent Alternatives
Similar projects and alternatives to browser-agent
-
img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
browser-agent reviews and mentions
- Browser AI Agent
-
An AI Scraping Tool Is Overwhelming Websites with Traffic
The established norm is that scrapers have to download robots.txt and support the standard robots.txt features, notably including `Crawl-Delay` which sets a rate limit. This is the established standard by which websites tell scrapers what the rules are for scraping them.
This tool is scraping sites, it has webmasters reporting actual disruption, it doesn't have robots.txt support. When people complained (eg in https://github.com/rom1504/img2dataset/issues/48), the author's stance was basically "PRs welcome". It looks like a third party recently contributed a PR to make it respect robots.txt (https://github.com/rom1504/img2dataset/pull/302), albeit without `Crawl-Delay` support, which is not merged yet.
I have seen the same thing with other recent AI tools (eg https://github.com/m1guelpf/browser-agent/issues/2) and I think it's important to defend the robots.txt convention and nip this in the bud. If a bot doesn't make a reasonable effort to respect robots.txt and it causes disruption, it's a denial-of-service attack and should be treated as such. No excuses.
- GitHub - m1guelpf/browser-agent: A browser AI agent, using GPT-4
- A bridge between GPT-4 and a headless Chromium browser
-
GPT-4 Week One. The biggest week in AI history. Here's whats happening
run-wild extends m1guelpf's browser-agent project by allowing gpt4 to alter it's goal. This is very dumb and probably ought not exist, but c'est la vie.
-
A note from our sponsor - InfluxDB
www.influxdata.com | 2 May 2024
Stats
m1guelpf/browser-agent is an open source project licensed under MIT License which is an OSI approved license.
The primary programming language of browser-agent is Rust.
Popular Comparisons
Sponsored