crawler
gen_retry
crawler | gen_retry | |
---|---|---|
3 | 1 | |
2,472 | 197 | |
0.7% | - | |
6.8 | 0.0 | |
about 2 months ago | about 1 year ago | |
PHP | Elixir | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
crawler
-
How to Build a Robust Web Scraper with Laravel: and Catch 'Em All
The spatie/crawler package is a powerful tool developed by Spatie, a web development agency known for creating high-quality, open-source packages for the Laravel community. This crawler is designed to simplify the process of building web scrapers and bots in PHP, particularly within the Laravel framework. It provides a flexible and easy-to-use interface to crawl websites and extract needed data efficiently. Click me if you love spatie/crawler, give them a star too, they are the reason that this article exists.
-
Can I use Laravel Dusk to automate downloading an HTML version of a website?
You probably want something like this: https://github.com/spatie/crawler
-
How to setup Wordpress to not crash due to 404 DDoS attack?
Alternatively, write your own basic crawler or adapt an existing package like https://github.com/spatie/crawler.
gen_retry
-
Complete, Production-Ready Phoenix Reference Applications
The second option for out-of-band processing would be a Task or if you want retry logic GenRetry. The primary downside here is that task isn't distributed, so if the server that's trying to run or retry this task goes away, there's nothing to pick it back up and try again.
What are some alternatives?
RED_HAWK - All in one tool for Information Gathering, Vulnerability Scanning and Crawling. A must have tool for all penetration testers
oban - 💎 Robust job processing in Elixir, backed by modern PostgreSQL and SQLite3
scrape - Scrape any website, article or RSS/Atom Feed with ease!
memoize - A method caching macro for elixir using CAS on ETS.
Guzzle - Guzzle, an extensible PHP HTTP client
changelog.com - Changelog is news and podcast for developers. This is our open source platform.
aws-elixir - AWS clients for Elixir
Crawler - A high performance web crawler / scraper in Elixir.
plug_wait1 - Plug adapter for the wait1 protocol
deque - Fast bounded deque using two rotating lists.
diskover-community - Diskover Community Edition - Open source file indexer, file search engine and data management and analytics powered by Elasticsearch
coderplanets.com - coderplanets.com API(GraphQL) server, build with elixir, phoenix, absinthe