logparser
scrapydweb
logparser | scrapydweb | |
---|---|---|
2 | 6 | |
1,433 | 3,004 | |
1.7% | - | |
7.5 | 3.6 | |
3 months ago | about 1 month ago | |
Python | Python | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
logparser
-
Log2row: A tool that detects, extracts templates, and structures logs
You use GPT-4 to extract log patterns, does it really need LLM? There are more traditional approach such as https://github.com/logpai/logparser
- A machine learning toolkit for log parsing [ICSE'19, DSN'16]
scrapydweb
-
Best scrapydweb fork
It's seems like there are a lot of more recently updated forks https://github.com/my8100/scrapydweb/network
-
What are your favorite open source scrapy projects?
You also have this as a managment tool https://github.com/my8100/scrapydweb
-
The Complete Scrapyd Guide - Deploy, Schedule & Run Your Scrapy Spiders
There are many different Scrapyd dashboard and admin tools available, from ScrapeOps (Live Demo) to ScrapydWeb, SpiderKeeper, and more.
-
The Complete Guide To ScrapydWeb, Get Setup In 3 Minutes!
ScrapydWeb is the most popular open source Scrapyd admin dashboards. Boasting 2,400 Github stars, ScrapydWeb has been fully embraced by the Scrapy community.
-
Daily Share Price Notifications using Python, SQL and Africas Talking - Part Two
While I am aware that we could use Scrapyd to host your spiders and actually send requests, alongside with ScrapydWeb, I personally prefer to keep my scraper deployment simple, quick, and free. If you are interested in this alternative instead, check out this post written by Harry Wang.
-
Scrapyd + Django in Docker: HTTPConnectionPool (host = '0.0.0.0', port = 6800) error.
If you're looking for an interactive scrapyd webinterface integrated with scrapyd, you can check https://github.com/my8100/scrapydweb. It is rich in features and can save your time in building your own web interface.
What are some alternatives?
loghub - A large collection of system log datasets for AI-driven log analytics [ISSRE'23]
Gerapy - Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js
ADBench - Official Implement of "ADBench: Anomaly Detection Benchmark", NeurIPS 2022.
scrapy-splash - Scrapy+Splash for JavaScript integration
nginx-ui - Nginx UI allows you to access and modify the nginx configurations files without cli.
SpiderKeeper - admin ui for scrapy/open source scrapinghub
zero-log-parser - Decode Zero log files from the mobile application into text files
SquadJS - Squad Server Script Framework
LogParser - A Log Parser, that create structured data from log files.
scrapeops-scrapy-sdk - Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of the box.
pygotham-packaging - Notes from my presentation on Python packaging at PyGotham 2021
scrapy-cloudflare-middleware - A Scrapy middleware to bypass the CloudFlare's anti-bot protection