spidermon
Scrapy Extension for monitoring spiders execution. (by scrapinghub)
open-gov-crawlers
Parse government documents into well formed JSON (by public-law)
Metric | spidermon | open-gov-crawlers
---|---|---
Mentions | 2 | 13
Stars | 510 | 61
Growth | 0.4% | -
Activity | 6.9 | 6.8
Last commit | 10 days ago | 22 days ago
Language | Python | Python
License | BSD 3-clause "New" or "Revised" License | -
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
spidermon
Posts with mentions or reviews of spidermon.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-02-20.
-
Automated testing the scraping output
This is what Spidermon does.
- spidermon: Scrapy Extension for monitoring spiders execution
open-gov-crawlers
Posts with mentions or reviews of open-gov-crawlers.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-09-06.
-
What are the best repos that are a display of clean code and good programming practices that I can learn from?
I get feedback occasionally that this is the cleanest web scraping code someone’s seen: https://github.com/public-law/open-gov-crawlers
-
Sunday Daily Thread: What's everyone working on this week?
Writing more scrapers for legal glossaries of many country governments: adding Australia and the UK: https://github.com/public-law/open-gov-crawlers
-
Just a little custom coding to auto-generate spider info in a repo
Here's the repo's README.md with the table. I made this to help onboarding new open-source developers. Also to help people understand what's there.
-
Why and how to use conda?
I find it very agnostic. I use it for app development, not packages. E.g.: https://github.com/public-law/open-gov-crawlers
-
Wanted: contractor who can complete this HTML scrape
The original PDF: https://github.com/public-law/open-gov-crawlers/blob/rome-statute-english/docs/Rome-Statute.pdf
- Is there anything a webdev can do to help Ukraine right now?
-
I want to make International Law easy to read and search: how many versions of Chinese do I need to publish?
For techies, here's the GitHub repo: https://github.com/public-law/open-gov-crawlers/discussions/70
- Project in support of Ukraine: International Criminal Law parsers/crawlers
- Scrapy project in support of Ukraine: International Criminal Law (war crimes and the crime of aggression)
- New project in support of Ukraine: International Criminal Law parsers/crawlers