w3lib
Python library of web-related functions (by scrapy)
can_ada
Python bindings for Ada, a fast and spec-compliant URL parser. (by TkTech)
Metric | w3lib | can_ada
---|---|---
Mentions | 1 | 2
Stars | 382 | 123
Growth | 0.3% | -
Activity | 6.7 | 6.9
Last commit | about 1 month ago | about 2 months ago
Language | Python | C++
License | BSD 3-clause "New" or "Revised" License | GNU General Public License v3.0 or later
The number of mentions is the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
w3lib
Posts with mentions or reviews of w3lib.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2024-03-16.
- Parsing URLs in Python
A great initiative!
We need a better URL parser in Scrapy, for similar reasons. Speed and WHATWG standard compliance (i.e. do the same as web browsers) are the main things.
It's possible to get closer to WHATWG behavior by using urllib and some hacks. This is what https://github.com/scrapy/w3lib does, which Scrapy currently uses. But it's still not quite compliant.
Also, surprisingly, on some crawls URL parsing can take CPU amounts similar to HTML parsing.
Ada / can_ada look very promising!
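To make the compliance gap in the comment above concrete, here is a minimal standard-library sketch. It shows what `urllib.parse` returns for a mixed-case URL with a `..` segment, then applies the kind of post-processing "hack" the comment alludes to. The helpers `remove_dot_segments` and `normalize_like_browser` are hypothetical names written for this illustration; they are not w3lib's API and cover only a small part of WHATWG behavior (userinfo, IDNA, and percent-encoding are ignored).

```python
from urllib.parse import urlsplit, urlunsplit

RAW = "HTTP://Example.COM/a/../b"


def remove_dot_segments(path: str) -> str:
    """Resolve '.' and '..' path segments (hypothetical helper)."""
    segments = path.split("/")
    out = []
    for seg in segments:
        if seg == "..":
            # Never pop the leading empty segment that stands for the root.
            if len(out) > 1:
                out.pop()
        elif seg != ".":
            out.append(seg)
    # A trailing '.' or '..' leaves a directory, so keep the trailing slash.
    if segments[-1] in (".", ".."):
        out.append("")
    return "/".join(out)


def normalize_like_browser(url: str) -> str:
    """Rough approximation of browser-style normalization (assumption:
    an http(s)-style URL without userinfo)."""
    parts = urlsplit(url)
    host = parts.hostname or ""  # .hostname is already lowercased by urllib
    netloc = host if parts.port is None else f"{host}:{parts.port}"
    path = remove_dot_segments(parts.path) or "/"
    return urlunsplit((parts.scheme, netloc, path, parts.query, parts.fragment))


# urllib lowercases the scheme but keeps the host case and the '..' segment.
print(urlsplit(RAW).geturl())       # http://Example.COM/a/../b
# After the extra normalization step, the result matches what a browser shows.
print(normalize_like_browser(RAW))  # http://example.com/b
```

Each such fix-up adds Python-level work per URL, which is one plausible reason URL handling can show up in crawl profiles.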
can_ada
Posts with mentions or reviews of can_ada.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2024-03-16.
- Parsing URLs in Python
I apologize for the misjudgment. I just followed the link to can_ada and saw really minimal tests, e.g. https://github.com/TkTech/can_ada/blob/main/tests/test_parsi...
I didn't understand that can_ada is not where the parser is developed.