URLExtract

URLExtract is a Python class for collecting (extracting) URLs from a given text based on locating TLDs. (by lipoja)

URLExtract Alternatives

Similar projects and alternatives to URLExtract based on common topics and language

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better URLExtract alternative or higher similarity.

URLExtract reviews and mentions

Posts with mentions or reviews of URLExtract. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-07-30.
  • Famous HNers and Their Sites
    3 projects | news.ycombinator.com | 30 Jul 2022
    That'd explain some of the holes mentioned in these comments. I think you just want to match any "word" containing ".[valid TLD]" and then exclude invalid URLs ("@" in first part indicating email, etc).

    I've been using this[0] Python library which seemed good enough for my needs in some scraping project.

    0: https://github.com/lipoja/URLExtract
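
The heuristic described in the post above (match any "word" containing a valid TLD, then exclude things like email addresses) can be sketched roughly as follows. This is only an illustration of that idea using the Python standard library, not URLExtract's actual implementation. The function name `find_url_candidates`, the tiny `KNOWN_TLDS` set, and the punctuation list are all assumptions made for the example; a real tool would load the full TLD list.

```python
import re

# Hypothetical, tiny TLD sample for illustration only;
# a real list (e.g. the one published by IANA) is far longer.
KNOWN_TLDS = {"com", "org", "net", "io", "dev"}

# Punctuation that commonly surrounds URLs in running prose (assumed set).
STRIP_CHARS = ".,;:!?()[]{}<>\"'"


def find_url_candidates(text, tlds=KNOWN_TLDS):
    """Return whitespace-delimited words whose host part ends in a known TLD."""
    candidates = []
    for word in text.split():
        word = word.strip(STRIP_CHARS)
        # Drop an optional scheme such as "https://".
        host = re.sub(r"^[a-zA-Z][a-zA-Z0-9+.-]*://", "", word)
        # Keep only the part before any path, query, fragment, or port.
        host = re.split(r"[/?#:]", host, maxsplit=1)[0]
        if "@" in host:
            continue  # "@" in the first part suggests an email address
        labels = host.lower().split(".")
        # Require at least "name.tld", no empty labels, and a known TLD.
        if len(labels) >= 2 and all(labels) and labels[-1] in tlds:
            candidates.append(word)
    return candidates


if __name__ == "__main__":
    sample = ("Mail bob@example.com or see "
              "https://github.com/lipoja/URLExtract, also example.org and file.txt.")
    print(find_url_candidates(sample))
```

With the sample TLD set, `file.txt` is skipped because `txt` is not a listed TLD, and `bob@example.com` is skipped as a likely email address. URLs that carry credentials (`user:pass@host`) are also dropped by this simplified check.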

Stats

Basic URLExtract repo stats
Mentions: 1
Stars: 236
Activity: 5.7
Last commit: 2 months ago

lipoja/URLExtract is an open-source project licensed under the MIT License, which is an OSI-approved license.

The primary programming language of URLExtract is Python.

