Goose3 Alternatives
Similar projects and alternatives to Goose3
-
newspaper
newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
-
TWINT
Discontinued An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
python-readability
fast python port of arc90's readability tool, updated to match latest readability.js!
Goose3 reviews and mentions
-
Article Extraction Library for Scraping Text Data (Python)
Goose3 is what I use to scrape financial articles for real-time financial news analysis. It's very good. https://github.com/goose3/goose3
-
Website categorization - use cases, taxonomies, content extraction
There are also many ready made libraries available for content extraction written in python which is more commonly used in data science, e.g. goose3 (https://github.com/goose3/goose3) and newspaper (https://github.com/codelucas/newspaper).
Stats
goose3/goose3 is an open source project licensed under Apache License 2.0 which is an OSI approved license.
The primary programming language of Goose3 is HTML.