SaaSHub helps you find the best software and product alternatives Learn more →
Top 10 Python text-mining Projects
-
trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Project mention: Trafilatura: A tool and library to gather text and metadata on the Web | news.ycombinator.com | 2025-05-28 -
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
-
-
-
-
-
Sevalla
Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
-
llmine_core
Your Platform for Text Mining through Configurable LLM Chains. Ideal for Developers and Semi-Technical Users
-
-
Python text-mining discussion
Python text-mining related posts
-
[Q] Does anyone use R to code qualitative data?
-
Language Input: a new web app for finding content to watch in your target language and keep track of your vocabulary
-
France: starting January 15, the health pass will be invalid "seven months after the last injection" in the absence of a booster dose
-
rake-nltk 1.0.6 released. Comes with the flexibility to choose your own sentence and word tokenizers.
-
rake-nltk 1.0.6 released. Comes with the flexibility to choose your own sentence and word tokenizers.
-
rake-nltk 1.0.6 released. Comes with the flexibility to choose your own sentence and word tokenizers.
-
[N] UK PhD Opportunity: Text mining the impact of SARS-CoV-2 mutations from the research literature at University of Glasgow
-
A note from our sponsor - SaaSHub
www.saashub.com | 1 Sep 2025
Index
What are some of the best open-source text-mining projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | trafilatura | 4,617 |
2 | texthero | 2,905 |
3 | scattertext | 2,311 |
4 | rake-nltk | 1,067 |
5 | huspacy | 171 |
6 | trrex | 145 |
7 | orange3-text | 133 |
8 | llmine_core | 37 |
9 | Answerable | 16 |
10 | corona-ml | 10 |