hazm
Persian NLP Toolkit (by roshan-research)
Pattern
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization. (by clips)
hazm | Pattern | |
---|---|---|
- | 2 | |
1,134 | 8,693 | |
2.4% | 0.4% | |
8.5 | 0.0 | |
11 days ago | 11 months ago | |
Python | Python | |
MIT License | BSD 3-clause "New" or "Revised" License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
hazm
Posts with mentions or reviews of hazm.
We have used some of these posts to build our list of alternatives
and similar projects.
We haven't tracked posts mentioning hazm yet.
Tracking mentions began in Dec 2020.
Pattern
Posts with mentions or reviews of Pattern.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-08-27.
-
Discussion Thread
if you're curious about the nitty gritty, the parsing module's documentation is well written and doesn't require a comp sci or linguistics degree to get the gist of what's happening.
-
What would an interesting and applicable PhD topic?
Spacy. If you have time, explore nltk (the NLTK book is actually a really good place to start). I'm kind of fond of the https://github.com/clips/pattern -- it doesn't get the appreciation it deserves
What are some alternatives?
When comparing hazm and Pattern you can also consider the following projects:
NLTK - NLTK Source
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
TextBlob - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
Jieba - 结巴中文分词
textacy - NLP, before and after spaCy
simplemma - Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
quepy - A python framework to transform natural language questions to queries in a database query language.
SnowNLP - Python library for processing Chinese text
pkuseg-python - pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation