polyglot
Pattern

polyglot | Pattern | |
---|---|---|
1 | 3 | |
2,321 | 8,766 | |
0.3% | 0.0% | |
0.0 | 0.0 | |
over 1 year ago | 8 months ago | |
Python | Python | |
GNU General Public License v3.0 only | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
polyglot
Pattern
-
Show HN: A tool to analyze Hacker News sentiment on any term in seconds
There’s some old work [1] that conceptualized sentiment as an interplay between subjectivity and sentiment. The more subjective a statement, the more “range” sentiment gets. I think this is what you are getting at.
I don’t think it ever gained traction, probably because people aren’t interested in creating an actual theory of sentiment that matches the real world.
[1]: https://github.com/clips/pattern/wiki/pattern-en#sentiment
-
Discussion Thread
if you're curious about the nitty gritty, the parsing module's documentation is well written and doesn't require a comp sci or linguistics degree to get the gist of what's happening.
-
What would an interesting and applicable PhD topic?
Spacy. If you have time, explore nltk (the NLTK book is actually a really good place to start). I'm kind of fond of the https://github.com/clips/pattern -- it doesn't get the appreciation it deserves
What are some alternatives?
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
NLTK - NLTK Source
TextBlob - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
langid.py - Stand-alone language identification system
quepy - A python framework to transform natural language questions to queries in a database query language.
Stanza - Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
SnowNLP - Python library for processing Chinese text
Jieba - 结巴中文分词
textacy - NLP, before and after spaCy
