shifterator
Pattern
shifterator | Pattern | |
---|---|---|
2 | 2 | |
272 | 8,667 | |
- | 0.3% | |
0.0 | 0.0 | |
6 months ago | 10 months ago | |
Python | Python | |
Apache License 2.0 | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
shifterator
-
NLP: How to visualise the main context (in the form of words, sentences etc) of a text document?
This https://github.com/ryanjgallagher/shifterator
- Shifterator: Interpretable data visualizations for word-level differences
Pattern
-
Discussion Thread
if you're curious about the nitty gritty, the parsing module's documentation is well written and doesn't require a comp sci or linguistics degree to get the gist of what's happening.
-
What would an interesting and applicable PhD topic?
Spacy. If you have time, explore nltk (the NLTK book is actually a really good place to start). I'm kind of fond of the https://github.com/clips/pattern -- it doesn't get the appreciation it deserves
What are some alternatives?
scattertext - Beautiful visualizations of how language differs among document types.
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
obsei - Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand image analysis, comparative study and more .
TextBlob - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
ambuda - Main application code for Ambuda, a breakthrough Sanskrit library (ambuda.org)
NLTK - NLTK Source
pyzotero - Pyzotero: a Python client for the Zotero API
textacy - NLP, before and after spaCy
quepy - A python framework to transform natural language questions to queries in a database query language.
pkuseg-python - pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
SnowNLP - Python library for processing Chinese text
Jieba - 结巴中文分词