nlphose
wink-eng-lite-model
nlphose | wink-eng-lite-model | |
---|---|---|
4 | 5 | |
10 | 10 | |
- | - | |
2.7 | 0.0 | |
over 2 years ago | almost 3 years ago | |
Jupyter Notebook | ||
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
nlphose
-
NlphoseBuilder : A tool to create NLP pipelines via drag and drop
The tool generates a nlphose command that can be executed in a docker container to run the pipeline. These pipelines can process streaming text like tweets or static data like files. They can be executed just like normal shell command using nlphose. Let me show you what I mean !
-
Create NLP pipelines with drag and drop
Recently I have started work on query builder GUI for my open source project nlphose.
- nlphose is a collection of command line utilities, which can be piped together to create complex NLP pipelines for processing stream of tweets (or any other textual data). Currently supports sentiment analysis, 0-shot classification, Q&A, NER, Chunking.
- nlphose : A collection of utilities, which can be piped together to create complex NLP pipelines for processing tweets (and other data); inspired by the “Unix tools philosophy”. Currently supports sentiment analysis, question answering , zero-shot classification, language detection, NER, chunking
wink-eng-lite-model
- SuperCharge Input Field for a Dictionary Website
-
How to run NLP on a PDF file?
winkNLP’s English language lite model uses a pre-trained state machine to recognize named entities.
-
How to tokenize a string?
To tokenize a string using winkNLP, read the text using readDoc. Then use the tokens method to extract a collection of tokens from the string. Follow this with the out method to get this collection as a JavaScript array. This is how you can tokenize a string:
-
How to do sentiment analysis?
winkNLP's English language lite model uses ML-SentiCon as a base with further training. For emojis it uses the Emoji Sentiment Ranking. Together, they deliver an f-score of about 84.5%.
-
How to find date and time in text?
Raw texts may contain many named entities like time, money, and hashtags. The English language lite model for winkNLP finds entities spanning multiple tokens by employing pre-trained finite state machine.
What are some alternatives?
ABSA-PyTorch - Aspect Based Sentiment Analysis, PyTorch Implementations. 基于方面的情感分析,使用PyTorch实现。
afinn - AFINN sentiment analysis in Python
nlphoseGUI - This tool allows you to create Natural Language Processing pipelines for use with nlphose using a Blockly based GUI editor in any browser. As you create a pipeline it shows you the corresponding nlphose command which will execute the pipeline.
wink-nlp - Developer friendly Natural Language Processing ✨
blockly - The web-based visual programming editor.
malaya - Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/
awesome-sentiment-analysis - Repository with all what is necessary for sentiment analysis and related areas
BERTweet - BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
FinBERT - A Pretrained BERT Model for Financial Communications. https://arxiv.org/abs/2006.08097
trankit - Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
flair - A very simple framework for state-of-the-art Natural Language Processing (NLP)
nlp_compromise - modest natural-language processing