tokenizer
NLP tokenizers written in Go language (by sugarme)
kagome
Self-contained Japanese Morphological Analyzer written in pure Go (by ikawaha)
Metric | tokenizer | kagome
---|---|---
Mentions | 1 | 1
Stars | 140 | 789
Growth | - | -
Activity | 6.1 | 6.4
Latest commit | 2 months ago | 19 days ago
Language | Go | Go
License | Apache License 2.0 | MIT License
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tokenizer
Posts with mentions or reviews of tokenizer.
We have used some of these posts to build our list of alternatives
and similar projects.
- Golang Modules Tutorial
  The GitHub repository of tokenizer can be checked here.
kagome
Posts with mentions or reviews of kagome.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-03-19.
- How do MeCab, Kuromoji and Kagome (Japanese Text Analyzer) compare; and which dictionary to choose?
  Kagome is a more recently updated library implemented in Golang.
What are some alternatives?
When comparing tokenizer and kagome you can also consider the following projects:
maleeni - A lexer generator for golang
Sudachi - A Japanese Tokenizer for Business
spaGO - Self-contained Machine Learning and Natural Language Processing library in Go
gse - Go efficient multilingual NLP and text segmentation; support English, Chinese, Japanese and others.
sentences - A multilingual command line sentence tokenizer in Golang
gojieba - Golang version of the "Jieba" Chinese word segmenter
prose - A Golang library for text processing, including tokenization, part-of-speech tagging, and named-entity extraction.
go-i18n - Translate your Go program into multiple languages.
getlang - Natural language detection package in pure Go
whatlanggo - Natural language detection library for Go
gounidecode - Unicode transliterator for #golang