pos-tagging

Top 12 pos-tagging Open-Source Projects

  • HanLP

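    Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing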

  • wink-nlp

    Developer friendly Natural Language Processing ✨

    Project mention: Show HN: Next-token prediction in JavaScript – build fast LLMs from scratch | news.ycombinator.com | 2024-04-10

    This is awesome, thanks. I've been messing with wink's NLP library (https://winkjs.org/wink-nlp/) to transform user queries and format responses so I can make a proper chat bot - will see what I can learn from these!

  • kagome

    Self-contained Japanese Morphological Analyzer written in pure Go

  • Sudachi

    A Japanese Tokenizer for Business

  • CogCompNLP

    CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.

  • malaya

    Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/

  • camel_tools

    A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

    Project mention: [Arabic>latin transliteration] any apps for this? | /r/translator | 2023-04-30

    Otherwise it depends on your use case. There are NLP libraries like this one that can do the job.

  • jumanpp

    Juman++ (a Morphological Analyzer Toolkit)

  • datalinguist

    Stanford CoreNLP in idiomatic Clojure.

  • gum

    Repository for the Georgetown University Multilayer Corpus (GUM) (by amir-zeldes)

    Project mention: Évariste Galois | news.ycombinator.com | 2023-11-27

    CC BY SA 3.0: https://github.com/amir-zeldes/gum/blob/master/LICENSE.txt

    I didn't know about that project, that's really cool! I'd be curious to know whether the person who devised this scheme was aware of structured meaning representations (UCCA, AMR, ...), and if so, why they chose to create a new meaning representation. Maybe the goals of the project and/or the constraints of Wikidata necessitated this.

    Anyway, GUM (and its sister corpus EWT) does have a lot of parsed permissively-licensed text, so whoever's in charge should definitely consider using them. (Amir, the maintainer, is also super friendly and would respond to an email.)

  • NaiPosTagger

    A part of speech tagger written in PHP.

  • wink-eng-lite-model

    English lite language model for wink-nlp.

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source pos-tagging projects? This list will help you:

#   Project              Stars
1   HanLP                32,304
2   wink-nlp             1,143
3   kagome               789
4   Sudachi              741
5   CogCompNLP           469
6   malaya               456
7   camel_tools          376
8   jumanpp              365
9   datalinguist         112
10  gum                  86
11  NaiPosTagger         14
12  wink-eng-lite-model  10
