The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 23 named-entity-recognition Open-Source Projects
-
HanLP
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
-
CoreNLP
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
-
Stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
-
NCRFpp
NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
-
Universal Data Tool
Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
-
entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
-
seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
-
open-semantic-search
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
-
nlu
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
-
CogCompNLP
CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Step by step guide to create customized chatbot by using spaCy (Python NLP library) | dev.to | 2024-03-23Hi Community, In this article, I will demonstrate below steps to create your own chatbot by using spaCy (spaCy is an open-source software library for advanced natural language processing, written in the programming languages Python and Cython):
The Stanford CoreNLP Model
Project mention: Spark NLP 5.1.0: Introducing state-of-the-art OpenAI Whisper speech-to-text, OpenAI Embeddings and Completion transformers, MPNet text embeddings, ONNX support for E5 text embeddings, new multi-lingual BART Zero-Shot text classification, and much more! | /r/Python | 2023-09-06
Project mention: Would this method work to increase the memory of the model? Saving summaries generated by a 2nd model and injecting them depending on the current topic. | /r/LocalLLaMA | 2023-06-09
There is of course the list at https://github.com/juand-r/entity-recognition-datasets, but all of the recent English datasets cover other domains of English, such as the music NER, space NER, etc. All interesting things, but not 2020s English newswire.
Project mention: Need a better way to organize documents in massive nas | /r/DataHoarder | 2023-07-06documents, pdf's - https://opensemanticsearch.org
named-entity-recognition related posts
- Recent English newswire NER datasets?
- PIXIU: NEW Data - star count:172.0
- PIXIU: NEW Data - star count:124.0
- PIXIU: NEW Data - star count:124.0
- PIXIU: NEW Data - star count:124.0
- PIXIU: NEW Data - star count:124.0
- Would this method work to increase the memory of the model? Saving summaries generated by a 2nd model and injecting them depending on the current topic.
-
A note from our sponsor - WorkOS
workos.com | 19 Apr 2024
Index
What are some of the best open-source named-entity-recognition projects? This list will help you:
Project | Stars | |
---|---|---|
1 | HanLP | 32,214 |
2 | spaCy | 28,660 |
3 | NLP-progress | 22,290 |
4 | flair | 13,558 |
5 | nlp_compromise | 11,189 |
6 | CoreNLP | 9,451 |
7 | Stanza | 7,043 |
8 | simpletransformers | 3,972 |
9 | spark-nlp | 3,667 |
10 | DeepKE | 2,891 |
11 | NCRFpp | 1,877 |
12 | Universal Data Tool | 1,871 |
13 | entity-recognition-datasets | 1,431 |
14 | BERT-NER | 1,168 |
15 | seqeval | 1,044 |
16 | spacy-llm | 919 |
17 | open-semantic-search | 905 |
18 | nlu | 803 |
19 | name-dataset | 772 |
20 | ckip-transformers | 628 |
21 | BERTweet | 555 |
22 | datashare | 541 |
23 | CogCompNLP | 469 |