Top 23 Python named-entity-recognition Projects
-
HanLP
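Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and keyphrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing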
-
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
-
Stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
-
simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
-
NCRFpp
NCRF++, a neural sequence labeling toolkit. Easy to use for any sequence labeling task (e.g. NER, POS tagging, segmentation). It includes character LSTM/CNN, word LSTM/CNN, and softmax/CRF components.
-
entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
-
seqeval
A Python framework for sequence labeling evaluation (named-entity recognition, POS tagging, etc.)
-
nlu
One line of code for thousands of state-of-the-art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.
-
PIXIU
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
-
camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
-
healthsea
Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.
-
ARElight
Granular Viewer of Sentiments Between Entities in Massively Large Documents and Collections of Texts, powered by AREkit
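Several of the projects above (seqeval in particular) deal with evaluating sequence labeling output at the entity level rather than token by token. The following is a minimal, dependency-free sketch of that idea, assuming BIO-style tags such as `B-PER`, `I-PER`, and `O`. The helper names `bio_to_spans` and `entity_f1` are hypothetical and are not part of seqeval or any other listed library; a stray `I-` tag is leniently treated as the start of a new entity, which is an assumption of this sketch.

```python
from typing import List, Set, Tuple

# (entity type, start index, end index exclusive); hypothetical type alias
Span = Tuple[str, int, int]


def bio_to_spans(tags: List[str]) -> Set[Span]:
    """Collect entity spans from one sentence tagged in the BIO scheme."""
    spans: Set[Span] = set()
    start, current = None, None
    # A trailing "O" sentinel flushes an entity that ends the sentence.
    for i, tag in enumerate(tags + ["O"]):
        prefix, _, label = tag.partition("-")
        continues = prefix == "I" and label == current
        if current is not None and not continues:
            spans.add((current, start, i))
            start, current = None, None
        # Assumption: an "I-" tag with no open entity starts a new one.
        if prefix == "B" or (prefix == "I" and current is None):
            start, current = i, label
    return spans


def entity_f1(gold: List[List[str]], pred: List[List[str]]) -> float:
    """Micro-averaged F1 where a prediction counts only on an exact span and type match."""
    tp = n_gold = n_pred = 0
    for g_tags, p_tags in zip(gold, pred):
        g, p = bio_to_spans(g_tags), bio_to_spans(p_tags)
        tp += len(g & p)
        n_gold += len(g)
        n_pred += len(p)
    if tp == 0:
        return 0.0
    precision, recall = tp / n_pred, tp / n_gold
    return 2 * precision * recall / (precision + recall)


gold = [["B-PER", "I-PER", "O", "B-LOC"]]
pred = [["B-PER", "I-PER", "O", "O"]]
print(round(entity_f1(gold, pred), 3))
```

Here the prediction recovers one of the two gold entities exactly, so precision is 1.0 and recall is 0.5. Scoring whole spans instead of individual tokens means a prediction with a wrong boundary earns no credit for that entity.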
Project mention: Step by step guide to create customized chatbot by using spaCy (Python NLP library) | dev.to | 2024-03-23
Hi Community, in this article I will demonstrate the steps below to create your own chatbot using spaCy (an open-source software library for advanced natural language processing, written in Python and Cython):
Project mention: Would this method work to increase the memory of the model? Saving summaries generated by a 2nd model and injecting them depending on the current topic. | /r/LocalLLaMA | 2023-06-09
There is of course the list at https://github.com/juand-r/entity-recognition-datasets, but all of the recent English datasets cover other domains of English, such as the music NER, space NER, etc. All interesting things, but not 2020s English newswire.
Otherwise it depends on your use case. There are NLP libraries like this one that can do the job.
Project mention: A transformer-based method for zero and few-shot biomedical NER | news.ycombinator.com | 2023-05-12
Project mention: A LLM trained to follow annotation guidelines, for information extraction tasks | news.ycombinator.com | 2023-10-30
Python named-entity-recognition related posts
- Recent English newswire NER datasets?
- PIXIU: NEW Data - star count:172.0
- PIXIU: NEW Data - star count:124.0
- Would this method work to increase the memory of the model? Saving summaries generated by a 2nd model and injecting them depending on the current topic.
Index
What are some of the best open-source named-entity-recognition projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | HanLP | 32,214 |
2 | spaCy | 28,660 |
3 | NLP-progress | 22,290 |
4 | flair | 13,538 |
5 | Stanza | 7,043 |
6 | simpletransformers | 3,972 |
7 | DeepKE | 2,891 |
8 | NCRFpp | 1,877 |
9 | entity-recognition-datasets | 1,431 |
10 | BERT-NER | 1,168 |
11 | seqeval | 1,044 |
12 | spacy-llm | 919 |
13 | nlu | 803 |
14 | name-dataset | 772 |
15 | ckip-transformers | 628 |
16 | BERTweet | 555 |
17 | PIXIU | 393 |
18 | camel_tools | 375 |
19 | zshot | 315 |
20 | GoLLIE | 204 |
21 | huspacy | 146 |
22 | healthsea | 84 |
23 | ARElight | 35 |