Top 23 Python named-entity-recognition Projects

HanLP

1 3 35,542 6.8 Python

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
Sevalla

sevalla.com featured

Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
spaCy

2 113 32,385 8.7 Python

💫 Industrial-strength Natural Language Processing (NLP) in Python

Project mention: SpaCy: Industrial-Strength Natural Language Processing (NLP) in Python | news.ycombinator.com | 2025-08-23
NLP-progress

3 17 22,924 6.5 Python

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
flair

4 10 14,275 9.2 Python

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Project mention: WhisperNER: Unified Open Named Entity and Speech Recognition | news.ycombinator.com | 2024-11-21

only the last string is a LOC named entity. Of course you can change definitions from the standard ones if you like, but then you should be careful not to compare with tools that use the original standard definition of NER such as flairNLP [1].
[1] https://github.com/flairNLP/flair?tab=readme-ov-file
Stanza

5 8 7,575 7.7 Python

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
DeepPavlov

6 2 6,924 0.9 Python

An open source library for deep learning end-to-end dialog systems and chatbots.

Project mention: Conversational AI and the Evolution of Search: Redefining How We Find Information | dev.to | 2025-01-29

DeepPavlov: A conversational AI library for building multi-skill chatbots and virtual assistants.
presidio

7 9 5,463 9.0 Python

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Project mention: How We Integrate AI in epilot - Chapter 2: Serverless RAG w/ LangChain & Weaviate | dev.to | 2025-05-26

Presidio
InfluxDB

www.influxdata.com featured

InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
simpletransformers

8 6 4,213 3.7 Python

Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
DeepKE

9 2 4,098 7.2 Python

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
GLiNER

10 8 2,290 8.7 Python

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

Project mention: Navigating the Cybersecurity Maze: Challenges and Solutions in AI Agent Development | dev.to | 2025-02-26

PII and Secret Detection: This involves identifying and removing personally identifiable information (PII) or secrets from the data. Tools like Presidio and GLiNER are great for this purpose. The following Python code demonstrates how to use Guardrails to detect PII and secrets in text:
NCRFpp

11 1 1,895 0.0 Python

NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
entity-recognition-datasets

12 3 1,548 6.2 Python

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
spacy-llm

13 4 1,303 7.5 Python

🦙 Integrating LLMs into structured NLP pipelines
BERT-NER

14 1 1,241 0.0 Python

Pytorch-Named-Entity-Recognition-with-BERT
seqeval

15 1 1,149 2.6 Python

A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
name-dataset

16 3 939 7.7 Python

The Python library for names.

Project mention: Chain of Draft: Thinking Faster by Writing Less | dev.to | 2025-02-28

NameDataset
nlu

17 25 936 7.7 Python

1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
ckip-transformers

18 1 740 3.3 Python

CKIP Transformers
BERTweet

19 1 594 3.8 Python

BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
camel_tools

20 2 482 5.9 Python

A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
GoLLIE

21 1 392 3.4 Python

Guideline following Large Language Model for Information Extraction
zshot

22 2 385 6.5 Python

Zero and Few shot named entity & relationships recognition
huspacy

23 3 171 6.7 Python

HuSpaCy: industrial-strength Hungarian natural language processing
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python named-entity-recognition discussion

Python named-entity-recognition related posts

Chain of Draft: Thinking Faster by Writing Less

1 project | dev.to | 28 Feb 2025
WhisperNER: Unified Open Named Entity and Speech Recognition

2 projects | news.ycombinator.com | 21 Nov 2024
Recent English newswire NER datasets?

2 projects | /r/LanguageTechnology | 27 Aug 2023
PIXIU: NEW Data - star count:172.0

1 project | /r/algoprojects | 15 Aug 2023
PIXIU: NEW Data - star count:124.0

1 project | /r/algoprojects | 8 Jul 2023
PIXIU: NEW Data - star count:124.0

1 project | /r/algoprojects | 7 Jul 2023
PIXIU: NEW Data - star count:124.0

1 project | /r/algoprojects | 6 Jul 2023
A note from our sponsor - InfluxDB
www.influxdata.com | 1 Sep 2025

InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Index

What are some of the best open-source named-entity-recognition projects in Python? This list will help you:

#	Project	Stars
1	HanLP	35,542
2	spaCy	32,385
3	NLP-progress	22,924
4	flair	14,275
5	Stanza	7,575
6	DeepPavlov	6,924
7	presidio	5,463
8	simpletransformers	4,213
9	DeepKE	4,098
10	GLiNER	2,290
11	NCRFpp	1,895
12	entity-recognition-datasets	1,548
13	spacy-llm	1,303
14	BERT-NER	1,241
15	seqeval	1,149
16	name-dataset	939
17	nlu	936
18	ckip-transformers	740
19	BERTweet	594
20	camel_tools	482
21	GoLLIE	392
22	zshot	385
23	huspacy	171

Python named-entity-recognition

Top 23 Python named-entity-recognition Projects

Python named-entity-recognition discussion

Python named-entity-recognition related posts

Chain of Draft: Thinking Faster by Writing Less

WhisperNER: Unified Open Named Entity and Speech Recognition

Recent English newswire NER datasets?

PIXIU: NEW Data - star count:172.0

PIXIU: NEW Data - star count:124.0

PIXIU: NEW Data - star count:124.0

PIXIU: NEW Data - star count:124.0

Index

Did you know that Python is the 2nd most popular programming language based on number of references?

Did you know that Python is
the 2nd most popular programming language
based on number of references?