Python named-entity-recognition

Open-source Python projects categorized as named-entity-recognition

Top 23 Python named-entity-recognition Projects

named-entity-recognition
  1. HanLP

    Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. spaCy

    💫 Industrial-strength Natural Language Processing (NLP) in Python

    Project mention: 15,000 lines of verified cryptography now in Python | news.ycombinator.com | 2025-04-18

    Geez honestly

    This seems to be the issue https://github.com/explosion/spaCy/issues/13658#issuecomment...

    And you depend on opinionated libraries that break with newer versions. Why? Well because f you that's why! Because our library is not just a tool, it's a lifestyle

    Though it seems that Pydantic 1x does support 3.13 https://docs.pydantic.dev/1.10/changelog/#v11020-2025-01-07

  4. NLP-progress

    Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

  5. flair

    A very simple framework for state-of-the-art Natural Language Processing (NLP)

    Project mention: WhisperNER: Unified Open Named Entity and Speech Recognition | news.ycombinator.com | 2024-11-21

    only the last string is a LOC named entity. Of course you can change definitions from the standard ones if you like, but then you should be careful not to compare with tools that use the original standard definition of NER such as flairNLP [1].

    [1] https://github.com/flairNLP/flair?tab=readme-ov-file

  6. Stanza

    Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

  7. DeepPavlov

    An open source library for deep learning end-to-end dialog systems and chatbots.

    Project mention: Conversational AI and the Evolution of Search: Redefining How We Find Information | dev.to | 2025-01-29

    DeepPavlov: A conversational AI library for building multi-skill chatbots and virtual assistants.

  8. simpletransformers

    Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. DeepKE

    [EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

  11. GLiNER

    Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

    Project mention: Navigating the Cybersecurity Maze: Challenges and Solutions in AI Agent Development | dev.to | 2025-02-26

    PII and Secret Detection: This involves identifying and removing personally identifiable information (PII) or secrets from the data. Tools like Presidio and GLiNER are great for this purpose. The following Python code demonstrates how to use Guardrails to detect PII and secrets in text:

  12. NCRFpp

    NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.

  13. entity-recognition-datasets

    A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

  14. spacy-llm

    🦙 Integrating LLMs into structured NLP pipelines

  15. BERT-NER

    Pytorch-Named-Entity-Recognition-with-BERT

  16. seqeval

    A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)

  17. nlu

    1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.

  18. name-dataset

    The Python library for names.

    Project mention: Chain of Draft: Thinking Faster by Writing Less | dev.to | 2025-02-28

    NameDataset

  19. ckip-transformers

    CKIP Transformers

  20. BERTweet

    BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)

  21. camel_tools

    A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

  22. GoLLIE

    Guideline following Large Language Model for Information Extraction

  23. zshot

    Zero and Few shot named entity & relationships recognition

  24. huspacy

    HuSpaCy: industrial-strength Hungarian natural language processing

  25. ChatGPT_for_IE

    Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python named-entity-recognition discussion

Log in or Post with

Python named-entity-recognition related posts

Index

What are some of the best open-source named-entity-recognition projects in Python? This list will help you:

# Project Stars
1 HanLP 35,016
2 spaCy 31,537
3 NLP-progress 22,861
4 flair 14,160
5 Stanza 7,458
6 DeepPavlov 6,876
7 simpletransformers 4,177
8 DeepKE 3,886
9 GLiNER 2,001
10 NCRFpp 1,895
11 entity-recognition-datasets 1,539
12 spacy-llm 1,242
13 BERT-NER 1,237
14 seqeval 1,132
15 nlu 912
16 name-dataset 905
17 ckip-transformers 724
18 BERTweet 590
19 camel_tools 449
20 GoLLIE 371
21 zshot 370
22 huspacy 167
23 ChatGPT_for_IE 144

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?