Top 23 Python spacy Projects

spaCy

107 28,789 9.2 Python

💫 Industrial-strength Natural Language Processing (NLP) in Python

Project mention: How I discovered Named Entity Recognition while trying to remove gibberish from a string. | dev.to | 2024-05-06

rasa

16 18,012 9.6 Python

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Project mention: 🔥🚀 Top 10 Open-Source Must-Have Tools for Crafting Your Own Chatbot 🤖💬 | dev.to | 2023-11-06

Support Rasa on GitHub ⭐

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
thinc

4 2,796 7.6 Python

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries

Project mention: JAX – NumPy on the CPU, GPU, and TPU, with great automatic differentiation | news.ycombinator.com | 2023-09-28

Agree, though I wouldn’t call PyTorch a drop-in for NumPy either. CuPy is the drop-in. Excepting some corner cases, you can use the same code for both. Thinc’s ops work with both NumPy and CuPy:
https://github.com/explosion/thinc/blob/master/thinc/backend...

textacy

1 2,174 6.1 Python

NLP, before and after spaCy
pytextrank

2 2,102 5.9 Python

Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
Klayers

4 1,969 8.0 Python

Python Packages as AWS Lambda Layers
scispacy

2 1,616 6.7 Python

A full spaCy pipeline and models for scientific/biomedical documents.
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
spacy-models

3 1,515 9.2 Python

💫 Models for the spaCy Natural Language Processing (NLP) library
Dragonfire

2 1,372 0.0 Python

the open-source virtual assistant for Ubuntu based Linux distributions
refinery

20 1,366 4.5 Python

The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
projects

6 1,249 4.5 Python

🪐 End-to-end NLP workflows from prototype to production (by explosion)
lambda-packs

1 1,106 4.0 Python

Precompiled packages for AWS Lambda
spacy-llm

4 948 8.8 Python

🦙 Integrating LLMs into structured NLP pipelines

Project mention: Integrating LLMs into structured NLP pipelines | news.ycombinator.com | 2023-09-10

skweak

8 910 6.2 Python

skweak: A software toolkit for weak supervision applied to NLP tasks
cltk

1 820 7.9 Python

The Classical Language Toolkit (by cltk)
subreddit-analyzer

5 486 0.0 Python

A comprehensive Data and Text Mining workflow for submissions and comments from any given public subreddit.
medaCy

1 421 0.0 Python

:hospital: Medical Text Mining and Information Extraction with spaCy
WordDumb

15 335 8.8 Python

A calibre plugin that generates Kindle Word Wise and X-Ray files for KFX, AZW3, MOBI and EPUB eBook.

Project mention: Create Kindle X-ray with calibre? | /r/Calibre | 2023-07-08

Manual here: https://xxyzz.github.io/WordDumb/

zshot

2 320 6.6 Python

Zero and Few shot named entity & relationships recognition

Project mention: A transformer-based method for zero and few-shot biomedical NER | news.ycombinator.com | 2023-05-12

summarizer

4 267 0.0 Python

A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom built algorithm to rank words and sentences.
negspacy

1 267 1.0 Python

spaCy pipeline object for negating concepts in text
LemmInflect

1 248 2.5 Python

A python module for English lemmatization and inflection.
concise-concepts

1 242 2.5 Python

This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python spacy related posts

Integrating LLMs into structured NLP pipelines

1 project | news.ycombinator.com | 10 Sep 2023
Advanced NLP with SpaCy

1 project | news.ycombinator.com | 9 Sep 2023
Spacy-LLM: Integrating LLMs into structured NLP pipelines

1 project | news.ycombinator.com | 27 Jul 2023
SQLite-ner: SQLite tool to extract entities into a new table using spaCy

1 project | news.ycombinator.com | 6 Jul 2023
GitHub - redraw/sqlite-ner: sqlite tool to extract entities into a new table using spaCy

1 project | /r/sqlite | 6 Jul 2023
Identify custom labels as well as existing labels with Spacy v3

1 project | /r/LanguageTechnology | 12 Mar 2023
Lambda with Python libraries

1 project | /r/aws | 22 Jan 2023
A note from our sponsor - SaaSHub
www.saashub.com | 9 May 2024

SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source spacy projects in Python? This list will help you:

	Project	Stars
1	spaCy	28,789
2	rasa	18,012
3	thinc	2,796
4	textacy	2,174
5	pytextrank	2,102
6	Klayers	1,969
7	scispacy	1,616
8	spacy-models	1,515
9	Dragonfire	1,372
10	refinery	1,366
11	projects	1,249
12	lambda-packs	1,106
13	spacy-llm	948
14	skweak	910
15	cltk	820
16	subreddit-analyzer	486
17	medaCy	421
18	WordDumb	335
19	zshot	320
20	summarizer	267
21	negspacy	267
22	LemmInflect	248
23	concise-concepts	242

Python spacy

Top 23 Python spacy Projects

Python spacy related posts

Integrating LLMs into structured NLP pipelines

Advanced NLP with SpaCy

Spacy-LLM: Integrating LLMs into structured NLP pipelines

SQLite-ner: SQLite tool to extract entities into a new table using spaCy

GitHub - redraw/sqlite-ner: sqlite tool to extract entities into a new table using spaCy

Identify custom labels as well as existing labels with Spacy v3

Lambda with Python libraries

Index