Python spacy

Open-source Python projects categorized as spacy

Top 23 Python spacy Projects

  1. spaCy

    ๐Ÿ’ซ Industrial-strength Natural Language Processing (NLP) in Python

    Project mention: 15,000 lines of verified cryptography now in Python | news.ycombinator.com | 2025-04-18

    Geez honestly

    This seems to be the issue https://github.com/explosion/spaCy/issues/13658#issuecomment...

    And you depend on opinionated libraries that break with newer versions. Why? Well because f you that's why! Because our library is not just a tool, it's a lifestyle

    Though it seems that Pydantic 1x does support 3.13 https://docs.pydantic.dev/1.10/changelog/#v11020-2025-01-07

  2. Judoscale

    Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.

    Judoscale logo
  3. rasa

    ๐Ÿ’ฌ Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

    Project mention: What is Rasa? A Beginnerโ€™s Guide to Conversational AI | dev.to | 2024-12-31

    Rasa GitHub Repository

  4. thinc

    ๐Ÿ”ฎ A refreshing functional take on deep learning, compatible with your favorite libraries

  5. Klayers

    Python Packages as AWS Lambda Layers

    Project mention: Creating an Image Thumbnail Generator Using AWS Lambda and S3 Event Notifications with Terraform | dev.to | 2024-06-30

    Klayers: https://github.com/keithrozario/Klayers/tree/master

  6. textacy

    NLP, before and after spaCy

  7. pytextrank

    Python implementation of TextRank algorithms ("textgraphs") for phrase extraction

  8. scispacy

    A full spaCy pipeline and models for scientific/biomedical documents.

  9. InfluxDB

    InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.

    InfluxDB logo
  10. spacy-models

    ๐Ÿ’ซ Models for the spaCy Natural Language Processing (NLP) library

  11. refinery

    The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.

    Project mention: Ultimate guide to prompt engineering | dev.to | 2024-12-07

    Tools: Platforms like LangChain, Kern AI Refinery, and Langtail simplify testing, debugging, and optimizing prompts.

  12. Dragonfire

    the open-source virtual assistant for Ubuntu based Linux distributions

  13. projects

    ๐Ÿช End-to-end NLP workflows from prototype to production (by explosion)

  14. spacy-llm

    ๐Ÿฆ™ Integrating LLMs into structured NLP pipelines

  15. lambda-packs

    Precompiled packages for AWS Lambda

  16. skweak

    skweak: A software toolkit for weak supervision applied to NLP tasks

  17. cltk

    The Classical Language Toolkit (by cltk)

  18. spacy-layout

    ๐Ÿ“š Process PDFs, Word documents and more with spaCy

    Project mention: AI and All Data Weekly for 09 Dec 2024 | dev.to | 2024-12-09

    โ„๏ธ Apache Polaris + Iceberg Quickstart โšก๏ธ How to extract tables from pdfs ๐Ÿš€ Microsoft 1bit LLM BitNet ๐Ÿฟ๏ธ Verifying Kafka Transactions Entry 2 ๐Ÿฟ๏ธ FLUSS: Streaming Storage ๐Ÿฟ๏ธ Fluss -> Flow for Flink Real Time Analytics ๐ŸŒ TableFlow - iceberg / kafka โ„๏ธ Snowflake Cortex AI + Slack ๐Ÿฟ๏ธโ„๏ธ Door dash flink, kafka, snowflake ๐Ÿง  Prompt Stack -- all in one ๐Ÿ”Œ SpaCY Layout for PDF ๐Ÿ“ฑ Responsible AI Pathways ๐Ÿ“ผ Megaparse documents python ๐Ÿ”Œ Time Series LLM โ„๏ธ Generate Synthetic Data in Snowflake ๐Ÿฟ๏ธ LLMs and GenAI - When to use them ๐Ÿฟ๏ธ Flink Observability with Prometheus ๐Ÿ“ก New SQL GUI ๐Ÿซ TDD for GenAI ๐Ÿ•ต๏ธ ๐ŸŽ Open Source Agent Framework for Production ๐Ÿ’ป Cedit command line editor ๐Ÿญ ServiceNow AgentLab ๐ŸŽค Snowflake Lessons Learned in Replication ๐ŸŽ„ Privastead ๐Ÿ”Œ Backup Icloud with nodejs on linux ๐Ÿ”Œ Backup Google with nodejs on linux ๐ŸŽ„ HuggingFace macos chat source code ๐ŸŽ Ollama working with structured output ๐ŸŽ dspy ai how to ๐Ÿ”Œ Piazza updater ๐Ÿ”Œ Building a financial report with langgraph ColPali Notebook with QWEN 2 VL

  19. subreddit-analyzer

    A comprehensive Data and Text Mining workflow for submissions and comments from any given public subreddit.

  20. medaCy

    :hospital: Medical Text Mining and Information Extraction with spaCy

  21. WordDumb

    A calibre plugin that generates Kindle Word Wise and X-Ray files for KFX, AZW3, MOBI and EPUB eBook.

  22. zshot

    Zero and Few shot named entity & relationships recognition

  23. negspacy

    spaCy pipeline object for negating concepts in text

  24. summarizer

    A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom built algorithm to rank words and sentences.

  25. LemmInflect

    A python module for English lemmatization and inflection.

  26. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python spacy discussion

Log in or Post with

Python spacy related posts

  • Creating an Image Thumbnail Generator Using AWS Lambda and S3 Event Notifications with Terraform

    2 projects | dev.to | 30 Jun 2024
  • Integrating LLMs into structured NLP pipelines

    1 project | news.ycombinator.com | 10 Sep 2023
  • Advanced NLP with SpaCy

    1 project | news.ycombinator.com | 9 Sep 2023
  • Spacy-LLM: Integrating LLMs into structured NLP pipelines

    1 project | news.ycombinator.com | 27 Jul 2023
  • SQLite-ner: SQLite tool to extract entities into a new table using spaCy

    1 project | news.ycombinator.com | 6 Jul 2023
  • GitHub - redraw/sqlite-ner: sqlite tool to extract entities into a new table using spaCy

    1 project | /r/sqlite | 6 Jul 2023
  • Identify custom labels as well as existing labels with Spacy v3

    1 project | /r/LanguageTechnology | 12 Mar 2023
  • A note from our sponsor - InfluxDB
    influxdata.com | 30 Apr 2025
    Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems. Learn more โ†’

Index

What are some of the best open-source spacy projects in Python? This list will help you:

# Project Stars
1 spaCy 31,470
2 rasa 20,067
3 thinc 2,844
4 Klayers 2,312
5 textacy 2,216
6 pytextrank 2,175
7 scispacy 1,796
8 spacy-models 1,735
9 refinery 1,435
10 Dragonfire 1,383
11 projects 1,377
12 spacy-llm 1,235
13 lambda-packs 1,117
14 skweak 922
15 cltk 848
16 spacy-layout 567
17 subreddit-analyzer 489
18 medaCy 436
19 WordDumb 430
20 zshot 366
21 negspacy 279
22 summarizer 273
23 LemmInflect 265

Sponsored
Save 47% on cloud hosting with autoscaling that just works
Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
judoscale.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?