Python spacy

Open-source Python projects categorized as spacy

Top 15 Python spacy Projects

  • spaCy

    💫 Industrial-strength Natural Language Processing (NLP) in Python

    Project mention: Topic modelling with Gensim and SpaCy on startup news | dev.to | 2022-01-17

    spaCy is one of the most popular NLP libraries, and it is very fast and flexible.

  • rasa

    💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

    Project mention: How to Create the Perfect README for Your Open Source Project | dev.to | 2021-11-02

    This example is sourced from RasaHQ

  • thinc

    🔮 A refreshing functional take on deep learning, compatible with your favorite libraries

    Project mention: good examples of functional-like python code that one can study? | reddit.com/r/functionalprogramming | 2021-06-29

    thinc: defining neural nets in a functional way. jax: a new deep learning framework that puts emphasis on functions rather than tensors. I've tested it for a couple of applications and it's really cool: you can write stuff like you'd write math expressions in papers using numpy. That speeds up development significantly and makes code much more readable.

  • textacy

    NLP, before and after spaCy

    Project mention: Spacy for keyword extraction | reddit.com/r/LanguageTechnology | 2021-12-21

    Have a look at textacy: https://github.com/chartbeat-labs/textacy

  • pytextrank

    Python implementation of TextRank for phrase extraction and summarization of text documents

    Project mention: Question on easing comprehension | dev.to | 2021-09-15

  • Dragonfire

    The open-source virtual assistant for Ubuntu-based Linux distributions

    Project mention: Why your own Assistant when there are sooo many? | reddit.com/r/SapphireFramework | 2021-08-31

  • spacy-models

    💫 Models for the spaCy Natural Language Processing (NLP) library

    Project mention: word similarity vs. sentence similarity | reddit.com/r/LanguageTechnology | 2021-08-25

    Well, the medium model is using GloVe (Common Crawl) for word vectors. There are only 685K keys, so depending on the corpus you are working with, it's possible lots of the words you are interested in don't have a corresponding vector and end up as zero vectors. spaCy Doc/Span vectors are simply averages of the word vectors. So the higher performance of phrases may simply be because there is a higher chance of non-Out-of-Vocabulary (OOV) words, and so less chance of a zero vector.

  • Klayers

    Python Packages as AWS Lambda Layers

    Project mention: Can a lambda use a layer which is stored in S3 | reddit.com/r/aws | 2021-03-19

    I like to use this guy’s layers as an ARN: https://github.com/keithrozario/Klayers

  • projects

    🪐 End-to-end NLP workflows from prototype to production (by explosion)

    Project mention: Using pre-trained BERT embeddings for multi-class text classification | reddit.com/r/LanguageTechnology | 2022-01-10

    spaCy has an example project that uses BERT that you could use as a reference. It's multilabel but it should be easy to tweak the config to be just multiclass instead.

  • skweak

    skweak: A software toolkit for weak supervision applied to NLP tasks

    Project mention: The hand-picked selection of the best Python libraries released in 2021 | reddit.com/r/Python | 2021-12-21

    skweak.

  • subreddit-analyzer

    A comprehensive Data and Text Mining workflow for submissions and comments from any given public subreddit.

    Project mention: [For Hire] Data Analysis, Bots, Web Scrapers & Automation Software | reddit.com/r/forhire | 2021-03-23

    Subreddit Analyzer using pandas, matplotlib, Seaborn, spaCy and wordcloud.

  • medaCy

    🏥 Medical Text Mining and Information Extraction with spaCy

    Project mention: Help / Direction | reddit.com/r/MLQuestions | 2021-02-12

    If you want an easier, more straightforward approach, you can check out medaCy (https://github.com/NLPatVCU/medaCy)

  • summarizer

    A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom built algorithm to rank words and sentences.

    Project mention: please send me your most verbose school/academic/college texts. | reddit.com/r/godot | 2022-01-03

    annnnnnd I can't find any comments it has made now. It's a bot on reddit that reads linked articles and generates a summary. It doesn't do what you're doing though, so it won't be useful anyways. Here is a bot I found, if you're feeling adventurous and want to read python source code that has nothing to do with what you're doing haha https://github.com/PhantomInsights/summarizer

  • thinc-apple-ops

    🍏 Make Thinc faster on macOS by calling into Apple's native Accelerate library

    Project mention: Spacy training on Apple M1 vs. AMD Ryzen 5900X: 55% faster, 16x more efficient | news.ycombinator.com | 2021-11-08

  • healthsea

    Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.

    Project mention: I built an NLP pipeline for analyzing supplement reviews called Healthsea 🐳 | reddit.com/r/Python | 2022-01-06

    Github: https://github.com/explosion/healthsea
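
The spacy-models mention above says that document and span vectors are averages of word vectors, and that out-of-vocabulary (OOV) words end up as zero vectors. The following is a minimal sketch of that effect in plain Python. It does not call spaCy: the toy vocabulary `TOY_VECTORS`, the dimension `DIM`, and the helpers `word_vector` and `doc_vector` are all made up for illustration.

```python
# Toy illustration only: the vocabulary and vectors are invented, and
# word_vector / doc_vector are hypothetical helpers, not spaCy's API.
from typing import Dict, List

DIM = 3
TOY_VECTORS: Dict[str, List[float]] = {
    "cat": [1.0, 0.0, 0.0],
    "dog": [0.0, 1.0, 0.0],
}


def word_vector(token: str) -> List[float]:
    """Return the stored vector, or a zero vector for an OOV token."""
    return TOY_VECTORS.get(token, [0.0] * DIM)


def doc_vector(tokens: List[str]) -> List[float]:
    """Average the per-token vectors, zero vectors included."""
    if not tokens:
        return [0.0] * DIM
    vectors = [word_vector(t) for t in tokens]
    # zip(*vectors) walks the vectors one dimension at a time.
    return [sum(column) / len(tokens) for column in zip(*vectors)]


print(doc_vector(["cat", "dog"]))    # both tokens are in the vocabulary
print(doc_vector(["cat", "blorp"]))  # "blorp" is OOV and drags the average toward zero
print(doc_vector(["blorp"]))         # a lone OOV word gives the zero vector
```

Under these assumptions, a single OOV word has nothing to average with and stays at zero, while a longer phrase is more likely to contain at least one in-vocabulary word, which matches the explanation in the quoted post.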

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2022-01-17.

Index

What are some of the best open-source spacy projects in Python? This list will help you:

#    Project              Stars
1    spaCy                22,251
2    rasa                 13,425
3    thinc                2,439
4    textacy              1,866
5    pytextrank           1,701
6    Dragonfire           1,208
7    spacy-models         998
8    Klayers              993
9    projects             731
10   skweak               620
11   subreddit-analyzer   474
12   medaCy               347
13   summarizer           235
14   thinc-apple-ops      57
15   healthsea            49