Python nlp-library

Open-source Python projects categorized as nlp-library

Top 23 Python nlp-library Projects

  • transformers

    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

  • Project mention: How to count tokens in frontend for Popular LLM Models: GPT, Claude, and Llama | dev.to | 2024-05-21

    Thanks to transformers.js, we can run the tokenizer and model locally in the browser. Transformers.js is designed to be functionally equivalent to Hugging Face's transformers python library, meaning you can run the same pretrained models using a very similar API.

  • spaCy

    💫 Industrial-strength Natural Language Processing (NLP) in Python

  • Project mention: How I discovered Named Entity Recognition while trying to remove gibberish from a string. | dev.to | 2024-05-06
  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
  • OpenPrompt

    An Open-Source Framework for Prompt-Learning.

  • FARM

    :house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

  • tika-python

    Tika-Python is a Python binding to the Apache Tikaâ„¢ REST services allowing Tika to be called natively in the Python community.

  • contextualized-topic-models

    A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

  • pythainlp

    Thai Natural Language Processing in Python.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • skweak

    skweak: A software toolkit for weak supervision applied to NLP tasks

  • janome

    Japanese morphological analysis engine written in pure Python

  • OCTIS

    OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

  • DataDreamer

    DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

  • Project mention: FLaNK AI - 01 April 2024 | dev.to | 2024-04-01
  • camel_tools

    A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

  • zshot

    Zero and Few shot named entity & relationships recognition

  • mutate

    A library to synthesize text datasets using Large Language Models (LLM)

  • turkish-deasciifier

    Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs

  • toiro

    A comparison tool of Japanese tokenizers

  • NLP-Guide

    Natural Language Processing (NLP). Covering topics such as Tokenization, Part Of Speech tagging (POS), Machine translation, Named Entity Recognition (NER), Classification, and Sentiment analysis.

  • rakun2

    RaKUn 2.0 - A fast keyword detection algorithm

  • taxonomy4good

    Taxonomy4Good: a sustainability lexicon that provides the freedom to create custom taxonomies in addition to listed ESG and Sustainability Standards taxonomies.

  • Semi-Automated-Youtube-Channel

    Semi automated youtube channel that has a lot of cool features for someone to use in their content generating project

  • Project mention: YouTube content creation assistant | /r/Python | 2023-06-08
  • breame

    Lightweight utility tools for the detection of multiple spellings, meanings, and language-specific terminology in British and American English

  • MultiEL

    Multilingual Entity Linking model by BELA model

  • Project mention: [P] MultiEL: Multilingual Entity Linking model by BELA model | /r/MachineLearning | 2023-06-29
  • loquax

    NLP framework for phonology

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python nlp-library related posts

  • DataDreamer

    1 project | news.ycombinator.com | 11 Feb 2024
  • [P] MultiEL: Multilingual Entity Linking model by BELA model

    1 project | /r/MachineLearning | 29 Jun 2023
  • YouTube content creation assistant

    1 project | /r/Python | 8 Jun 2023
  • Seeking your insights on "Loquax": A tool for phonological analysis

    3 projects | /r/latin | 30 May 2023
  • I used GPT-4 to create code that automates absolutely everything in creating YouTube Shorts, from voiceover to editing, even down to choosing the illustration images.

    3 projects | /r/ChatGPT | 27 May 2023
  • [Arabic>latin transliteration] any apps for this?

    1 project | /r/translator | 30 Apr 2023
  • [P] Programmatic: Powerful Weak Labeling

    2 projects | /r/MachineLearning | 20 Apr 2022
  • A note from our sponsor - Scout Monitoring
    www.scoutapm.com | 3 Jun 2024
    Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today. Learn more →

Index

What are some of the best open-source nlp-library projects in Python? This list will help you:

Project Stars
1 transformers 126,915
2 spaCy 28,978
3 OpenPrompt 4,195
4 FARM 1,729
5 tika-python 1,432
6 contextualized-topic-models 1,176
7 pythainlp 933
8 skweak 913
9 janome 832
10 OCTIS 690
11 DataDreamer 691
12 camel_tools 385
13 zshot 325
14 mutate 149
15 turkish-deasciifier 143
16 toiro 114
17 NLP-Guide 66
18 rakun2 61
19 taxonomy4good 26
20 Semi-Automated-Youtube-Channel 16
21 breame 11
22 MultiEL 8
23 loquax 2

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com