Python nlp-library

Open-source Python projects categorized as nlp-library

Top 23 Python nlp-library Projects

  • transformers

    🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

    Project mention: Gemma doesn't suck anymore – 8 bug fixes | news.ycombinator.com | 2024-03-11

    Thanks! :) I'm pushing them into transformers, pytorch-gemma and collabing with the Gemma team to resolve all the issues :)

    The RoPE fix should already be in transformers 4.38.2: https://github.com/huggingface/transformers/pull/29285

    My main PR for transformers which fixes most of the issues (some still left): https://github.com/huggingface/transformers/pull/29402

  • spaCy

    💫 Industrial-strength Natural Language Processing (NLP) in Python

    Project mention: Step by step guide to create customized chatbot by using spaCy (Python NLP library) | dev.to | 2024-03-23

    Hi Community, in this article I will demonstrate the steps below to create your own chatbot using spaCy (an open-source software library for advanced natural language processing, written in Python and Cython):

  • OpenPrompt

    An Open-Source Framework for Prompt-Learning.

  • FARM

    🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

  • tika-python

    Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

  • contextualized-topic-models

    A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.

    Project mention: [Project] Topic modelling of tweets from the same user | /r/MachineLearning | 2023-04-14

    In our experiments, CTM works well with tweets: https://github.com/MilaNLProc/contextualized-topic-models (I'm one of the authors)

  • pythainlp

    Thai Natural Language Processing in Python.

  • skweak

    skweak: A software toolkit for weak supervision applied to NLP tasks

  • janome

    Japanese morphological analysis engine written in pure Python

    Project mention: [discussion] Open AI api translations | /r/Re_Zero | 2023-04-19

  • OCTIS

    OCTIS: Comparing Topic Models is Simple! A Python package to optimize and evaluate topic models (accepted at EACL 2021 demo track)

  • DataDreamer

    DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

    Project mention: FLaNK Stack 26 February 2024 | dev.to | 2024-02-26

  • camel_tools

    A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

    Project mention: [Arabic>latin transliteration] any apps for this? | /r/translator | 2023-04-30

    Otherwise it depends on your use case. There are NLP libraries like this one that can do the job.

  • zshot

    Zero and Few shot named entity & relationships recognition

    Project mention: A transformer-based method for zero and few-shot biomedical NER | news.ycombinator.com | 2023-05-12

  • mutate

    A library to synthesize text datasets using Large Language Models (LLM)

  • turkish-deasciifier

    Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs

  • toiro

    A comparison tool of Japanese tokenizers

  • NLP-Guide

    Natural Language Processing (NLP). Covering topics such as Tokenization, Part Of Speech tagging (POS), Machine translation, Named Entity Recognition (NER), Classification, and Sentiment analysis.

  • rakun2

    RaKUn 2.0 - A fast keyword detection algorithm

  • taxonomy4good

    Taxonomy4Good: a sustainability lexicon that provides the freedom to create custom taxonomies in addition to listed ESG and Sustainability Standards taxonomies.

  • Semi-Automated-Youtube-Channel

    Semi-automated YouTube channel with a range of features that can be reused in content-generation projects

    Project mention: YouTube content creation assistant | /r/Python | 2023-06-08

  • breame

    Lightweight utility tools for the detection of multiple spellings, meanings, and language-specific terminology in British and American English

  • MultiEL

    Multilingual Entity Linking model by BELA model

    Project mention: [P] MultiEL: Multilingual Entity Linking model by BELA model | /r/MachineLearning | 2023-06-29

  • loquax

    NLP framework for phonology

    Project mention: Seeking your insights on "Loquax": A tool for phonological analysis | /r/latin | 2023-05-30

    Lovely - thanks so much for the feedback u/christmas_fan1 - it means a lot. I've created an issue with it linking back to your original comment: https://github.com/mattlianje/loquax/issues/11

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2024-03-23.

Index

What are some of the best open-source nlp-library projects in Python? This list will help you:

Rank Project Stars
1 transformers 122,577
2 spaCy 28,506
3 OpenPrompt 4,116
4 FARM 1,718
5 tika-python 1,395
6 contextualized-topic-models 1,151
7 pythainlp 926
8 skweak 907
9 janome 825
10 OCTIS 678
11 DataDreamer 598
12 camel_tools 374
13 zshot 309
14 mutate 148
15 turkish-deasciifier 140
16 toiro 111
17 NLP-Guide 62
18 rakun2 60
19 taxonomy4good 25
20 Semi-Automated-Youtube-Channel 15
21 breame 11
22 MultiEL 7
23 loquax 2
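
The ordering rule used for this list (most GitHub stars first) can be illustrated with a minimal sketch. It assumes a small subset of the rows above copied into a dictionary; the name `rank_by_stars` is hypothetical and not part of any listed project, and the ranks it prints are relative to that subset only, not to the full list of 23.

```python
# Star counts copied from a few rows of the index table above.
projects = {
    "spaCy": 28_506,
    "loquax": 2,
    "transformers": 122_577,
    "janome": 825,
    "OpenPrompt": 4_116,
}


def rank_by_stars(stars):
    """Return (rank, name, stars) tuples, most-starred project first."""
    ordered = sorted(stars.items(), key=lambda item: item[1], reverse=True)
    return [
        (rank, name, count)
        for rank, (name, count) in enumerate(ordered, start=1)
    ]


ranking = rank_by_stars(projects)

# Print in the same "rank name stars" layout as the index.
for rank, name, count in ranking:
    print(f"{rank} {name} {count:,}")
```

Because the sort key is the star count alone, two projects with equal counts would keep their input order; the listing does not say how ties are broken.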