nlp-library

Top 23 nlp-library Open-Source Projects

  • transformers

    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

  • Project mention: Maxtext: A simple, performant and scalable Jax LLM | news.ycombinator.com | 2024-04-23

    Is t5x an encoder/decoder architecture?

    Some more general options.

    The Flax ecosystem

    https://github.com/google/flax?tab=readme-ov-file

    or dm-haiku

    https://github.com/google-deepmind/dm-haiku

    were some of the best developed communities in the Jax AI field

    Perhaps the “trax” repo? https://github.com/google/trax

    Some HF examples https://github.com/huggingface/transformers/tree/main/exampl...

    Sadly it seems much of the work is proprietary these days, but one example could be Grok-1, if you customize the details. https://github.com/xai-org/grok-1/blob/main/run.py

  • spaCy

    💫 Industrial-strength Natural Language Processing (NLP) in Python

  • Project mention: Step by step guide to create customized chatbot by using spaCy (Python NLP library) | dev.to | 2024-03-23

    Hi Community, In this article, I will demonstrate below steps to create your own chatbot by using spaCy (spaCy is an open-source software library for advanced natural language processing, written in the programming languages Python and Cython):

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • Awesome-pytorch-list

    A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.

  • OpenPrompt

    An Open-Source Framework for Prompt-Learning.

  • FARM

    :house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

  • tika-python

    Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

  • contextualized-topic-models

    A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • pythainlp

    Thai Natural Language Processing in Python.

  • skweak

    skweak: A software toolkit for weak supervision applied to NLP tasks

  • janome

    Japanese morphological analysis engine written in pure Python

  • kagome

    Self-contained Japanese Morphological Analyzer written in pure Go

  • Sudachi

    A Japanese Tokenizer for Business

  • OCTIS

    OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

  • lingua

    The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

  • DataDreamer

    DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

  • Project mention: FLaNK AI - 01 April 2024 | dev.to | 2024-04-01
  • Giveme5W1H

    Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?

  • medspacy

    Library for clinical NLP with spaCy.

  • camel_tools

    A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

  • Project mention: [Arabic>latin transliteration] any apps for this? | /r/translator | 2023-04-30

    Otherwise it depends on your use case. There are NLP libraries like this one that can do the job.

  • zshot

    Zero and Few shot named entity & relationships recognition

  • Project mention: A transformer-based method for zero and few-shot biomedical NER | news.ycombinator.com | 2023-05-12
  • bllip-parser

    BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser) See http://pypi.python.org/pypi/bllipparser/ for Python module.

  • mutate

    A library to synthesize text datasets using Large Language Models (LLM)

  • lingo

    package lingo provides the data structures and algorithms required for natural language processing (by chewxy)

  • turkish-deasciifier

    Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

nlp-library related posts

Index

What are some of the best open-source nlp-library projects? This list will help you:

Project Stars
1 transformers 124,557
2 spaCy 28,704
3 Awesome-pytorch-list 14,932
4 OpenPrompt 4,146
5 FARM 1,723
6 tika-python 1,411
7 contextualized-topic-models 1,157
8 pythainlp 927
9 skweak 909
10 janome 828
11 kagome 789
12 Sudachi 740
13 OCTIS 681
14 lingua 657
15 DataDreamer 632
16 Giveme5W1H 500
17 medspacy 474
18 camel_tools 376
19 zshot 316
20 bllip-parser 225
21 mutate 149
22 lingo 146
23 turkish-deasciifier 143

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com