Jupyter Notebook Natural Language Processing

Open-source Jupyter Notebook projects categorized as Natural Language Processing

Top 23 Jupyter Notebook Natural Language Processing Projects

  • Made-With-ML

    Learn how to design, develop, deploy and iterate on production-grade ML applications.

    Project mention: [D] How do you keep up to date on Machine Learning? | /r/learnmachinelearning | 2023-08-13

    Made With ML

  • nlp-tutorial

    Natural Language Processing Tutorial for Deep Learning Researchers

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • nlpaug

    Data augmentation for NLP

  • pytorch-sentiment-analysis

    Tutorials on getting started with PyTorch and TorchText for sentiment analysis.

  • Data-science

    Collection of useful data science topics along with articles, videos, and code (by khuyentran1401)


    A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

    Project mention: AutoGen: Enabling Next-Gen GPT-X Applications | news.ycombinator.com | 2023-08-22

    I really like the simplicity of this framework, and they hit on a lot of common problems found in other agent-based frameworks. Most intrigued by the RAG improvements.

    Seems like Microsoft was frustrated with the pace of movement in this space and the shitty results of agents (which admittedly kept my interest turned away from agents for the last few months). I'm interested again because it makes practical sense, and from looking at the example notebooks, seems fairly easy to integrate into existing applications.

    Maybe this is the 'low code' approach that might actually work, and bridge together engineering and non-engineering resources.

    This example was what caught my eye: https://github.com/microsoft/FLAML/blob/main/notebook/autoge...

  • adapters

    A Unified Library for Parameter-Efficient and Modular Transfer Learning

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • ml-course

    Open Machine Learning course

    Project mention: ml-course: NEW Courses - star count:1339.0 | /r/algoprojects | 2023-11-06
  • pythoncode-tutorials

    The Python Code Tutorials

  • ecco

    Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).

  • bert_score

    BERT score for text generation

  • EasyEdit

    An Easy-to-use Knowledge Editing Framework for LLMs.

    Project mention: Looking for Paper about LLM Fine Tuning for specific topic / Alignment Paper | /r/LocalLLaMA | 2023-12-09
  • fastText_multilingual

    Multilingual word vectors in 78 languages

    Project mention: Ask HN: What's the coolest non standard application of LLMs you've seen? | news.ycombinator.com | 2023-12-23

    (6 years ago)

    Aligning the fastText vectors of 78 languages


  • transformers-interpret

    Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

  • question_generation

    Neural question generation using transformers

    Project mention: Yes/No style Question and Answer Generation | /r/learnpython | 2023-06-15

    I have seen models which do something similar but the questions they ask are not in a Yes/No style such as this T5 - based Question Generator. Essentially, I was wondering how I would go about developing such a model.

  • ThoughtSource

    A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/

  • hate-speech-and-offensive-language

    Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

  • conformal-prediction

    Lightweight, useful implementation of conformal prediction on real data.

  • fromage

    🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

  • malaya

    Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/

  • adaptnlp

    An easy to use Natural Language Processing library and framework for predicting, training, fine-tuning, and serving up state-of-the-art NLP models.

  • bert-sklearn

    a sklearn wrapper for Google's BERT model

  • retvec

    RETVec is an efficient, multilingual, and adversarially-robust text vectorizer.

    Project mention: New AI Spam Detection Deployed to Gmail and Open Sourced by Google | news.ycombinator.com | 2023-12-13
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-12-23.

Jupyter Notebook Natural Language Processing related posts


What are some of the best open-source Natural Language Processing projects in Jupyter Notebook? This list will help you:

Project Stars
1 Made-With-ML 35,181
2 nlp-tutorial 13,480
3 nlpaug 4,252
4 pytorch-sentiment-analysis 4,157
5 Data-science 3,912
6 FLAML 3,552
7 adapters 2,312
8 ml-course 1,973
9 pythoncode-tutorials 1,948
10 ecco 1,870
11 bert_score 1,361
12 EasyEdit 1,194
13 fastText_multilingual 1,185
14 transformers-interpret 1,181
15 question_generation 1,049
16 ThoughtSource 814
17 hate-speech-and-offensive-language 740
18 conformal-prediction 607
19 fromage 446
20 malaya 444
21 adaptnlp 414
22 bert-sklearn 289
23 retvec 251
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives