Top 23 NLP Open-Source Projects
-
transformers
Project mention: None of the top 10 projects in GitHub is actually a software project 🤯 | dev.to | 2025-05-10
We see an addition to the AI community with AutoGPT. Along with TensorFlow, it represents the AI community in the software category, which is becoming more relevant (2 out of 8). We can expect new AI projects such as Transformers or Ollama to enter the top 25 in the future (they currently rank 34th and 36th, respectively).
-
ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
⭐️ RAG Flow on GitHub
-
ailearning
AI learning
-
bert
Resource: BERT Paper
-
AI-For-Beginners
Project mention: The Top 9️⃣ Repositories to learn Python programming + Resources (Extra) 🤯 | dev.to | 2024-11-06
⭐️ AI For Beginners on GitHub.
-
HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
-
spaCy
Project mention: 15,000 lines of verified cryptography now in Python | news.ycombinator.com | 2025-04-18
Geez, honestly.
This seems to be the issue: https://github.com/explosion/spaCy/issues/13658#issuecomment...
And you depend on opinionated libraries that break with newer versions. Why? Well, because f you, that's why! Because our library is not just a tool, it's a lifestyle.
Though it seems that Pydantic 1.x does support Python 3.13: https://docs.pydantic.dev/1.10/changelog/#v11020-2025-01-07
-
storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Project mention: Code Explanation: "STORM: Synthesis of Topic Outlines through Retrieval and Multi-perspective Question Asking" | dev.to | 2025-03-08
Note: this explanation only covers the knowledge_storm in the storm repo because it aligns with my interests.
-
500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code
500 AI Machine learning Deep learning Computer vision NLP Projects with code
500 AI machine learning NLP programming projects
-
unilm
Project mention: A Picture Is Worth 170 Tokens: How Does GPT-4o Encode Images? | news.ycombinator.com | 2024-06-07
Has anyone tried Kosmos [0]? I came across it the other day and it looked shiny and interesting, but I haven't had a chance to put it to the test much yet.
[0] - https://github.com/microsoft/unilm/tree/master/kosmos-2.5
-
haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Project mention: Building a Prompt-Based Crypto Trading Platform with RAG and Reddit Sentiment Analysis using Haystack | dev.to | 2025-04-28
Haystack forms the backbone of our RAG system. It provides pipelines for processing documents, embedding text, and retrieving relevant information.
-
rasa
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
Rasa GitHub Repository
-
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Project mention: 20 Open Source Tools I Recommend to Build, Share, and Run AI Projects | dev.to | 2024-11-13
Datasets library repository for accessing and sharing datasets with the community.
-
best-of-ml-python
Project mention: A ranked list of machine learning Python libraries. Updated weekly | news.ycombinator.com | 2025-01-31
-
Chinese-LLaMA-Alpaca
-
awesome-nlp
-
ML-YouTube-Courses
Machine Learning YouTube courses
-
FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Project mention: About FinGPT: Open-Source Financial Large Language Models | news.ycombinator.com | 2024-08-28
-
gensim
-
Awesome-pytorch-list
A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.
-
nlp-tutorial
-
DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
-
flair
Project mention: WhisperNER: Unified Open Named Entity and Speech Recognition | news.ycombinator.com | 2024-11-21
only the last string is a LOC named entity. Of course you can change definitions from the standard ones if you like, but then you should be careful not to compare with tools that use the original standard definition of NER, such as flairNLP [1].
[1] https://github.com/flairNLP/flair?tab=readme-ov-file
NLP discussion
NLP-related posts
-
VerifAI – open-source generative search with verification
-
How to Install Foundation-Sec 8B by Cisco: The Ultimate Cybersecurity AI Model
-
Routr: Fast local replacement for DuckDuckGo bangs
-
How to Install Qwen2.5-Omni 3B Locally
-
Making Sure AI Agents Play Nice: A Look at How We Evaluate Them
-
Are LLMs Random?
-
Building a Prompt-Based Crypto Trading Platform with RAG and Reddit Sentiment Analysis using Haystack
-
A note from our sponsor - SaaSHub
www.saashub.com | 16 May 2025
Index
What are some of the best open-source NLP projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | transformers | 144,375 |
2 | ragflow | 52,039 |
3 | ailearning | 40,779 |
4 | bert | 39,124 |
5 | AI-For-Beginners | 37,414 |
6 | HanLP | 35,016 |
7 | spaCy | 31,537 |
8 | storm | 24,288 |
9 | 500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code | 23,257 |
10 | unilm | 21,230 |
11 | haystack | 20,709 |
12 | rasa | 20,134 |
13 | datasets | 20,090 |
14 | best-of-ml-python | 20,028 |
15 | Chinese-LLaMA-Alpaca | 18,816 |
16 | awesome-nlp | 17,147 |
17 | ML-YouTube-Courses | 16,489 |
18 | FinGPT | 16,110 |
19 | gensim | 16,017 |
20 | Awesome-pytorch-list | 15,807 |
21 | nlp-tutorial | 14,437 |
22 | DeepLearningExamples | 14,253 |
23 | flair | 14,160 |