SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 nlp-machine-learning Open-Source Projects
-
tika-python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
-
lingua-go
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
lingua-rs
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
-
lingua
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
-
searchGPT
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
-
awesome-sentiment-analysis
Repository with all what is necessary for sentiment analysis and related areas
-
machine-learning-resources
A curated list of awesome machine learning frameworks, libraries, courses, books and many more.
-
Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
-
AI_ChatBot_Python
AI ChatBot using Python Tensorflow and Natural Language Processing (NLP) along side TFLearn
-
semantic-autocomplete
A blazing-fast semantic search React component. Match by meaning, not just by letters. Search as you type without waiting (no debounce needed). Rank by cosine similarity.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Jarvis: A Voice Virtual Assistant in Python (OpenAI, ElevenLabs, Deepgram) | news.ycombinator.com | 2023-12-18There is another one (Also Jarvis) that's been around for a while and is more useful, wonder if they can combine forces? https://github.com/ggeop/Python-ai-assistant
Not sure if anyone has noticed but OpenAI now has a mobile app (I've been using the PWA all this time) and the voice assistant on there is really strong. Sounds good, fast, and seems to even run a pass on my voice before it submits the query.
That is essentially correct. You take an object and "embed" it in a high-dimensional vector space to represent it.
For a deep dive, I highly recommend Vicki Boykis's free materials:
https://vickiboykis.com/what_are_embeddings/
Project mention: I created a program that finds out which anki cards out of 50_000 are in english and deletes them in 2 minutes | /r/rust | 2023-10-23Discovery of Lingua: While working on a different project, I discovered the Lingua library.
Project mention: Show HN: Toolkit for LLM Fine-Tuning, Ablating and Testing | news.ycombinator.com | 2024-04-07
Project mention: machine-learning-resources: NEW Courses - star count:359.0 | /r/algoprojects | 2023-05-27
nlp-machine-learning related posts
- [N] Huggingface/nvidia release open source GPT-2B trained on 1.1T tokens
- YTRecap (looking for collaborators/contributors)
- Show HN: Answering to: 'What is the contrary of courage?'
- MetalTranslate – Customizable machine translation in C++
- Converse with book – Built with GPT-3
- Converse with a Book [pdf]
- Entity Extraction with Predefined List
-
A note from our sponsor - SaaSHub
www.saashub.com | 29 Apr 2024
Index
What are some of the best open-source nlp-machine-learning projects? This list will help you:
Project | Stars | |
---|---|---|
1 | OpenPrompt | 4,152 |
2 | tika-python | 1,418 |
3 | contextualized-topic-models | 1,163 |
4 | lingua-go | 1,095 |
5 | skweak | 909 |
6 | Python-ai-assistant | 853 |
7 | what_are_embeddings | 846 |
8 | lingua-rs | 820 |
9 | LLM-Finetuning-Toolkit | 669 |
10 | babyai | 669 |
11 | lingua | 657 |
12 | dr-doc-search | 601 |
13 | searchGPT | 570 |
14 | awesome-sentiment-analysis | 526 |
15 | NLP-conference-compendium | 458 |
16 | machine-learning-resources | 381 |
17 | segment-anything-with-clip | 300 |
18 | LemmInflect | 246 |
19 | Multi-Type-TD-TSR | 236 |
20 | AI_ChatBot_Python | 223 |
21 | financial-news-dataset | 211 |
22 | Astock | 189 |
23 | semantic-autocomplete | 161 |
Sponsored