nlp-machine-learning

Open-source projects categorized as nlp-machine-learning

Top 23 nlp-machine-learning Open-Source Projects

  • OpenPrompt

    An Open-Source Framework for Prompt-Learning.

  • tika-python

    Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • contextualized-topic-models

    A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.

  • lingua-go

    The most accurate natural language detection library for Go, suitable for short text and mixed-language text

  • skweak

    skweak: A software toolkit for weak supervision applied to NLP tasks

  • Python-ai-assistant

    Python AI assistant 🧠

  • Project mention: Jarvis: A Voice Virtual Assistant in Python (OpenAI, ElevenLabs, Deepgram) | news.ycombinator.com | 2023-12-18

    There is another one (Also Jarvis) that's been around for a while and is more useful, wonder if they can combine forces? https://github.com/ggeop/Python-ai-assistant

    Not sure if anyone has noticed but OpenAI now has a mobile app (I've been using the PWA all this time) and the voice assistant on there is really strong. Sounds good, fast, and seems to even run a pass on my voice before it submits the query.

  • what_are_embeddings

    A deep dive into embeddings starting from fundamentals

  • Project mention: The Illustrated Word2Vec | news.ycombinator.com | 2024-04-19

    That is essentially correct. You take an object and "embed" it in a high-dimensional vector space to represent it.

    For a deep dive, I highly recommend Vicki Boykis's free materials:

    https://vickiboykis.com/what_are_embeddings/

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • lingua-rs

    The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

  • Project mention: I created a program that finds out which anki cards out of 50_000 are in english and deletes them in 2 minutes | /r/rust | 2023-10-23

    Discovery of Lingua: While working on a different project, I discovered the Lingua library.

  • LLM-Finetuning-Toolkit

    Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

  • Project mention: Show HN: Toolkit for LLM Fine-Tuning, Ablating and Testing | news.ycombinator.com | 2024-04-07
  • babyai

    BabyAI platform. A testbed for training agents to understand and execute language commands.

  • lingua

    The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

  • searchGPT

    Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.

  • awesome-sentiment-analysis

    Repository with all what is necessary for sentiment analysis and related areas

  • NLP-conference-compendium

    Compendium of the resources available from top NLP conferences.

  • machine-learning-resources

    A curated list of awesome machine learning frameworks, libraries, courses, books and many more.

  • Project mention: machine-learning-resources: NEW Courses - star count:359.0 | /r/algoprojects | 2023-05-27
  • segment-anything-with-clip

    Segment Anything combined with CLIP

  • LemmInflect

    A python module for English lemmatization and inflection.

  • Multi-Type-TD-TSR

    Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:

  • AI_ChatBot_Python

    AI ChatBot using Python Tensorflow and Natural Language Processing (NLP) along side TFLearn

  • financial-news-dataset

    Reuters and Bloomberg

  • Astock

    Astock

  • semantic-autocomplete

    A blazing-fast semantic search React component. Match by meaning, not just by letters. Search as you type without waiting (no debounce needed). Rank by cosine similarity.

  • Project mention: Show HN: Semantic Search React Component | news.ycombinator.com | 2024-04-14
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

nlp-machine-learning related posts

Index

What are some of the best open-source nlp-machine-learning projects? This list will help you:

Project Stars
1 OpenPrompt 4,152
2 tika-python 1,418
3 contextualized-topic-models 1,163
4 lingua-go 1,095
5 skweak 909
6 Python-ai-assistant 853
7 what_are_embeddings 846
8 lingua-rs 820
9 LLM-Finetuning-Toolkit 669
10 babyai 669
11 lingua 657
12 dr-doc-search 601
13 searchGPT 570
14 awesome-sentiment-analysis 526
15 NLP-conference-compendium 458
16 machine-learning-resources 381
17 segment-anything-with-clip 300
18 LemmInflect 246
19 Multi-Type-TD-TSR 236
20 AI_ChatBot_Python 223
21 financial-news-dataset 211
22 Astock 189
23 semantic-autocomplete 161

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com