New to NLP: Various questions

This page summarizes the projects mentioned and recommended in the original post on /r/LanguageTechnology.

  • spaCy

    💫 Industrial-strength Natural Language Processing (NLP) in Python

  • Stemming removes suffixes, and the result is not necessarily a word. Lemmatization returns the base word (the lemma) from which a form is derived. spaCy doesn't even have stemming, as its creator doesn't consider it useful.

  • huggingface_hub

    The official Python client for the Huggingface Hub.

  • Bag of words (BoW) is quite old and word2vec is oldish (2013), but both can still perform well enough, depending on what you want to do. FastText is also "old" (2016), yet it's a very good baseline, and it's kinda based on a bag-of-words model and the skip-gram model that was introduced with the word2vec paper. The hot new thing is Hugging Face and the BERT family of models, but they are orders of magnitude more resource-intensive.

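
The comments above on stemming, lemmatization, and bag of words can be illustrated with a toy sketch in plain Python. Everything here is an assumption made for illustration: the suffix list, the four-entry lemma dictionary, and the function names `toy_stem`, `toy_lemmatize`, and `bag_of_words` are hypothetical and are not how spaCy or any other library implements these steps.

```python
from collections import Counter

# Hypothetical toy data: a real lemmatizer relies on a full vocabulary and
# part-of-speech information, not a four-entry dictionary.
SUFFIXES = ("ies", "ing", "ed", "s")
LEMMAS = {"studies": "study", "studying": "study", "ran": "run", "mice": "mouse"}


def toy_stem(word: str) -> str:
    """Strip the first matching suffix; the result need not be a real word."""
    for suffix in SUFFIXES:
        # Keep at least three characters so very short words are left alone.
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


def toy_lemmatize(word: str) -> str:
    """Look the word up; fall back to the word itself if it is unknown."""
    return LEMMAS.get(word, word)


def bag_of_words(text: str) -> Counter:
    """Count lemmatized tokens, ignoring word order entirely."""
    tokens = text.lower().split()
    return Counter(toy_lemmatize(token) for token in tokens)
```

With these toy rules, `toy_stem("studies")` gives `"stud"`, which is not a word, while `toy_lemmatize("studies")` gives `"study"`. An irregular form such as `"ran"` is left untouched by the stemmer but mapped to `"run"` by the lookup. `bag_of_words("Mice ran and mice studies")` counts `"mouse"` twice and discards the order of the words, which is the core idea behind BoW features.
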
