Preprocessing methods besides stop words, regular expressions, lemmatization and stemming for an NLP classification problem

This page summarizes the projects mentioned and recommended in the original post on /r/MLQuestions

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • TextAttack

    TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/

  • Could have a look at what's available in the augmentor here https://github.com/QData/TextAttack. I'm not experienced with NLP so I may be way off here

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • TextAttack VS OpenAttack - a user suggested alternative

    2 projects | 6 Jul 2022
  • Show HN: Next-token prediction in JavaScript – build fast LLMs from scratch

    11 projects | news.ycombinator.com | 10 Apr 2024
  • DataDreamer

    1 project | news.ycombinator.com | 11 Feb 2024
  • A Curated List of Free ML/ DL YouTube Courses

    1 project | news.ycombinator.com | 28 Jan 2024
  • Sorry if this is a dumb question but is the main idea behind LLMs to output text based on user input?

    2 projects | /r/LocalLLaMA | 11 Dec 2023