Models for spam detection on short messages with both text and numerical inputs

This page summarizes the projects mentioned and recommended in the original post on /r/LanguageTechnology.

  • StyloMetrix


  • Or look into stylistic features and add them to an XGBoost classifier to enhance the results (https://github.com/ZILiAT-NASK/StyloMetrix). I combined those features with BERT's last hidden state for fake news classification and got my best results so far. Here is the repo if you want some inspiration: https://github.com/MarBry111/Fake-News-Detection-for-Social-Media-Posts-in-Polish-Language. For more ideas on stylistic features, see https://github.com/Hassaan-Elahi/Writing-Styles-Classification-Using-Stylometric-Analysis.

  • Fake-News-Detection-for-statements-in-Polish-Language

    Master's thesis repository for a thesis completed at the MINI faculty of the Warsaw University of Technology.

  • Writing-Styles-Classification-Using-Stylometric-Analysis

    ✍️ An intelligent system that takes a document and classifies different writing styles within the document using stylometric techniques.

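The suggestion above, stylistic features fed to a tree-based classifier alongside other inputs, can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the commenter's code: the five hand-written features are hypothetical stand-ins for what a tool such as StyloMetrix would compute, and `stylistic_features` and `build_row` are names invented for this example. The optional `text_embedding` argument marks where a BERT last-hidden-state vector could be appended.

```python
import re
import string


def stylistic_features(text: str) -> list[float]:
    """Hand-written stylistic cues (hypothetical; not StyloMetrix output)."""
    tokens = re.findall(r"\w+", text)
    n_chars = max(len(text), 1)      # avoid division by zero on empty messages
    n_tokens = max(len(tokens), 1)
    return [
        sum(c.isupper() for c in text) / n_chars,               # share of uppercase characters
        sum(c.isdigit() for c in text) / n_chars,               # share of digits
        sum(c in string.punctuation for c in text) / n_chars,   # share of punctuation
        sum(len(t) for t in tokens) / n_tokens,                 # mean token length
        float(text.count("!")),                                 # number of exclamation marks
    ]


def build_row(text: str, numeric_inputs, text_embedding=()) -> list[float]:
    """Concatenate stylistic, numerical, and optional embedding features into one row."""
    return [*stylistic_features(text), *numeric_inputs, *text_embedding]


# Example: a short message plus two made-up numerical inputs
# (say, number of links and a sender-reputation score).
row = build_row("WIN a FREE prize!!!", [3.0, 0.0])
```

Rows built this way could be stacked into a feature matrix and passed to a gradient-boosted tree model such as XGBoost, which is the combination the comment describes. Tree-based models do not need the features scaled to a common range, which makes mixing ratios, counts, and embedding values in one row fairly forgiving.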

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

