fastText_multilingual vs hate-speech-and-offensive-language

fastText_multilingual

Multilingual word vectors in 78 languages (by babylonhealth)

hate-speech-and-offensive-language

Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017 (by t-davidson)

hatespeech offensive NLP icwsm Twitter abuse offensive-language hate-speech Natural Language Processing Dataset labeled-data Classifier Machine Learning computational-social-science

Source Code

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

fastText_multilingual		hate-speech-and-offensive-language
	Project
1	Mentions	2
1,186	Stars	755
0.0%	Growth	-
0.0	Activity	1.9
about 1 year ago	Latest Commit	11 months ago
Jupyter Notebook	Language	Jupyter Notebook
BSD 3-clause "New" or "Revised" License	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

fastText_multilingual

Posts with mentions or reviews of fastText_multilingual. We have used some of these posts to build our list of alternatives and similar projects.

Ask HN: What's the coolest non standard application of LLMs you've seen?
1 project | news.ycombinator.com | 23 Dec 2023

(6 years ago)
Aligning the fastText vectors of 78 languages
https://github.com/babylonhealth/fastText_multilingual/blob/...

hate-speech-and-offensive-language

Posts with mentions or reviews of hate-speech-and-offensive-language. We have used some of these posts to build our list of alternatives and similar projects.

How to make a class column for a classifier from sentiment analysis results?
1 project | /r/learnpython | 24 Jan 2022

I've used NRCLex to perform sentiment analysis on some Twitter data. I have hate speech classifier code (https://github.com/t-davidson/hate-speech-and-offensive-language/blob/master/classifier/final_classifier.ipynb) I want to pass the dataset through, but before I can I need to have a "class" column for the model. For those not familiar, NRCLex returns scores for 10 emotions: anticipation, joy, anger, fear, surprise, disgust, positive, negative, sadness and trust. The table looks like this (letters denoting emotions):
Where do we go from here and who is going to step up to help us?
1 project | news.ycombinator.com | 28 Jan 2021

Some of this exists, and both Quora and Facebook (among others) use it extensively. Both hate speech and porn are good targets for machine learning. It needs supervision, but it can take a lot of load off human moderators.
Open source implementations exist, e.g.:
https://github.com/t-davidson/hate-speech-and-offensive-lang...
I suspect more message board will want to start applying these sooner rather than later. Most have already figured out that they need anti-spam tools, rather than it coming as a surprise when they roll things out and it fills up with bots. The technology is similar.
You mention being able to share that information across boards, and I don't know of any widespread implementation of that. You can, at least, let somebody else handle your authentication, which slightly slows their ability to create new accounts when you blacklist one. I'd like to see those sites distinguish "aged" accounts, so that it at least takes some effort or cost to use a new account.

What are some alternatives?

When comparing fastText_multilingual and hate-speech-and-offensive-language you can also consider the following projects:

toxicity - The world's largest social media toxicity dataset.

cia - 🐱‍💻 CIA Factbook data analysis and dataset reconstruction, modification, and tuning go here.

Tegridy-MIDI-Dataset - Tegridy MIDI Dataset for precise and effective Music AI models creation.

ThoughtSource - A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/

airline-sentiment-streaming - Streaming with Airline Sentiment. Utilizing Cloudera Machine Learning, Apache NiFi, Apache Hue, Apache Impala, Apache Kudu

100daysofpractice-dataset - Data from Instagram posts with the hashtag #100daysofpractice.

hashformers - Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).

PLOD-AbbreviationDetection - This repository contains the PLOD Dataset for Abbreviation Detection released with our LREC 2022 publication

bubo-2t - Bubo-2T is a Steampunk companion robot that can recognise hand gestures and tweet out messages

investigation-youtube-ad-placements - Data and code from our stories, "Google Has a Secret Blocklist that Hides YouTube Hate Videos from Advertisers—But It’s Full of Holes," and "Google Blocks Advertisers from Targeting Black Lives Matter YouTube Videos."

Compare fastText_multilingual vs hate-speech-and-offensive-language and see what are their differences.

fastText_multilingual

hate-speech-and-offensive-language

fastText_multilingual

hate-speech-and-offensive-language

What are some alternatives?