-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Thanks. Here's the notebook of the analysis. Script to scrape the comments is also there in the repo.
I used PushShift API to retrieve around 3 million comments from this subreddit; from April 2018 to May 2022 sampled every 4 days. The subreddit has around 12-15 million comments in total, but I don't have the computer or internet to carry out that level of analysis. Even after (randomly) narrowing down the dataset to 1 million comments, some of the metrics took the program around 10 hours to process, so this is a lengthy process for a layman with an average computer to carry out.
Error frequency. The number of grammatical errors per number of words in a comment. Usually "better" comments have less errors. I used the LanguageTool API.
Related posts
-
Ask HN: Grammarly Alternatives?
-
Recent ECE Masters grad looking to change careers from IT to RF engineering
-
Hey guys! I have my first draft here as a first-year computer engineering student. I'm preparing for an internship fair and I'd like to have something decent. Roast me!!
-
Top 3 Free Grammar Checkers for Flawless Writing
-
Существует какое-нибудь приложение похожее на Grammarly или Writeandimprove, но для русского языка?