empirist-corpus
A web and social media corpus based on the dataset of the EmpiriST 2015 shared task (by fau-klue)
quanteda
An R package for the Quantitative Analysis of Textual Data (by quanteda)
Metric | empirist-corpus | quanteda
---|---|---
Mentions | 1 | 5
Stars | 2 | 824
Growth | - | 0.4%
Activity | 0.0 | 9.7
Last commit | about 2 years ago | 8 days ago
Language | Perl | R
License | Creative Commons Attribution Share Alike 4.0 | GNU General Public License v3.0 only
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
empirist-corpus
Posts with mentions or reviews of empirist-corpus.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-10-05.
- German POS Corpus for Commercial use
  A small, manually annotated CMC corpus: https://github.com/fau-klue/empirist-corpus
quanteda
Posts with mentions or reviews of quanteda.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-07-05.
- Error: could not find function "textstat_frequency"
  > library(quanteda) Package version: 3.2.3 Unicode version: 14.0 ICU version: 70.1 Parallel computing: 8 of 8 threads used. See https://quanteda.io for tutorials and examples.
- finding out comment similarity in R?
  Suggest you look into https://quanteda.io/
- Ideas for thesis using R to analyse public media's opinion on topic
- best text mining packages?
  Quanteda is probably worth checking out.
- Natural language processing in R
What are some alternatives?
When comparing empirist-corpus and quanteda, you can also consider the following projects:
flair - A very simple framework for state-of-the-art Natural Language Processing (NLP)
dplyr - A grammar of data manipulation
corpora - A collection of small corpuses of interesting data for the creation of bots and similar stuff.
BTM - Biterm Topic Modelling for Short Text with R
wesanderson - A Wes Anderson color palette for R
tidytext - Text mining using tidy tools
ggplot2 - An implementation of the Grammar of Graphics in R
rmarkdown - Dynamic Documents for R