empirist-corpus
corpora
empirist-corpus | corpora | |
---|---|---|
1 | 7 | |
2 | 4,851 | |
- | - | |
0.0 | 5.5 | |
about 2 years ago | 3 months ago | |
Perl | JavaScript | |
Creative Commons Attribution Share Alike 4.0 | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
empirist-corpus
-
German POS Corpus for Commercial use
A small, manually annotated CMC corpus: https://github.com/fau-klue/empirist-corpus
corpora
- Corpora: A collection of small corpuses of interesting data
-
How can you database hundreds or thousands of items for a trading game like Pirates. Only needed info: name,$$$ and a general type inferred by the list. I'm thinking .csv
Check if this or some other word lists in the dataset is useful https://github.com/dariusk/corpora/blob/master/data/objects/objects.json
-
Obtaining a Word List
This might work. https://github.com/dariusk/corpora/blob/master/data/words/word_clues/clues_five.json
-
Procedural Text Generation?
corpora
- A lot of adjectives, but not necessarily every adjective in the English language
-
Part 1: How to Build a Serverless Twitter Bot
I picked one from Darius because he also keeps a GitHub repository of a lot of corpora that a ton of bot makers pull from. You can find at https://github.com/dariusk/corpora.
-
Finding lists of words and other resources for text generation
Check out Corpora. Lots of lists in various categories, and you can't get friendlier than the CC0 license!
What are some alternatives?
flair - A very simple framework for state-of-the-art Natural Language Processing (NLP)
atto - The new BASIC computer that runs in your browser!
quanteda - An R package for the Quantitative Analysis of Textual Data
Eigengrau-s-Essential-Establishment-Generator - A town generator that is suitable for out of the box play in any fantasy TTRPG setting.
tf2-botcheck - App that interacts with TF2 to detect known named bots and name-stealing bots in Casual.
Korpora - Korean corpus repository
pluralize - Pluralize or singularize any word based on a count
KahootBot - A generator for Kahoot bots
badwords - A javascript filter for badwords
speakers - Speaker count for 450+ languages
english-words - :memo: A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion
twit - Twitter API Client for node (REST & Streaming API)