corpora
english-words
corpora | english-words | |
---|---|---|
7 | 84 | |
4,851 | 10,070 | |
- | 0.9% | |
5.5 | 0.0 | |
3 months ago | 28 days ago | |
JavaScript | Python | |
- | The Unlicense |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
corpora
- Corpora: A collection of small corpuses of interesting data
-
How can you database hundreds or thousands of items for a trading game like Pirates. Only needed info: name,$$$ and a general type inferred by the list. I'm thinking .csv
Check if this or some other word lists in the dataset is useful https://github.com/dariusk/corpora/blob/master/data/objects/objects.json
-
Obtaining a Word List
This might work. https://github.com/dariusk/corpora/blob/master/data/words/word_clues/clues_five.json
-
Procedural Text Generation?
corpora
- A lot of adjectives, but not necessarily every adjective in the English language
-
Part 1: How to Build a Serverless Twitter Bot
I picked one from Darius because he also keeps a GitHub repository of a lot of corpora that a ton of bot makers pull from. You can find at https://github.com/dariusk/corpora.
-
Finding lists of words and other resources for text generation
Check out Corpora. Lots of lists in various categories, and you can't get friendlier than the CC0 license!
english-words
- The longest word you can type on the first row of a QWERTY keyboard
-
Is there an English based word where the letter J is followed by a consonant?
From this word list, there are 88 "words" containing a J followed by a consonant. The only ones in any kind of common use (that aren't abbreviations or something like that) are from Arabic.
-
Is there a create that provides a dictionary of words?
What you're looking for is not a crate but data. You can search for a list of all words in English (or your language of choice), such as this, but for a game, you probably want only the most common ones.
-
Need help importing the entire English dictionary in an iterable format. No definitions, just words.
You can find all the English words here.
-
What is sleep paralysis? And Astral projection if you are just your physical body?
If I were to put a 30-digit integer number, and 5 random words from https://github.com/dwyl/english-words (the dictionary files, not the webpage), would you be able to tell me what they were? How much lead-up time would that take? (I'll find a location where nobody else could see the paper, and would make it so that after my publishing the location and time, nobody would have a reasonable chance of getting there.)
-
Getting an English dictionary
You can just do a join with any text file containing English words. For example a quick search shows this.
-
Most common English words containing every possible pair of letters [OC]
List of every English word: https://github.com/dwyl/english-words
- Obtaining a Word List
- Re-building Spelling Bee for fun: what dictionary should I use
-
Need help in making "crossword puzzle" for assignment in C++
Well, that seems like a really shitty task then, if you were not even given a limited list of valid words to search for. English has many thousands of different words and it might be difficult to find a complete list somewhere. Maybe https://github.com/dwyl/english-words
What are some alternatives?
atto - The new BASIC computer that runs in your browser!
google-10000-english - This repo contains a list of the 10,000 most common English words in order of frequency, as determined by n-gram frequency analysis of the Google's Trillion Word Corpus.
Eigengrau-s-Essential-Establishment-Generator - A town generator that is suitable for out of the box play in any fantasy TTRPG setting.
SecLists - SecLists is the security tester's companion. It's a collection of multiple types of lists used during security assessments, collected in one place. List types include usernames, passwords, URLs, sensitive data patterns, fuzzing payloads, web shells, and many more.
tf2-botcheck - App that interacts with TF2 to detect known named bots and name-stealing bots in Casual.
Adj-Noun-Wordlist-Generator - Outputs combinations of adjectives, nouns and digits.
Korpora - Korean corpus repository
Removeddit - View deleted stuff from reddit
empirist-corpus - A web and social media corpus based on the dataset of the EmpiriST 2015 shared task
toybox
pluralize - Pluralize or singularize any word based on a count
data-police-shootings - The Washington Post is compiling a database of every fatal shooting in the United States by a police officer in the line of duty since 2015.