Korpora Alternatives
Similar projects and alternatives to Korpora
-
korean-word-ipa-dictionary
Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
-
japanese-words-to-vectors
Word2vec (word to vectors) approach for Japanese language using Gensim and Mecab.
-
corpora
A collection of small corpuses of interesting data for the creation of bots and similar stuff.
-
open-discourse
Open Discourse is the first fully comprehensive corpus of the plenary proceedings of the federal German Parliament (Bundestag).
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Korpora reviews and mentions
-
Resources About Cross-Linguistic Relative Phoneme Frequency
- LDA (usually very expensive, but some options exist and in some cases you can google them to find them elsewhere for free): https://www.ldc.upenn.edu/ - Connecting with a university or looking at a linguistics lab's corpus holdings (some will host -- or freely acquired the corpus and therefore you can find it on the internet) - Some language-specific lists or collections: e.g. https://warwick.ac.uk/fac/soc/al/repository/staff/harrisontilly/corpora-for-workshop/, https://github.com/ko-nlp/Korpora , https://guides.uflib.ufl.edu/frenchlinguistics/corpora - Some larger overviews, which may contain links: e.g. https://www.clarin.eu/resource-families/corpora-academic-texts , https://libguides.reed.edu/linguistics/datasets-corpora - Some larger projects to create (often text-based) corpora for multiple languages (often for NLP): e.g. https://www.sketchengine.eu/documentation/tenten-corpora/
Stats
ko-nlp/Korpora is an open source project licensed under Creative Commons Attribution 4.0 which is not an OSI approved license.
The primary programming language of Korpora is Python.
Popular Comparisons
Sponsored