Metric | bible-corpus | awesome-hungarian-nlp
---|---|---
Mentions | 1 | 3
Stars | 163 | 208
Growth | - | -
Activity | 7.6 | 3.2
Last commit | 2 months ago | 7 months ago
License | Creative Commons Zero v1.0 Universal | -
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
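The page does not publish the formula behind the activity number. As a rough sketch only, the idea of weighting recent commits more heavily and then ranking projects against each other could look like the following. The function names, the 30-day half-life, and the rank-based mapping to a 0-10 scale are all illustrative assumptions, not the site's actual method.

```python
from bisect import bisect_left


def weighted_commit_score(commit_ages_days, half_life_days=30.0):
    """Sum per-commit weights that halve every `half_life_days`.

    Assumption: exponential decay; the real weighting is not documented.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity(raw_score, all_raw_scores):
    """Map a raw score to a 0-10 scale by relative rank.

    Assumption: the score is 10 times the fraction of tracked projects
    whose raw score is strictly lower than this one.
    """
    ranked = sorted(all_raw_scores)
    below = bisect_left(ranked, raw_score)  # count of strictly lower scores
    return round(10.0 * below / len(ranked), 1)


if __name__ == "__main__":
    # Hypothetical pool of ten tracked projects with raw scores 1..10.
    pool = list(range(1, 11))
    print(activity(10, pool))
    print(weighted_commit_score([0, 30, 60]))
```

Under these assumptions, a project that outranks nine of ten tracked projects gets an activity of 9.0, which matches the "top 10%" reading given above.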
bible-corpus
- Multilingual annotated Bibles dataset
I'm looking for an annotated parallel corpus of Bibles. I've found nice parallel corpora (like https://github.com/christos-c/bible-corpus) but none with labels (named entities if possible, but POS would do too)
awesome-hungarian-nlp
- Szoláris Magyar: Over the past four months I have been working on a morphology-based alternative writing system tailored to the Hungarian language (more info in the comments)
- Language Input: a new web app for finding content to watch in your target language and keep track of your vocabulary
Pity there's no Hungarian. I see spaCy supports it for some things but not the full pipeline. There's a cool NLP resource for Hungarian if you ever feel inclined to support it at some point ;)
- Upcoming App Announcement: Lemmatize, a Foreign Language Reader
Very cool. Glad to see Hungarian on the list too :) There's a pretty great list of NLP-related links for Hungarian here if you haven't seen it before. Could be useful.
What are some alternatives?
bangla-corpus - A curated list of Bangla NLP Corpus
awesome-sentiment-analysis - Repository with everything necessary for sentiment analysis and related areas
301-wwyd-translation - Translation of G Uzaku's mahjong book 301 "Established Practice" Which to cut?
NLP-progress - Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
bibleapi-bibles-json - Bible translations in JSON format
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
proiel-treebank - Official releases of the PROIEL treebank of ancient Indo-European languages
awesome-computational-neuroscience - A list of schools and researchers in computational neuroscience
sematle - NLU service that converts plain English to known and structured data.
contract-discovery - Data and additional information regarding the paper: Contract Discovery. Dataset and a Few-Shot Semantic Retrieval Challenge with Competitive Baselines (to appear in Findings of EMNLP).
umibench - Testbench for sentiment and factuality in texts.
financial-news-dataset - Reuters and Bloomberg