japanese-words-to-vectors
languagepod101-scraper
Metric | japanese-words-to-vectors | languagepod101-scraper
---|---|---
Mentions | 1 | 1
Stars | 83 | 143
Growth | - | -
Activity | 10.0 | 0.0
Last commit | over 2 years ago | 7 months ago
Language | Python | Python
License | - | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
japanese-words-to-vectors
Abstract-Concreteness Value Lexical Data for Japanese
I'm looking for data on how concrete or abstract different lexical items are in Japanese, similar to this data for English. I'm not very well versed in computational linguistics, so even though I've found this word-to-vector model that can create vectors for Japanese words, I'm not sure how to extrapolate abstractness values from the resulting vectors, or whether that's even possible without a predefined abstract-concrete vector like the one shown here.
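One way the idea in this post is sometimes approached is to build an abstract-concrete axis from a handful of seed words and score other words by their cosine similarity to that axis. The following is a minimal sketch under stated assumptions, not the method of japanese-words-to-vectors or of the linked English data: the function names, the seed words, and the 3-dimensional toy vectors are all hypothetical, and real vectors would come from a trained model.

```python
import math

def mean_vector(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]

def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def concreteness_axis(concrete_seeds, abstract_seeds):
    """Direction pointing from the abstract seed mean to the concrete seed mean."""
    concrete_mean = mean_vector(concrete_seeds)
    abstract_mean = mean_vector(abstract_seeds)
    return [c - a for c, a in zip(concrete_mean, abstract_mean)]

def concreteness_score(vector, axis):
    """Score in [-1, 1]: positive leans concrete, negative leans abstract."""
    return cosine(vector, axis)

# Toy vectors invented for illustration only (assumption, not model output).
toy_vectors = {
    "石": [0.9, 0.1, 0.0],    # stone (concrete seed)
    "机": [0.8, 0.2, 0.1],    # desk (concrete seed)
    "自由": [0.1, 0.9, 0.2],  # freedom (abstract seed)
    "愛": [0.0, 0.8, 0.3],    # love (abstract seed)
    "犬": [0.7, 0.3, 0.1],    # dog (word to score)
    "希望": [0.2, 0.7, 0.3],  # hope (word to score)
}

axis = concreteness_axis(
    [toy_vectors["石"], toy_vectors["机"]],
    [toy_vectors["自由"], toy_vectors["愛"]],
)
scores = {word: concreteness_score(toy_vectors[word], axis) for word in ("犬", "希望")}
```

With these toy values, 犬 scores positive and 希望 scores negative. Whether such a projection tracks human concreteness ratings for Japanese would need to be checked against rated data, which is what the post is asking for in the first place.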
languagepod101-scraper
Has anyone been through all five levels/pathways of JapanesePod101.com?
A Python scraper to download everything: https://github.com/nedlir/languagepod101-scraper
What are some alternatives?
Korpora - Korean corpus repository
blinkist-scraper - Python tool to download book summaries and audio from Blinkist.com, and generate some pretty output
open-discourse - Open Discourse is the first fully comprehensive corpus of the plenary proceedings of the federal German Parliament (Bundestag).
Learning-Python - This repo is made for the Learning Python blog course. In this course, all relevant material is provided for the course. For any suggestions, feedback or doubts, feel free to contact me via LinkedIn or Gmail.
tex-course-index-template - A template for writing a condensed course index leveraging LaTeX indexing
JobFunnel - Scrape job websites into a single spreadsheet with no duplicates.
ichiran - Linguistic tools for texts in Japanese language
unihandecode - unihandecode is a transliteration library that converts all characters/words in Unicode into the ASCII alphabet and is aware of language preference priorities
statum - statum, a Twitch streamer-related website. Written in Python + Flask, with MongoDB. Current features include Twitch OAuth integration, personalized dashboard, unique streamer insights & much more.
Amazon-Product-Information-Scraper - This Python web-scraping project retrieves product names, prices, review stars, and review counts for a specific product category.
adaltavoce - Ad alta voce - Podcast non ufficiale
tabroom-API - tournaments.tech's API for scraping tabroom.com