The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 23 Python language-learning Projects
-
the-coding-interview
Programming exercises, code katas and puzzles for your job interview training - or just for fun.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
WordDumb
A calibre plugin that generates Kindle Word Wise and X-Ray files for KFX, AZW3, MOBI and EPUB eBook.
-
languagepod101-scraper
Python scraper for Language Pods such as Japanesepod101.com :japanese_ogre: :japan: :sushi: Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
google-books-ngram-frequency
Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code
-
add-stress-to-epub
A program that sets the stress and the letter ё of Russian text and ebooks using Wiktionary data and grammar analysis.
-
pact
Python GUI tool for language learning: create clips from mp3 files, add transcription via Vosk AI, and export to Anki (by jzohrab)
-
transcript_to_ebook
Transcript to ebook is a tool that will help you get transcript from your favorite Youtube video in various formats.
-
top-open-subtitles-sentences
Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code
-
vocab-to-anki
An interactive app to generate English Vocabulary Flashcards for Anki automatically from an online dictionary.
-
tatoeba-to-anki
Creates Anki Flash cards from Tatoeba sentences, ordering them by difficulty and downloading audio (by Vuizur)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Manual here: https://xxyzz.github.io/WordDumb/
I'm not quite sure if it counts as easy, but this tool here (https://github.com/Destaq/chinese-comprehension) works really well if you're comfortable with a bit of playing around with things on your computer (I found the instructions reasonably clear, but I've done a bit of coding before). This works really well for files, and I use it to estimate the number of unique characters/word in a book as well as the total character count.
I wrote Pact to export short audio clips to Anki, so that interesting clips get played again and again. A good though tough drill is to try to repeat a sentence verbatim, i.e. word-for-word, at the same speed as the speaker.
Project mention: A colloquial (عامیانه) frequency list! Our prayers have been answered. | /r/farsi | 2023-08-03
Project mention: Show HN: Create annotated Anki decks for language learning | news.ycombinator.com | 2023-06-13
Finnish, mostly. I moved to Finland a few years ago and have been making slow but consistent progress along the vectors I care about ever since. I just wish language learning didn't take so much time.
I've built some little tools for the task, which ended up teaching me enough about modern development that I could enter back into software as a career without too much hassle. There were no good Anki frequency list decks, so I made https://ankiweb.net/shared/info/1331009943 and later https://ankiweb.net/shared/info/1149950470 .
These in turn led me to devour a book on the inner workings of SQLite and web dev, because I needed some way to scrape Tatoeba without losing my data every time. Eventually I got good enough to start reading the 'clear Finnish's news, but then I realized YLE.fi didn't seem to have an easy way for me to scrape all previous news articles, so I built https://hiandrewquinn.github.io/selkouutiset-archive/ as an excuse to get a little deeper into Hugo and also learn some stuff about Git modules, systemd timers, doing things on a Raspberry Pi, doing things in GCP...
... And finally today I made the first lurching version prototype of a flashcard generator for that news archive, at https://github.com/Selkouutiset-Archive/selkokortti . I guess I just keep stringing the tools and interests I have together to make bigger and bigger things. Maybe that's all a career/vocation really is at the end of the day.
I've also been learning a lot about QEMU and virtualization. That's mostly for work. I make software that runs on trains.
Python language-learning related posts
- FOSS alternative for language learning?
- Seamless: Meta's New Speech Models
- Show HN: Own Duolingo Without Overengineering
- A colloquial (عامیانه) frequency list! Our prayers have been answered.
- My own Duolingo without overengineering
- Listening Comprehension Practice
- Aktivuloj klopodas savi la kursojn de Duolingo
-
A note from our sponsor - WorkOS
workos.com | 19 Apr 2024
Index
What are some of the best open-source language-learning projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | LibreLingo | 1,870 |
2 | the-coding-interview | 1,673 |
3 | WordDumb | 330 |
4 | vocabsieve | 312 |
5 | movies2anki | 290 |
6 | Game2Text | 168 |
7 | languagepod101-scraper | 145 |
8 | subs2cia | 85 |
9 | flywheel | 65 |
10 | google-books-ngram-frequency | 25 |
11 | yt-to-anki | 20 |
12 | chinese-comprehension | 19 |
13 | add-stress-to-epub | 17 |
14 | pact | 16 |
15 | transcript_to_ebook | 15 |
16 | grzegorz | 12 |
17 | top-open-subtitles-sentences | 12 |
18 | vocab-to-anki | 11 |
19 | PronunciationTrainer | 9 |
20 | Ankinator | 8 |
21 | tatoeba-to-anki | 7 |
22 | talking-decks | 5 |
23 | selkokortti | 4 |