recoll
tatoeba2
Our great sponsors
recoll | tatoeba2 | |
---|---|---|
1 | 46 | |
6 | 659 | |
- | 3.0% | |
0.0 | 0.0 | |
over 3 years ago | 11 days ago | |
Dockerfile | PHP | |
- | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
recoll
-
Show HN: Full text search Project Gutenberg (60m paragraphs)
This is really cool. Something like this should exist.
It seems like you could do it more easily, and have faster search responses, with the following steps:
1. Mirror the current gutenberg archive (e.g. rsync -av --del aleph.gutenberg.org::gutenberg gutenberg
2. Install recoll-webui from https://www.lesbonscomptes.com/recoll/pages/recoll-webui-ins... or using docker-recoll-webui: https://github.com/sunde41/recoll
tatoeba2
-
Best vocab (not writing) app
I use both. I make a lot of my own cards so I get to focus on the vocab I want. Generally find a word I want to learn, use https://forvo.com/ to find native audio for it, then use https://tatoeba.org/ to find sentences use that word. Once you get a bit of practise it's pretty quick to make a word note, then make 2 or 3 sentence notes for it*. However I do use some pre-made decks like this set of sentence decks for each HSK level with native audio: https://ankiweb.net/shared/byauthor/933449107
- How do I get audio data from from native speakers for Anki?
-
anyone know a site like Reverso but for simpler sentences?
As someone else suggested, Tatoeba is also a good option. Nowadays, I use it less and less because I prefer the more didactic sentences found on online dictionaries. Nonetheless, it's still very good, especially due to the sheer quantity of sentences you can find there.
Have you taken a look at Tatoeba? The sentences are generally simpler than on Reverso. Plus, you can create lists that can be exported to apps like Anki.
-
Cantonese vocabulary visualization and example sentences
To build the tool, I analyzed sentences in the HKCanCor corpus and from Tatoeba to find the most common words. I then created a graph structure where each character is a node and the words are edges. The definitions came from CC-Canto and CEDICT.
-
Where to find examples of phrases or sentence structures?
Hidden in the little vertical dot menu is a green "show example sentences in Tatoeba". You can of course also just go to Tatoeba and search for phrases too: https://tatoeba.org/
Not sure this answers 100%, but there is a sentences database (also providing translations in various languages): https://tatoeba.org/
-
How did your Anki vocabulary memorization pan out when you finally went to a foreign country?
For now I've just been doing it manually - however https://tatoeba.org does have a handy set of pre-compile zip of all their sentences you can mess with. Checkout this link: https://tatoeba.org/en/downloads
-
[A2/B1] Clozemaster.com is quite good for Finnish, and will get you into especially good shape to read Finnish subtitles, IMO.
The database they base the sentences from is https://tatoeba.org/. They do quality some checks, but I have no idea how many and how often.
-
Hey Reddit. What is the coolest website you’ve visited that might not be known to everyone?
Tatoeba - Open collaborative multilingual sentence dictionary; instead of translations of individual words, it's a corpus of full sentences translated between many languages
What are some alternatives?
gutensearch - Search engine for Project Gutenberg books
rum - RUM access method - inverted index with additional information in posting lists
rum - Simple, decomplected, isomorphic HTML UI library for Clojure and ClojureScript
river-runner - Uses USGS/MERIT Basin data to visualize the path of a rain droplet to its endpoint.
FrequencyWords - Repository for Frequency Word List Generator and processed files