gutensearch
recoll
Our great sponsors
gutensearch | recoll | |
---|---|---|
1 | 1 | |
6 | 6 | |
- | - | |
0.0 | 0.0 | |
about 3 years ago | over 3 years ago | |
Python | Dockerfile | |
- | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
gutensearch
-
Show HN: Full text search Project Gutenberg (60m paragraphs)
Thanks! I had the exact same problem and eventually it got me to do something about it. It is particularly bad with writers from antiquity or with a lot of popular appeal.
I've begun adding to this repository, it'll come in piece by piece as I clean up the code: https://github.com/cordb/gutensearch
recoll
-
Show HN: Full text search Project Gutenberg (60m paragraphs)
This is really cool. Something like this should exist.
It seems like you could do it more easily, and have faster search responses, with the following steps:
1. Mirror the current gutenberg archive (e.g. rsync -av --del aleph.gutenberg.org::gutenberg gutenberg
2. Install recoll-webui from https://www.lesbonscomptes.com/recoll/pages/recoll-webui-ins... or using docker-recoll-webui: https://github.com/sunde41/recoll
What are some alternatives?
tatoeba2 - Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.
rum - RUM access method - inverted index with additional information in posting lists
react-virtualized - React components for efficiently rendering large lists and tabular data
rum - Simple, decomplected, isomorphic HTML UI library for Clojure and ClojureScript