reddit_export_userdata
sist2
reddit_export_userdata | sist2 | |
---|---|---|
4 | 18 | |
12 | 764 | |
- | - | |
10.0 | 8.5 | |
over 3 years ago | 10 days ago | |
Python | C | |
- | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
reddit_export_userdata
-
Looking For An App That Will Download Whole Webpages Offline (Specifically Reddit Threads)
You can use cron script to run a regular export of your Reddit saves: https://github.com/dbeley/reddit_export_userdata
- What are Your favorite tools to backup reddit data? (Text Posts, Media Content, Comments..)
-
What are the best programs to batch convert URLs or HTML files to PDFs?
Here's the script: https://github.com/dbeley/reddit_export_userdata
- Save your Reddit Data (saves, etc.)
sist2
-
Better option then filebrowser to share files
Quickly Googling for a docker indexer and search app I turned up Sist2, that on the surface looks like might fit your needs. I don't have an appropriate data store to run it against, so I can't speak to its indexing speed or efficacy. However, the developer does have an accessible demo to try, and the front end at least appears to function well.
-
'google-like' search engine for files on my NAS
I'm also looking for tools like this. You can check out this: https://github.com/simon987/sist2
-
What would you love to see as self hosted service?
Maybe sist2 (https://github.com/simon987/sist2 may fit the bill. It indexes all the metadata and then act as a giant search engine.
- How can I OCR my car manual and make it easy to use in the garage?
- Looking For An App That Will Download Whole Webpages Offline (Specifically Reddit Threads)
-
Seeking a self-hostable search engine for *everything* that I own
I am long user of sist2 from simon987 for full text search of pdf. It indexes everything (file content and metadata) through elasticsearch while providing a nice GUI. https://github.com/simon987/sist2
-
Self hosted web page that indexes all data on a given folder with ability to search? [pi]
I have no experience with this tool, but I recall seeing it in the past. Perhaps it fills your need: https://github.com/simon987/sist2
-
Search engine for local files
sist2 is my primary file indexing / search engine for my SingleFile web archive. Lightweight, blazing fast and tons of customisable options.
-
Docker container with web app for indexing/searching large number of documents
I haven’t tried it in ages but used recoll for local indexing lots of random documents, I found a few repos on GitHub and Docker Hub but nothing super active but may be worth looking at viktor-c/docker-recoll-webui or sist2 is newer and I haven’t used it but may be better maintained at this point
-
Selfhosted File Management Solution? - tags, searching, etc
Having a tool that can scan and index a shared folder would be amazing, and it being accessible from a web browser would also be great, because then I could search from any one of my several devices. The closest thing I have found was sist2. The demo seems to be what I need, but I couldn't seem to get it to run with docker. There's a direct install method, but I haven't tried that yet.
What are some alternatives?
eternity - bypass Reddit's 1000-item listing limits by externally storing your Reddit items (saved, created, upvoted, downvoted, hidden) in your own database
Docspell - Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources with miminal effort.
redditSavedDownloader - Script to export your saved submissions and comments
docker-recoll-webui - Recoll with web frontend and pdf-ocr in a docker container
reddit-html-archiver - archive reddit data as offline friendly web pages
Typesense - Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
ripme - Downloads albums in bulk
MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
single-file-cli - CLI tool for saving a faithful copy of a complete web page in a single HTML file (based on SingleFile)
Ambar - :mag: Ambar: Document Search Engine
bulk-downloader-for-reddit - Downloads and archives content from reddit
Gigablast - Nov 20 2017 -- A distributed open source search engine and spider/crawler written in C/C++ for Linux on Intel/AMD. From gigablast dot com, which has binaries for download. See the README.md file at the very bottom of this page for instructions.