sist2
Yacy
Our great sponsors
sist2 | Yacy | |
---|---|---|
18 | 115 | |
759 | 3,253 | |
- | 2.8% | |
8.2 | 8.7 | |
24 days ago | 22 days ago | |
C | Java | |
GNU General Public License v3.0 only | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sist2
-
Better option then filebrowser to share files
Quickly Googling for a docker indexer and search app I turned up Sist2, that on the surface looks like might fit your needs. I don't have an appropriate data store to run it against, so I can't speak to its indexing speed or efficacy. However, the developer does have an accessible demo to try, and the front end at least appears to function well.
-
'google-like' search engine for files on my NAS
I'm also looking for tools like this. You can check out this: https://github.com/simon987/sist2
-
What would you love to see as self hosted service?
Maybe sist2 (https://github.com/simon987/sist2 may fit the bill. It indexes all the metadata and then act as a giant search engine.
- How can I OCR my car manual and make it easy to use in the garage?
- Looking For An App That Will Download Whole Webpages Offline (Specifically Reddit Threads)
-
Seeking a self-hostable search engine for *everything* that I own
I am long user of sist2 from simon987 for full text search of pdf. It indexes everything (file content and metadata) through elasticsearch while providing a nice GUI. https://github.com/simon987/sist2
-
Self hosted web page that indexes all data on a given folder with ability to search? [pi]
I have no experience with this tool, but I recall seeing it in the past. Perhaps it fills your need: https://github.com/simon987/sist2
-
Search engine for local files
sist2 is my primary file indexing / search engine for my SingleFile web archive. Lightweight, blazing fast and tons of customisable options.
-
Docker container with web app for indexing/searching large number of documents
I haven’t tried it in ages but used recoll for local indexing lots of random documents, I found a few repos on GitHub and Docker Hub but nothing super active but may be worth looking at viktor-c/docker-recoll-webui or sist2 is newer and I haven’t used it but may be better maintained at this point
-
Selfhosted File Management Solution? - tags, searching, etc
Having a tool that can scan and index a shared folder would be amazing, and it being accessible from a web browser would also be great, because then I could search from any one of my several devices. The closest thing I have found was sist2. The demo seems to be what I need, but I couldn't seem to get it to run with docker. There's a direct install method, but I haven't tried that yet.
Yacy
- New ways we're tackling spammy, low-quality content on Search
- YaCy, a distributed Web Search Engine, based on a peer-to-peer network
-
New 60% of OpenAI model's responses contain plagiarism
It turns out you can make it all the way to become president of Harvard [1] while ignoring this rule so it is questionable whether it is as set in stone as you make it out to be, at least in certain disciplines.
In a way these models are a perfect mirror of the current academic climate. They plagiarise without remorse, they follow the latest identity-politics diktat to a point and make up 'facts' when needed to reach a desired narrative. Google Gemini is the latest example [2] of where this leads.
Given that it is plausible that models like these will soon be used in educational settings this is a recipe for disaster. The same goes for the trend to replace search engine results with 'interpreted' results in which LLMs take up the same role as Winston in 1984: Winston works in the Ministry of Truth where he alters historical records to fit the needs of the Party.
It is time for a decentralised distributed search engine which limits itself to pure search, something like YaCy [3]. Something to replace Winstonian search engines like Google and Bing (et al.).
[1] https://www.campusreform.org/article/claudine-gay-is-a-dei-h...
[2] https://news.ycombinator.com/item?id=39465255
[3] https://yacy.net/
-
Is Google Getting Worse? A Longitudinal Investigation of SEO Spam in Search [pdf]
> Now I just need some kind of open source search engine to run on it ...
Here you go: https://yacy.net
-
Welcome to mwmbl, the free, open-source and non-profit search engine
I remember https://yacy.net/ but the big problem of this project was java and had not implementations in others languages. I mean it as imagine torrent was only in perl.
-
admarus alternatives - ipfs-search and Yacy
3 projects | 9 Aug 2023
Admarus is similar as Yacy but aims to be distributed where Yacy is federated. Both are made for the web
- Brave Search launches own image and video search
-
Show HN: DiskerNet – Browse the Internet from Your Disk, Now Open Source
You should check out https://yacy.net: a global, P2P web search engine, where each peer can build and share its own index, etc.
-
How do you organize your data?
I also have an instance of Yacy installed, which I use to index the entire system, giving me my own private, internal search engine.
- Ask HN: Best search engine alternatives to Google?
What are some alternatives?
Docspell - Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources with miminal effort.
Searx - Privacy-respecting metasearch engine
docker-recoll-webui - Recoll with web frontend and pdf-ocr in a docker container
MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
Typesense - Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
searxng - SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
Gigablast - Nov 20 2017 -- A distributed open source search engine and spider/crawler written in C/C++ for Linux on Intel/AMD. From gigablast dot com, which has binaries for download. See the README.md file at the very bottom of this page for instructions.
Ambar - :mag: Ambar: Document Search Engine
Seeks - Seeks is a decentralized p2p websearch and collaborative tool.