solr
sist2
solr | sist2 | |
---|---|---|
6 | 18 | |
1,015 | 764 | |
2.7% | - | |
9.8 | 8.5 | |
6 days ago | 9 days ago | |
Java | C | |
Apache License 2.0 | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
solr
- Iniciando no Elasticsearch: Conceitos básicos
-
Swirl: An open-source search engine with LLMs and ChatGPT to provide all the answers you need 🌌
Using the Galaxy UI, knowledge workers can systematically review the best results from all configured services including Apache Solr, ChatGPT, Elastic, OpenSearch, PostgreSQL, Google BigQuery, plus generic HTTP/GET/POST with configurations for premium services like Google's Programmable Search Engine, Miro and Northern Light Research.
-
Looking for software
Apache Solr can be used to index and search text-based documents. It supports a wide range of file formats including PDFs, Microsoft Office documents, and plain text files. https://solr.apache.org/
-
What do you use for site search? Custom built solution? Meilisearch? Algolia?
Solr https://solr.apache.org/
-
'google-like' search engine for files on my NAS
if so, then https://solr.apache.org/ can be a solution, though there's a bit of setup involved. oh yea, you get to write your own "search interface" too which would end up calling solr's api to find stuff.
- An alternative to Elasticsearch that runs on a few MBs of RAM
sist2
-
Better option then filebrowser to share files
Quickly Googling for a docker indexer and search app I turned up Sist2, that on the surface looks like might fit your needs. I don't have an appropriate data store to run it against, so I can't speak to its indexing speed or efficacy. However, the developer does have an accessible demo to try, and the front end at least appears to function well.
-
'google-like' search engine for files on my NAS
I'm also looking for tools like this. You can check out this: https://github.com/simon987/sist2
-
What would you love to see as self hosted service?
Maybe sist2 (https://github.com/simon987/sist2 may fit the bill. It indexes all the metadata and then act as a giant search engine.
- How can I OCR my car manual and make it easy to use in the garage?
- Looking For An App That Will Download Whole Webpages Offline (Specifically Reddit Threads)
-
Seeking a self-hostable search engine for *everything* that I own
I am long user of sist2 from simon987 for full text search of pdf. It indexes everything (file content and metadata) through elasticsearch while providing a nice GUI. https://github.com/simon987/sist2
-
Self hosted web page that indexes all data on a given folder with ability to search? [pi]
I have no experience with this tool, but I recall seeing it in the past. Perhaps it fills your need: https://github.com/simon987/sist2
-
Search engine for local files
sist2 is my primary file indexing / search engine for my SingleFile web archive. Lightweight, blazing fast and tons of customisable options.
-
Docker container with web app for indexing/searching large number of documents
I haven’t tried it in ages but used recoll for local indexing lots of random documents, I found a few repos on GitHub and Docker Hub but nothing super active but may be worth looking at viktor-c/docker-recoll-webui or sist2 is newer and I haven’t used it but may be better maintained at this point
-
Selfhosted File Management Solution? - tags, searching, etc
Having a tool that can scan and index a shared folder would be amazing, and it being accessible from a web browser would also be great, because then I could search from any one of my several devices. The closest thing I have found was sist2. The demo seems to be what I need, but I couldn't seem to get it to run with docker. There's a direct install method, but I haven't tried that yet.
What are some alternatives?
llm-integration - spring-starter, which enables semantic search, backed by OpenAI, by couple of lines
Docspell - Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources with miminal effort.
open-semantic-search - Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
docker-recoll-webui - Recoll with web frontend and pdf-ocr in a docker container
fess - Fess is very powerful and easily deployable Enterprise Search Server.
Typesense - Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
swirl-search - Swirl is an open-source search platform that uses AI to search multiple content and data sources simultaneously and return AI-ranked results. And provides summaries of your answers from searches using LLMs. It's a one-click, easy-to-use Retrieval Augmented Generation (RAG) Solution.
MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
orange - Cross-platform local file search engine.
Ambar - :mag: Ambar: Document Search Engine
LuceneBench - Lucene Benchmark : benchmarking Lucene vs. SeekStorm
Gigablast - Nov 20 2017 -- A distributed open source search engine and spider/crawler written in C/C++ for Linux on Intel/AMD. From gigablast dot com, which has binaries for download. See the README.md file at the very bottom of this page for instructions.