Searchmysite.net Alternatives
Similar projects and alternatives to searchmysite.net
-
searx
Discontinued Privacy-respecting metasearch engine [Moved to: https://github.com/searx/searx] (by asciimoo)
-
marqo
Discontinued Tensor search for humans. [Moved to: https://github.com/marqo-ai/marqo] (by S2Search)
-
swirl-search
Swirl is an open-source search platform that uses AI to search multiple content and data sources simultaneously and return AI-ranked results. It also uses LLMs to summarize the answers found in search results. It's a one-click, easy-to-use Retrieval Augmented Generation (RAG) solution.
searchmysite.net reviews and mentions
-
Almost all searches on my independent search engine are now from SEO spam bots
Thanks V. I'm seeing a similar number of problem search requests (although nowhere near as many real search requests:-), so it is probably the same "SEO practitioners" running the same "scraping footprints" against different search engines around the same time.
I was kind of hoping that somewhere in this discussion there would be an "And the answer to your problem is...", but I suppose it is a very specific problem which only a search engine would encounter. I think the Cloudflare solution you have is probably the best way to block the requests as early as possible. The reverse proxy config[0] I've got seems to be mostly holding out for now though.
[0] https://github.com/searchmysite/searchmysite.net/issues/55
-
http://searchmysite.net - a search engine for just the good bits of the internet
Someone on r/datahoarder has a bulk downloader for reddit which you might be able to use for initially populating a search index, and there is an API which you might be able to use to keep the index up-to-date, so it may be technically possible, assuming you don't hit API rate limits etc. It would be a big piece of work though, and you would be trying to compete in your spare time against a team of full-time paid reddit search engineers, while having the disadvantage of not being able to connect to the reddit database directly. I can't find how big their search team is at the moment, but according to https://www.reddit.com/r/blog/comments/mqcpcg/you_want_a_better_reddit_search_ok_were_on_it/ they are looking to double it this year. So, while I don't want to sound defeatist, I'm not sure it is something I'll be looking to take on personally as a priority at the moment. But thanks for your suggestion. searchmysite.net is of course open source (https://github.com/searchmysite/searchmysite.net) with pluggable custom indexers, so you would be most welcome to give it a go yourself.
Stats
searchmysite/searchmysite.net is an open source project licensed under the GNU Affero General Public License v3.0, which is an OSI-approved license.
The primary programming language of searchmysite.net is Python.