Yacy
goggles-quickstart
Our great sponsors
Yacy | goggles-quickstart | |
---|---|---|
115 | 22 | |
3,253 | 565 | |
2.5% | 3.0% | |
8.7 | 2.8 | |
20 days ago | about 2 months ago | |
Java | ||
GNU General Public License v3.0 or later | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Yacy
- New ways we're tackling spammy, low-quality content on Search
- YaCy, a distributed Web Search Engine, based on a peer-to-peer network
-
New 60% of OpenAI model's responses contain plagiarism
It turns out you can make it all the way to become president of Harvard [1] while ignoring this rule so it is questionable whether it is as set in stone as you make it out to be, at least in certain disciplines.
In a way these models are a perfect mirror of the current academic climate. They plagiarise without remorse, they follow the latest identity-politics diktat to a point and make up 'facts' when needed to reach a desired narrative. Google Gemini is the latest example [2] of where this leads.
Given that it is plausible that models like these will soon be used in educational settings this is a recipe for disaster. The same goes for the trend to replace search engine results with 'interpreted' results in which LLMs take up the same role as Winston in 1984: Winston works in the Ministry of Truth where he alters historical records to fit the needs of the Party.
It is time for a decentralised distributed search engine which limits itself to pure search, something like YaCy [3]. Something to replace Winstonian search engines like Google and Bing (et al.).
[1] https://www.campusreform.org/article/claudine-gay-is-a-dei-h...
[2] https://news.ycombinator.com/item?id=39465255
[3] https://yacy.net/
-
Is Google Getting Worse? A Longitudinal Investigation of SEO Spam in Search [pdf]
> Now I just need some kind of open source search engine to run on it ...
Here you go: https://yacy.net
-
Welcome to mwmbl, the free, open-source and non-profit search engine
I remember https://yacy.net/ but the big problem of this project was java and had not implementations in others languages. I mean it as imagine torrent was only in perl.
-
admarus alternatives - ipfs-search and Yacy
3 projects | 9 Aug 2023
Admarus is similar as Yacy but aims to be distributed where Yacy is federated. Both are made for the web
- Brave Search launches own image and video search
-
Show HN: DiskerNet – Browse the Internet from Your Disk, Now Open Source
You should check out https://yacy.net: a global, P2P web search engine, where each peer can build and share its own index, etc.
-
How do you organize your data?
I also have an instance of Yacy installed, which I use to index the entire system, giving me my own private, internal search engine.
- Ask HN: Best search engine alternatives to Google?
goggles-quickstart
-
LLMs and Programming in the first days of 2024
I see a lot of recommendations for kagi, but no mention of brave search - specifically the (beta) feature called “goggles”. Afaiu it’s a blend of kagi’s “lenses” and the site ranking in search results.
https://search.brave.com/help/goggles There is a list (search) of public goggles: https://search.brave.com/goggles
The goggles itself are just text files with basic syntax and can be hosted on e.g. github gist. (though you have to publish it to brave)
https://github.com/brave/goggles-quickstart/blob/main/goggle...
Tbh, I can’t really compare brave search to kagi, since I never used kagi (though I’m using Orion - webkit based browser from the same dev and love it). Afaik, brave search is using its own index, thus making the results somehow limited and inferior to kagis. Just wanted to throw some (free) alternative here that works for me. :)
* Note that Brave search, despite privacy oriented, is still ad funded and there was few controversies about brave’s (browser) privacy in the past. (if that’s relevant for you)
-
How to block websites from search results in Brave Search?
You are technically in control of it even if it is in Brave's servers, so, you can create, modify it and delete it. https://github.com/brave/goggles-quickstart
-
Reputable UL-focused backpacking blogs and resources?
I'm putting together a list of UL resources to ultimately become indexed and searchable (through Goggles). This index will be public and something that will hopefully be community maintained, but I'll just be getting it started for now.
-
Brave search goggle question
If you have some time, would you like to try to create such a Goggle? There are some instructions and examples as to how to do that here: https://github.com/brave/goggles-quickstart
-
Looking for a website blocker that works with Brave
If you talk about search results, well, Brave Search has goggles, you only need a github account and you can create your own list, so you can only get what you want.
-
Brave Search lets you remove Pinterest results and more
"Goggles are simple, self-contained text files which can be hosted in Github or Gitlab. These files contain instructions allowing you to tell Brave Search how you'd like your results to be ranked. You can target specific URL patterns (and, soon, website titles and other aspects of Web pages) and indicate how their ranking should be altered (e.g. boosted, downranked, or completely discarded from the results)." - https://github.com/brave/goggles-quickstart
Great to see a company protecting and empowering its users. I just tried it out myself. Very easy to implement (e.g., "Amazon-Excluded Search"). Thanks for sharing!
-
I'm really starting to dislike Google
Try Brave goggles: https://github.com/brave/goggles-quickstart
-
YaCy – your own search engine
If you haven't heard of [Brave Goggles](https://github.com/brave/goggles-quickstart) I highly recommend checking it out. Just being able to create the search index is a massive task, so being able to apply rules server-side to their "expanded recall set" will give you what most people building search engines want, which is to control the algorithm. We weren't able to do that until now since applying rules client-side doesn't work well on a small search result set.
Related: I created a tool to create Goggles using subreddits as a signal source for domains: [Narwhalizer](https://github.com/forcesunseen/narwhalizer)
-
Google Search Is Quietly Damaging Democracy | A series of incremental changes over the years has transformed the tool from an explorative search function to one that is ripe for deception
To learn more about Goggles, visit the repository, where you can find the syntax specification, examples, FAQs, and more.
-
Discussion Thread
And most important of all, anyone can create, apply and share their own rules (https://github.com/brave/goggles-quickstart). This means you can effectively implement your own web result ranking. For example you can give priority to websites you consider more trustworthy and remove the ones that have low-quality clickbait.
What are some alternatives?
Searx - Privacy-respecting metasearch engine
grasp - A reliable org-capture browser extension for Chrome/Firefox
MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
YaCyIndexerGreasemonkey - Greasemonkey script to index visited websites with the YaCy P2P search engine.
searxng - SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
LinkAce - LinkAce is a self-hosted archive to collect links of your favorite websites.
Gigablast - Nov 20 2017 -- A distributed open source search engine and spider/crawler written in C/C++ for Linux on Intel/AMD. From gigablast dot com, which has binaries for download. See the README.md file at the very bottom of this page for instructions.
OneTab-Night-Mode - Little theme for the onetab page to make it less eye raping.
Seeks - Seeks is a decentralized p2p websearch and collaborative tool.
promnesia - Another piece of your extended mind
Typesense - Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
duckduckgo-locales - Translation files for <a href="https://duckduckgo.com"> </a>