Our great sponsors
-
MarginaliaSearch
Internet search engine for text-oriented websites. Indexing the small, old and weird web.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
There have been a few attempts at a crowdsourced-rank search engine (which is similar to what you're suggesting - people indexing the content), but it seems to be a hard cookie, most of the examples of similar ideas I could find on ProductHunt or ShowHN seem dead:
https://payperrun.com/%3E/search?displayParams={%22q%22:%22c...
(btw, I just launched this llm-embedding based search service that lets you check if a startup idea has already been tried/failed).
I don't know if this idea has a higher death rate than the baseline, but my guess is Google/PageRank is good enough for most use-cases, and then if you want quality sources, you can just follow them on YouTube, Twitter, Instagram, etc. Wait, maybe I shouldn't try to compete with Google?
> A better resource today would have to start with some radical choice such as whitelisting, if only to reduce the head-end costs of ingesting material.
> It's tempting to imagine some rules like: no ads, no popups of any kind, government mandated or not, especially no cookie banners, no paywall, but even sites like Wikipedia fail at those criteria today.
This sounds like the approach that the Marginalia (https://search.marginalia.nu/) search engine is taking. My understanding is that its algorithm favors text-heavy sites. And additions to its index are done via GitHub Pull Request so it's effectively using an approve-list (whitelist).
Related posts
- Where Have All the Websites Gone?
- Show HN: Sort and Filter Ask HN Who's Hiring by LLM-Embedding Proximity
- Ask HN: What do you regret doing or not doing in your 30s?
- My thoughts on starting an online business as someone who's never done it before
- Show HN: An Embedding-Based Search Service over ShowHN, AskHN, GitHub, More