Paul Graham's Twitter thread on search engines and SEO spam

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  • loda-rust

    Web editor for the LODA language. Also includes my experiments with the Abstraction and Reasoning Corpus (ARC).

  • https://github.com/loda-lang/loda-rust/blob/develop/script/t...

    Example of the 100 most similar documents:

  • loda-identify-similar-programs

    (Discontinued) Measure how similar LODA programs are

  • https://github.com/neoneye/loda-identify-similar-programs/bl...

    LSH can produce false positives, so follow it with a more in-depth comparison.

  • Yacy

    Distributed Peer-to-Peer Web Search Engine and Intranet Search Appliance

  • > Creating a "trustless" search crawler, where anybody can participate, and then applying an algorithm to determine trust or value feels like it'd be a never-ending arms race - that'd require AI and extensive/expensive resources

    Not necessarily: https://yacy.net

  • uBlock-Origin-dev-filter

    Filters to block and remove copycat-websites from DuckDuckGo, Google and other search engines. Specific to dev websites like StackOverflow or GitHub.

  • For developers, you can remove some spam websites from Google and other search engines, with these uBlock filters: https://github.com/quenhus/uBlock-Origin-dev-filter

  • digraph

    Organize the world

  • > I think building search verticals that are hand-curated would be very interesting to see.

    That was my inspiration behind a side project I made a few years ago: a decentralized, hand-curated "search engine" [0]. It never got beyond the side-project stage, but I see promise in this in the future. Eventually we'll figure out that human and moderated curation is better than the best machine learning.

    [0] https://github.com/emwalker/digraph

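The loda-identify-similar-programs entry above mentions running LSH first and then doing a more in-depth comparison to weed out false positives. A minimal sketch of that two-stage idea follows, assuming MinHash signatures with banding for the LSH stage and exact Jaccard similarity for the verification stage. This is not the project's actual code: the function names, the shingle size, the band count, and the threshold are all hypothetical choices made for illustration.

```python
import hashlib
from collections import defaultdict
from itertools import combinations


def shingles(text, k=3):
    """Return the set of k-token shingles of a whitespace-tokenized text."""
    tokens = text.split()
    if len(tokens) < k:
        return {tuple(tokens)}
    return {tuple(tokens[i:i + k]) for i in range(len(tokens) - k + 1)}


def minhash_signature(items, num_hashes=32):
    """Keep the minimum hash value per seeded hash function."""
    signature = []
    for seed in range(num_hashes):
        salt = str(seed).encode()  # a different salt gives a different hash function
        signature.append(min(
            int.from_bytes(
                hashlib.blake2b(repr(item).encode(), digest_size=8, salt=salt).digest(),
                "big",
            )
            for item in items
        ))
    return signature


def jaccard(a, b):
    """Exact Jaccard similarity of two non-empty sets."""
    return len(a & b) / len(a | b)


def candidate_pairs(signatures, bands=16):
    """LSH stage: documents that agree on any whole band become candidates."""
    rows = len(next(iter(signatures.values()))) // bands
    buckets = defaultdict(set)
    for name, sig in signatures.items():
        for b in range(bands):
            buckets[(b, tuple(sig[b * rows:(b + 1) * rows]))].add(name)
    pairs = set()
    for names in buckets.values():
        pairs.update(combinations(sorted(names), 2))
    return pairs


def similar_pairs(docs, threshold=0.5):
    """Run LSH, then verify each candidate with an exact comparison."""
    sets = {name: shingles(text) for name, text in docs.items()}
    sigs = {name: minhash_signature(s) for name, s in sets.items()}
    verified = {}
    for first, second in candidate_pairs(sigs):
        score = jaccard(sets[first], sets[second])
        if score >= threshold:  # false positives from LSH are dropped here
            verified[(first, second)] = score
    return verified
```

Calling `similar_pairs` on a dictionary that maps names to program text returns only the pairs that survive the exact check. The LSH stage keeps the number of exact comparisons small, and the exact check is what removes the false positives the comment warns about.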
NOTE: The number of mentions on this list counts mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • Ask HN: What web apps use WASM today?

    2 projects | news.ycombinator.com | 22 Jan 2022
  • Loda-lang – language, computational model, and OEIS miner

    3 projects | news.ycombinator.com | 25 Sep 2021
  • I keep getting this error message - "loda-221209-macos quit unexpectedly"

    1 project | /r/BOINC | 15 Dec 2022
  • My attempt at ChatGPT... I ran a linux terminal, then fooled it into shutting down

    1 project | /r/ProgrammerHumor | 10 Dec 2022
  • The On-Line Encyclopedia of Integer Sequences

    1 project | news.ycombinator.com | 18 Apr 2021