Our great sponsors
-
https://github.com/loda-lang/loda-rust/blob/develop/script/t...
Example of the 100 most similar documents:
-
https://github.com/neoneye/loda-identify-similar-programs/bl...
There can be false positives, so after LSH then do a more in-depth comparison.
-
SonarQube
Static code analysis for 29 languages.. Your projects are multi-language. So is SonarQube analysis. Find Bugs, Vulnerabilities, Security Hotspots, and Code Smells so you can release quality code every time. Get started analyzing your projects today for free.
-
> Creating a "trustless" search crawler, where anybody can participate, and then applying an algorithm to determine trust or value feels like it'd be a never-ending arms race - that'd require AI and extensive/expensive resources
Not necessarily: https://yacy.net
-
uBlock-Origin-dev-filter
Filters to block and remove copycat-websites from DuckDuckGo, Google and other search engines. Specific to dev websites like StackOverflow or GitHub.
For developers, you can remove some spam websites from Google and other search engines, with these uBlock filters: https://github.com/quenhus/uBlock-Origin-dev-filter
-
> I think building search vertical that are hand-curated would be very interesting to see.
That was my inspiration behind a side project I made a few years ago — a decentralized, hand curated "search engine" [0]. Never got beyond the side project stage. But I see promise in this in the future. Eventually we'll figure out that human and moderated curation is better than the best machine learning.