SaaSHub helps you find the best software and product alternatives Learn more →
Web-discovery-project Alternatives
Similar projects and alternatives to web-discovery-project
-
SurveyJS
Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.
-
pyllms
Minimal Python library to connect to LLMs (OpenAI, Anthropic, AI21, Cohere, Aleph Alpha, HuggingfaceHub, Google PaLM2, with a built-in model performance benchmark.
web-discovery-project reviews and mentions
- Brave Web Discovery Project Overview
-
Brave Search launches own image and video search
https://github.com/brave/web-discovery-project/blob/main/mod...
I'm curious to see what you think about this. If you're not okay with Firefox telling Mozilla your IP address every time you connect, does the same go for Brave sending entire pages of your search results to them?
-
The shady world of Brave selling copyrighted data for AI training
> Simply observe the event in which a user does a query q in Brave and then, within one hour, does the same query on a different search engine.
> What we do is to move the script that detects bad-queries to the browser, run it against the queries that the user does in real-time and then, when all conditions are met, send the following data back to our servers,
Wait. Brave browser sends back to Brave Search engine about your browsing? I guess I would be uninstalling.
Ref: https://github.com/brave/web-discovery-project/blob/main/mod...
-
I watched this, then checked it out in Brave and found it's even worse there...
this doesn't make much sense to me, you can look at the web discovery project on their github: https://github.com/brave/web-discovery-project/tree/main/modules/web-discovery-project
-
Brave launches private search ads
Brave search is private in no-user-identification and no-reidentification-via-record-unlinkability senses always, both legs: we do not build user profiles. As with all engines, we learn from head of query log and what link is clicked. That is an essential first party purpose, but we collect no personal data because nothing is linkable across queries and clicks to any person.
If you opt into the Web Discovery Project (it's off by default and a separate setting), then your queries and clicks across all navigation are anonymized by dropping any with enough entropy to suspect they bear personal data, dropping IP and headers, and otherwise ensuring record-unlinkability. See
https://support.brave.com/hc/en-us/articles/4409406835469-Wh...
and
https://github.com/brave/web-discovery-project/blob/main/mod....
Because Premium Brave Search has no ads, it's not useful _per se_ for ad measurement, conversion, or modeling. But I took your question do mean "do premium search queries and outbound link clicks feed into the search engine?" -- they do. But no ads, and no way to model directly how an ad would perform, beyond the big data benefit that all search engine use to help ad sales and matching.
Last thing: with off-by-default (opt-in) Brave Rewards user ads (push => new tab on click), the matching agent is in the browser, inactive until opt-in, off upon opt-out, and you can clear its history. Confirmations and revenue shares via Chaumian blind signature protocol (Privacy Pass uses same crypto all). No server-side ad matching at all. Same for Brave News (for ads and all feeds, everything).
With search ads, matching is server-side (after the edge proxy that drops IP etc.) based on only the query, device, country, and timezone. Anything more personalized, we can go to client-side matching and ad insertion. I hope this helps.
-
Brave Search Privacy Question
In order to build the Brave Search index, all current Brave versions (at least on desktop) are shipped with Web Discovery. If you opt into Web Discovery, certain URLs and browsing behavior is sent to Brave to refine the index. Web Discovery provides a much better way of refining their algorithm than any crawler, since it is based off of real-world web traffic. Brave asserts that all data sent to itself is unable to be connected back to a single user. If you have the time, the whitepaper is available on GH and gets into the nitty-gritty: https://github.com/brave/web-discovery-project/blob/main/modules/web-discovery-project/sources/README.md
-
A note from our sponsor - SaaSHub
www.saashub.com | 11 May 2024
Stats
brave/web-discovery-project is an open source project licensed under Mozilla Public License 2.0 which is an OSI approved license.
The primary programming language of web-discovery-project is JavaScript.
Sponsored