| | DownloadNet | ripgrep-all |
|---|---|---|
| Mentions | 20 | 43 |
| Stars | 3,653 | 6,200 |
| Growth | 2.1% | - |
| Activity | 6.1 | 7.6 |
| Latest commit | 18 days ago | 1 day ago |
| Language | JavaScript | Rust |
| License | GNU General Public License v3.0 or later | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DownloadNet
-
ArchiveBox: Open-source self-hosted web archiving
For anyone who uses Chrome and wants to view their archived pages in the browser as if they were still online (URL and everything intact), plus full-text search through the browsing history they archived (like AB plans to add in future, I think, right nikki?), check out DownloadNet: https://github.com/dosyago/DownloadNet
You can have multiple archives, and even use a mode where you only archive pages you bookmark rather than everything.
-
Show HN: Rem: Remember Everything (open source)
This does look cool. It reminds me of a recent discovery I made. The other day, while trying to recover some disk space, I found a giant file on my hard disk. It turned out to be a nine-hour screen recording from almost a year ago. I had no idea it existed, so I must’ve accidentally left the screen recording on. Watching it was fascinating; it was like a window into my thought process at that time. You could see how I was researching something online. It was almost like a play-by-play, akin to re-watching a sports performance – very instructive and surprisingly useful.
In a similar vein to what you’ve done, but focusing specifically on web browsing, I’ve created a tool called ‘DownloadNet.’ It archives for offline use and fully indexes every page you visit. Additionally, it can be configured to archive only the pages you bookmark, offering another mode of operation. It’s an open-source tool, so feel free to check it out: https://github.com/dosyago/DownloadNet
-
You're Gonna Need a Bigger Browser
Given that I directly work in this space I found the article's synthesis of a range of ideas about browser innovation to be highly relevant.
More generally, the article is actually extremely interesting and examines a bunch of ideas worthy of consideration if you're interested in the future of web browsing.
Perhaps none of the ideas are new in isolation, but it's encouraging that people are doing this foundational conceptual work and imagining where a synthesis of them would go.
Despite being interesting, the article was somehow not easy to read on the page. Here's a summary of its key ideas:
Stagnation in Browser Evolution: Berjon notes that despite being central to the web's architecture, browsers haven't changed much in their fundamental design for a long time. They have undergone incremental changes but the core concept remains largely the same as it was decades ago.
Reimagining Browsers: He suggests that to increase user agency—a principle that the web should empower users—we need to consider major overhauls to what a browser is and how it operates.
Integration of Search and Social: Berjon challenges the traditional separation of browsers, search engines, and social platforms. He advocates for an integrated approach where the browser encompasses these functions, aligning more closely with users' experiences and expectations.
Shift From Client to Agent: The author proposes rethinking the browser not just as a client for retrieving documents but as an "agent" that provides a variety of services, potentially including server-like functions, to empower users.
User Agency and Personal Data Servers: By incorporating elements such as Personal Data Servers (PDS), users could manage their own data and services like recommendations, identity, and subscriptions, which currently rely on third-party providers.
Tab Management: Berjon critiques the use of tabs, suggesting that they are an ineffective method for organizing and interacting with web content, and advocates for better UI solutions.
Business Models: He delves into the financial aspects of browsers, highlighting the significant profits derived from setting search engine defaults. Berjon argues for reinvestment of these profits into the web as a public good and for developing business models that truly benefit user agency.
Potential for Change: Despite the challenges, Berjon is optimistic about the possibility of change, noting that there is room for product differentiation and that financial incentives can drive innovation in the browser space.
I found the one about User Agency and Personal Data Servers particularly fascinating. I've been exploring the idea of a federated search engine, where a person curates their own search through their browsing history (and ultimately could share it socially), in DownloadNet: https://github.com/dosyago/DownloadNet
And my company has been developing a platform for building extended and customized browsing experiences and delivering them anywhere. It's my hope that BrowserBox will play a part in the future direction of the browser as user agent. It's open source so if you care about the future of the web, get involved: https://github.com/BrowserBox/BrowserBox :)
-
Google Chrome pushes browser history-based ad targeting
If you're interested in putting your history to work for your own deliberate interests, consider saving an archive of the pages you browse to build a search engine you can query later.
You can save the full content for indexing with full-text search, and you can even export archives as tarballs by compressing the directory. Many people find this a useful way to "mine" their own browser history and create a curated search engine aligned with their interests, or simply to save the pages they browse for offline review, either to save bandwidth or because they're actually offline, at a remote site or on an airplane.
Everything is saved in a fully interactive way. Personally, though, I find search the most useful feature. Also, we're open source, so if you want to get involved, please do!
https://github.com/dosyago/DiskerNet
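The tarball export described above is plain directory compression, nothing DownloadNet-specific. A minimal sketch, using a hypothetical demo-archive directory standing in for wherever your archive is actually saved:

```shell
# Stand-in for the saved-pages directory (the real path depends on
# your DownloadNet configuration).
mkdir -p demo-archive/pages
printf '<html><body>saved page</body></html>\n' > demo-archive/pages/page.html

# Export the whole archive as a compressed tarball...
tar -czf demo-archive.tar.gz demo-archive

# ...and list its contents to confirm the pages were captured.
tar -tzf demo-archive.tar.gz
```

The resulting demo-archive.tar.gz can be copied to another machine or unpacked with `tar -xzf` for offline review.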
-
Show HN: Linkwarden – An open source collaborative bookmark manager
If you want full-text search with archiving, check out my project, DiskerNet: https://github.com/dosyago/DiskerNet. Also, well done on Linkwarden! Looks like a great product! :)
- Show HN: DiskerNet – Browse the Internet from Your Disk, Now Open Source
-
Wayback: Self-hosted archiving service integrated with Internet Archive
For archiving, look into https://github.com/dosyago/DiskerNet
It's real next-gen thinking on this topic.
As for the featured tool, Wayback... if HN readers can't figure out what it does after reading the docs, it's likely the thinking behind it is equally unclear.
- DiskerNet - Save and index web content locally
- Show HN: DiskerNet – save and index web content locally
ripgrep-all
- Ripgrep-all: rga: ripgrep, but also search PDFs, E-Books, Office documents, zip
-
Ripgrep is faster than {grep, ag, Git grep, ucg, pt, sift}
I searched in Portage, and it seems there is another version that also works with other document types, like PDFs and .doc files.
https://github.com/phiresky/ripgrep-all
-
Calibre – New in Calibre 7.0
If you want even faster search across different formats, you can try ripgrep-all ( https://github.com/phiresky/ripgrep-all ). It can search across epub, docx, pdf, zip, mp4, etc. If you are handy with the tool, you can write a custom adapter to search across images using OCR with Tesseract.
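Custom adapters like the Tesseract OCR one mentioned above are declared in rga's config file (on Linux, typically ~/.config/ripgrep-all/config.jsonc). The fragment below is a hedged sketch: the field names follow the rga wiki's custom-adapter schema, but the exact schema and the Tesseract arguments may vary by version, so treat every value here as an assumption and check the wiki for your installed release.

```jsonc
{
  // Hypothetical custom adapter: run Tesseract OCR over image files so
  // their recognized text becomes searchable by rga.
  "custom_adapters": [
    {
      "name": "tesseract-ocr",
      "version": 1,
      "description": "OCR images with Tesseract (assumed invocation)",
      "extensions": ["png", "jpg", "jpeg"],
      "mimetypes": [],
      // rga pipes the file to the binary's stdin and indexes its stdout;
      // "tesseract - -" reads an image from stdin and prints text to stdout.
      "binary": "tesseract",
      "args": ["-", "-"],
      "disabled_by_default": false,
      "match_only_by_mime": false
    }
  ]
}
```

With the adapter saved and enabled, something like `rga --rga-adapters=+tesseract-ocr 'invoice' scans/` should then match text recognized inside the images (the flag name is likewise taken from the rga docs; verify with `rga --help`).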
- Rga: Ripgrep, but also search in PDF, ebooks, office documents, zip, tar.gz etc.
-
Show HN: Khoj – Chat Offline with Your Second Brain Using Llama 2
1. If you want better adoption, especially among corporations, GPL-3 won't cut it. Maybe consider a more business-friendly license (MIT, etc.).
2. I understand the excitement about LLMs, but how about making something more accessible? I use ripgrep-all (rga) along with fzf [1], which can search all files, including PDFs, in specific folders. However, I would like a GUI tool that can search across multiple folders, prioritize results across folders, and store and search histories so I can do a meta-search. This covers 95% of my local-search use cases, and I don't need an LLM for it. If Khoj could enable such search by default, without an LLM, that would be a game-changer for the many people who don't have a heavy compute machine or don't want to use OpenAI.
[1] https://github.com/phiresky/ripgrep-all/wiki/fzf-Integration
-
How to make file paths clickable?
I use `rga` to search through multiple PDF files for work. The tool returns a list of files, and I would like to make those file paths clickable.
- Burgr – Books in Your Terminal
-
Is there a way to search multiple epub and pdf files?
rga, aka ripgrep-all
-
Internet Archive Scholar
I wanted to say 'au contraire' to your 'screenshots are not searchable' and link this [0], but I don't actually see images in the readme... I swear it was there; maybe it's behind a buried extra flag...
[0] https://github.com/phiresky/ripgrep-all
- Recoll – Full-text search for your desktop
What are some alternatives?
min - A fast, minimal browser that protects your privacy
pdfgrep - PDFGrep is a GNU/Emacs module providing grep comparable facilities but for PDF files
SingleFileZ - Web Extension to save a faithful copy of an entire web page in a self-extracting ZIP file
OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
BackstopJS - Catch CSS curve balls.
notational-fzf-vim - Notational velocity for vim.
hamsterbase - self-hosted, local-first web archive application.
InvoiceNet - Deep neural network to extract intelligent information from invoice documents.
ZAP - The ZAP core project
fd - A simple, fast and user-friendly alternative to 'find'
Archiver - a streaming interface for archive generation
ripgrep - ripgrep recursively searches directories for a regex pattern while respecting your gitignore