22120
DownloadNet
Our great sponsors
22120 | DownloadNet | |
---|---|---|
13 | 20 | |
2,638 | 3,643 | |
- | 2.1% | |
9.7 | 6.4 | |
over 2 years ago | 5 days ago | |
JavaScript | JavaScript | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
22120
-
Is there a browser addon which locally archives every website I visit?
Here. An archivist browser controller that caches everything you browse, a library server with full text search to serve your archive.
- Show HN: Irchiver, your full-resolution personal web archive
-
Ask HN: Full text search engine in JavaScript for English and and Chinese?
Following your "hilarious" and disrespectful answer here https://github.com/i5ik/22120/issues/63#issuecomment-7275272..., I would prefer that you remove any reference to SingleFile in the description of your project. I could not open an issue because you blocked me. And please don't accuse people without proof.
- 22120: self-host the Internet with an Offline Archive. Similar to ArchiveBox, SingleFile and WebMemex. Works well with WorldBrain/Memex to give you full-text search. Why not WARC? Uses Chrome DevTools protocol to intercept all requests, and caches responses against a key of (method, URL)
-
Request: Proxy caching all visited websites text in DB, making history searchable
https://github.com/i5ik/22120 is a tool that archives as you browse that you can then view offline later
- Is the there a way I can cache videos(reddit.4chan) I watch in browser (Linux)?
-
So you want to write a GUI framework
My solution to this (it's been done before), is to use the existing browser engine (not the system webview) installed. So far I only utilize Chrome, but as the way I connect to it is over the Chrome DevTools protocol which is somewhat fluent with the Remote Debugging Protocol[0] that Firefox is doing, this is a reasonable approach.
So far my "tool" to do this is simply a template repository with some conveniences, providing in essence a skeleton for these types of apps. I hope to flesh this out a little more, and expose a much richer API, as well as convert some of my existing popular apps (like 22120[1]) to the "framework".
The benefit of this is Graderjs has a built in 'app builder' that can create a cross-platform binary (excluding or ignoring the necessity (on MacOS) and near-necessity (on Windows) to sign your executable somehow, that lets you display your UI in JS/HTML/CSS using the already installed browser engine, as well as run code in NodeJS and using the rich APIs[2] of the browser engine itself. I'm really happy with this project and think that, even tho it's small now, it will in time become my most popular and powerful one: even bigger than my remote browser and popular web archiver.
Just give it time! :)
[0]: https://firefox-source-docs.mozilla.org/remote/index.html
[1]: https://github.com/i5ik/22120
[2]: https://chromedevtools.github.io/devtools-protocol/tot/Brows...
The GraderJS: https://github.com/i5ik/graderjs
-
Ask HN: Why saving webpages on hard disk has not got better?
I use this to backup pages automatically
https://github.com/i5ik/22120
-
Saving all browsed websites automatically
Does this potentially help? https://github.com/c9fe/22120
-
Make Your Own Internet Archive with Archive Box
From the blog comments, I think this is what youâre after https://github.com/c9fe/22120
DownloadNet
-
ArchiveBox: Open-source self-hosted web archiving
For anyone who uses Chrome and wants to view their archived pages in the browser as if they were still online (URL and everything intact), and also full-text search through their browsing history that was archived (like AB plans to add in future, I think, right nikki?) you can check out DownloadNet: https://github.com/dosyago/DownloadNet
You can have multiple archives, and even use a mode where you only archive pages you bookmark rather than everything.
-
Show HN: Rem: Remember Everything (open source)
This does look cool. It reminds me of a recent discovery I made. The other day, while trying to recover some disk space, I found a giant file on my hard disk. It turned out to be a nine-hour screen recording from almost a year ago. I had no idea it existed, so I mustâve accidentally left the screen recording on. Watching it was fascinating; it was like a window into my thought process at that time. You could see how I was researching something online. It was almost like a play-by-play, akin to re-watching a sports performance â very instructive and surprisingly useful.
In a similar vein to what youâve done, but focusing specifically on web browsing, Iâve created a tool called âDownloadNet.â It archives for offline use and fully indexes every page you visit. Additionally, it can be configured to archive only the pages you bookmark, offering another mode of operation. Itâs an open-source tool, so feel free to check it out: https://github.com/dosyago/DownloadNet
-
You're Gonna Need a Bigger Browser
Given that I directly work in this space I found the article's synthesis of a range of ideas about browser innovation to be highly relevant.
More generally, the article is actually extremely interesting and examines a bunch of ideas worthy of consideration if you're interested in the future of web browsing.
Perhaps none of the ideas are new in isolation, but it's encouraging that people are doing this foundational conceptual work and imagining where a synthesis of them would go.
Despite being interesting somehow on the page it was not so easy to read. Here's a summary of key ideas:
Stagnation in Browser Evolution: Berjon notes that despite being central to the web's architecture, browsers haven't changed much in their fundamental design for a long time. They have undergone incremental changes but the core concept remains largely the same as it was decades ago.
Reimagining Browsers: He suggests that to increase user agencyâa principle that the web should empower usersâwe need to consider major overhauls to what a browser is and how it operates.
Integration of Search and Social: Berjon challenges the traditional separation of browsers, search engines, and social platforms. He advocates for an integrated approach where the browser encompasses these functions, aligning more closely with users' experiences and expectations.
Shift From Client to Agent: The author proposes rethinking the browser not just as a client for retrieving documents but as an "agent" that provides a variety of services, potentially including server-like functions, to empower users.
User Agency and Personal Data Servers: By incorporating elements such as Personal Data Servers (PDS), users could manage their own data and services like recommendations, identity, and subscriptions, which currently rely on third-party providers.
Tab Management: Berjon critiques the use of tabs, suggesting that they are an ineffective method for organizing and interacting with web content, and advocates for better UI solutions.
Business Models: He delves into the financial aspects of browsers, highlighting the significant profits derived from setting search engine defaults. Berjon argues for reinvestment of these profits into the web as a public good and for developing business models that truly benefit user agency.
Potential for Change: Despite the challenges, Berjon is optimistic about the possibility of change, noting that there is room for product differentiation and that financial incentives can drive innovation in the browser space.
I found the one about User Agency and Personal Data Servers particularly fascinating. I've been exploring the idea of a federated search engine, where a person curates their own search through their browsing history (and ultimately could share it socially), in DownloadNet: https://github.com/dosyago/DownloadNet
And my company has been developing a platform for building extended and customized browsing experiences and delivering them anywhere. It's my hope that BrowserBox will play a part in the future direction of the browser as user agent. It's open source so if you care about the future of the web, get involved: https://github.com/BrowserBox/BrowserBox :)
-
Google Chrome pushes browser history-based ad targeting
If you're interested in utilizing your history information for something in your intentional interests, consider saving an archive of pages you browse to make a search engine you can query back through later.
You can save the full content for indexing with full text search, and you can even export archives as tarballs by zipping up the directory. Many people find this a useful way to "mine" their own browser history to create a curated search engine aligned with your interests. Or simply to save the pages they browse for review offline--either to save bandwidth, or just because they're actually "offline"--at a remote site, or on an airplane.
Everything is saved in a fully interactive way. Personally tho, I find search the most useful feature. Also, we're open source so if you want to get involved, please do so!
https://github.com/dosyago/DiskerNet
-
Show HN: Linkwarden â An open source collaborative bookmark manager
If you want full-text-search with archiving check out my project, DiskerNet. https://github.com/dosyago/DiskerNet --> also well done on LinkWarden! Looks like a great product! :)
- Show HN: DiskerNet â Browse the Internet from Your Disk, Now Open Source
-
Wayback: Self-hosted archiving service integrated with Internet Archive
For archiving, look into https://github.com/dosyago/DiskerNet
It's real next gen thinking on this topic.
As for the featured tool wayback... If HN readers can't figure out what it does after reading docs, its likely the thinking behind it is equally unclear.
- DiskerNet - Save and index web content locally
- Show HN: DiskerNet â save and index web content locally
What are some alternatives?
ArchiveBox - đ Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
min - A fast, minimal browser that protects your privacy
asciidoctor-latex - :triangular_ruler: Add LaTeX features to AsciiDoc & convert AsciiDoc to LaTeX
SingleFileZ - Web Extension to save a faithful copy of an entire web page in a self-extracting ZIP file
pywb - Core Python Web Archiving Toolkit for replay and recording of web archives
BackstopJS - Catch CSS curve balls.
SingleFile - Web Extension for saving a faithful copy of a complete web page in a single HTML file
hamsterbase - self-hosted, local-first web archive application.
notes - A zero dependency shell script that makes it really simple to manage your text notes.
ZAP - The ZAP core project
linux-surface - Linux Kernel for Surface Devices
Archiver - a streaming interface for archive generation