DownloadNet
promnesia
DownloadNet | promnesia | |
---|---|---|
20 | 33 | |
3,653 | 1,698 | |
2.1% | - | |
6.1 | 7.6 | |
18 days ago | about 1 month ago | |
JavaScript | Python | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DownloadNet
-
ArchiveBox: Open-source self-hosted web archiving
For anyone who uses Chrome and wants to view their archived pages in the browser as if they were still online (URL and everything intact), and also full-text search through their browsing history that was archived (like AB plans to add in future, I think, right nikki?) you can check out DownloadNet: https://github.com/dosyago/DownloadNet
You can have multiple archives, and even use a mode where you only archive pages you bookmark rather than everything.
-
Show HN: Rem: Remember Everything (open source)
This does look cool. It reminds me of a recent discovery I made. The other day, while trying to recover some disk space, I found a giant file on my hard disk. It turned out to be a nine-hour screen recording from almost a year ago. I had no idea it existed, so I must’ve accidentally left the screen recording on. Watching it was fascinating; it was like a window into my thought process at that time. You could see how I was researching something online. It was almost like a play-by-play, akin to re-watching a sports performance – very instructive and surprisingly useful.
In a similar vein to what you’ve done, but focusing specifically on web browsing, I’ve created a tool called ‘DownloadNet.’ It archives for offline use and fully indexes every page you visit. Additionally, it can be configured to archive only the pages you bookmark, offering another mode of operation. It’s an open-source tool, so feel free to check it out: https://github.com/dosyago/DownloadNet
-
You're Gonna Need a Bigger Browser
Given that I directly work in this space I found the article's synthesis of a range of ideas about browser innovation to be highly relevant.
More generally, the article is actually extremely interesting and examines a bunch of ideas worthy of consideration if you're interested in the future of web browsing.
Perhaps none of the ideas are new in isolation, but it's encouraging that people are doing this foundational conceptual work and imagining where a synthesis of them would go.
Despite being interesting somehow on the page it was not so easy to read. Here's a summary of key ideas:
Stagnation in Browser Evolution: Berjon notes that despite being central to the web's architecture, browsers haven't changed much in their fundamental design for a long time. They have undergone incremental changes but the core concept remains largely the same as it was decades ago.
Reimagining Browsers: He suggests that to increase user agency—a principle that the web should empower users—we need to consider major overhauls to what a browser is and how it operates.
Integration of Search and Social: Berjon challenges the traditional separation of browsers, search engines, and social platforms. He advocates for an integrated approach where the browser encompasses these functions, aligning more closely with users' experiences and expectations.
Shift From Client to Agent: The author proposes rethinking the browser not just as a client for retrieving documents but as an "agent" that provides a variety of services, potentially including server-like functions, to empower users.
User Agency and Personal Data Servers: By incorporating elements such as Personal Data Servers (PDS), users could manage their own data and services like recommendations, identity, and subscriptions, which currently rely on third-party providers.
Tab Management: Berjon critiques the use of tabs, suggesting that they are an ineffective method for organizing and interacting with web content, and advocates for better UI solutions.
Business Models: He delves into the financial aspects of browsers, highlighting the significant profits derived from setting search engine defaults. Berjon argues for reinvestment of these profits into the web as a public good and for developing business models that truly benefit user agency.
Potential for Change: Despite the challenges, Berjon is optimistic about the possibility of change, noting that there is room for product differentiation and that financial incentives can drive innovation in the browser space.
I found the one about User Agency and Personal Data Servers particularly fascinating. I've been exploring the idea of a federated search engine, where a person curates their own search through their browsing history (and ultimately could share it socially), in DownloadNet: https://github.com/dosyago/DownloadNet
And my company has been developing a platform for building extended and customized browsing experiences and delivering them anywhere. It's my hope that BrowserBox will play a part in the future direction of the browser as user agent. It's open source so if you care about the future of the web, get involved: https://github.com/BrowserBox/BrowserBox :)
-
Google Chrome pushes browser history-based ad targeting
If you're interested in utilizing your history information for something in your intentional interests, consider saving an archive of pages you browse to make a search engine you can query back through later.
You can save the full content for indexing with full text search, and you can even export archives as tarballs by zipping up the directory. Many people find this a useful way to "mine" their own browser history to create a curated search engine aligned with your interests. Or simply to save the pages they browse for review offline--either to save bandwidth, or just because they're actually "offline"--at a remote site, or on an airplane.
Everything is saved in a fully interactive way. Personally tho, I find search the most useful feature. Also, we're open source so if you want to get involved, please do so!
https://github.com/dosyago/DiskerNet
-
Show HN: Linkwarden – An open source collaborative bookmark manager
If you want full-text-search with archiving check out my project, DiskerNet. https://github.com/dosyago/DiskerNet --> also well done on LinkWarden! Looks like a great product! :)
- Show HN: DiskerNet – Browse the Internet from Your Disk, Now Open Source
-
Wayback: Self-hosted archiving service integrated with Internet Archive
For archiving, look into https://github.com/dosyago/DiskerNet
It's real next gen thinking on this topic.
As for the featured tool wayback... If HN readers can't figure out what it does after reading docs, its likely the thinking behind it is equally unclear.
- DiskerNet - Save and index web content locally
- Show HN: DiskerNet – save and index web content locally
promnesia
-
Mozilla "MemoryCache" Local AI
In term of automatically saving everything, There is heyday.xyz, polished but quite expensive. Or https://github.com/karlicoss/promnesia, a more experimental take.
-
Update 4: RedReader granted non-commercial accessibility exemption
Promnesia & theconversation.social were on similar themes/solutions.
-
Ask HN: How do you save and browse external interesting URLs?
1. you often don't know what resources you will really "value" in the future, so no more to save or not to save, this is the question
2. tagging, to be effective, require discipline (thinking about then sticking to an agile system). So, we just replace it with search, preferably NLP/AI (so you don't have to remember the exact keywords)
Apps do exist, from the expansive [1] to the experimental [2].
Personally I invested time in my filling system, and over-saving does not cause me much angst, so I’m OK with it. I also use maintenance as an occasion for renewed discovery.
[1] https://heyday.xyz/
[2] https://github.com/karlicoss/promnesia
- Ask HN: Search what you've seen on the web before
- Making Twitter likes/bookmarks backup tool as side quest of offline first browser (that saves everything)
- Making Twitter likes/bookmarks backup tool as side quest of browser that saves everything
-
Making Twitter likes backup tool as side quest of browser/second brain
I want to build a browser that captures everything I saw on the internet, allows me to search it, run graph algorithms (like PageRank). Improves navigation (by showing trails as tree instead of tabs). Heavily offline focused (Backend only for updates, maybe for analytics).
Difference with rewind.ai: linkkraft does not have funding, i'm solo, no apps & image/video/audio recognition. Focus on web, trails, research and using web copies, selections/highlights as part of your notes & whiteboards. Preserving all possible graphs.
My inspirations: https://pages.gseis.ucla.edu/faculty/bates/berrypicking.html, https://beepb00p.xyz/promnesia.html, Jeff Raskin (Global Search, Zoom UI) https://linkkraft.com/notes/backstory
I've built a prototype with trails tree & HTML snapshoting. For each my step even inside SPA linkkraft creates HTML snapshot.
-
Is there a browser extension, which shows suggestions of my vault, when googeling like Evernote's webclipper?
Promnesia works like that: https://github.com/karlicoss/promnesia/
- The coolest Python projects you've ever seen?
- Ask HN: Does anybody still use bookmarking services?
What are some alternatives?
min - A fast, minimal browser that protects your privacy
grasp - A reliable org-capture browser extension for Chrome/Firefox
SingleFileZ - Web Extension to save a faithful copy of an entire web page in a self-extracting ZIP file
ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
BackstopJS - Catch CSS curve balls.
archivy - Archivy is a self-hostable knowledge repository that allows you to learn and retain information in your own personal and extensible wiki.
hamsterbase - self-hosted, local-first web archive application.
PowerDeleteSuite - Power Delete Suite for Reddit
ZAP - The ZAP core project
monolith - ⬛️ CLI tool for saving complete web pages as a single HTML file
Archiver - a streaming interface for archive generation
ArchiveBox - 🗃 The open source self-hosted web archive. Takes browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more... [Moved to: https://github.com/ArchiveBox/ArchiveBox]