monolith
promnesia
monolith | promnesia | |
---|---|---|
23 | 33 | |
9,929 | 1,693 | |
24.1% | - | |
7.2 | 7.6 | |
about 1 month ago | about 1 month ago | |
Rust | Python | |
Creative Commons Zero v1.0 Universal | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
monolith
-
🛠️Non-AI Open Source Projects that are 🔥
Monolith is a CLI tool for saving complete web pages as a single HTML file.
-
An Introduction to the WARC File
I have never used monolith to say with any certainty, but two things in your description are worth highlighting between the goals of WARC versus the umpteen bazillion "save this one page I'm looking at as a single file" type projects:
1. WARC is designed, as a goal, to archive the request-response handshake. It does not get into the business of trying to make it easy for a browser to subsequently display that content, since that's a browser's problem
2. Using your cited project specifically, observe the number of "well, save it but ..." options <https://github.com/Y2Z/monolith#options> which is in stark contrast to the archiving goals I just spoke about. It's not a good snapshot of history if the server responded with `content-type: text/html;charset=iso-8859-1` back in the 90s but "modern tools" want everything to be UTF-8 so we'll just convert it, shall we? Bah, I don't like JavaScript, so we'll just toss that out, shall we? And so on
For 100% clarity: monolith, and similar, may work fantastic for any individual's workflow, and I'm not here to yuck anyone's yum; but I do want to highlight that all things being equal it should always be possible to derive monolith files from warc files because the warc files are (or at least have the goal of) perfect fidelity of what the exchange was. I would guess only pcap files would be of higher fidelity, but also a lot more extraneous or potentially privacy violating details
- Reddit limits the use of API to 1000,Let's work together to save the content of StableDiffusion Subreddit as a team
-
nix-init: Create Nix packages with just the URL, with support for dependency inference, license detection, hash prefetching, and more
console $ nix-init default.nix -u https://github.com/Y2Z/monolith [...] (press enter to select the defaults) $ nix-build -E "(import { }).callPackage ./. { }" [...] $ result/bin/monilith --version monolith 2.7.0
-
What is the best free, least likely to discontinue, high data allowance app/service for saving articles/webpages permanently?
For example, here’s a command-line tool to save webpages as HTML files: https://github.com/Y2Z/monolith
- Offline Internet Archive
-
Rust Easy! Modern Cross-platform Command Line Tools to Supercharge Your Terminal
monolith: Convert any webpage into a single HTML file with all assets inlined.
-
Is there a way to (bulk) save all tabs as a pdf document in a quick way?
There is also a program (monolith: https://github.com/Y2Z/monolith) that does the same
-
Is there a good list of up-to-date data archiving tools for different websites?
besides wget, for single pages I use monolith https://github.com/Y2Z/monolith
-
Ask HN: Full-text browser history search forever?
You can pipe the URLs through something like monolith[1].
https://github.com/Y2Z/monolith
promnesia
-
Mozilla "MemoryCache" Local AI
In term of automatically saving everything, There is heyday.xyz, polished but quite expensive. Or https://github.com/karlicoss/promnesia, a more experimental take.
-
Update 4: RedReader granted non-commercial accessibility exemption
Promnesia & theconversation.social were on similar themes/solutions.
-
Ask HN: How do you save and browse external interesting URLs?
1. you often don't know what resources you will really "value" in the future, so no more to save or not to save, this is the question
2. tagging, to be effective, require discipline (thinking about then sticking to an agile system). So, we just replace it with search, preferably NLP/AI (so you don't have to remember the exact keywords)
Apps do exist, from the expansive [1] to the experimental [2].
Personally I invested time in my filling system, and over-saving does not cause me much angst, so I’m OK with it. I also use maintenance as an occasion for renewed discovery.
[1] https://heyday.xyz/
[2] https://github.com/karlicoss/promnesia
- Ask HN: Search what you've seen on the web before
- Making Twitter likes/bookmarks backup tool as side quest of offline first browser (that saves everything)
- Making Twitter likes/bookmarks backup tool as side quest of browser that saves everything
-
Making Twitter likes backup tool as side quest of browser/second brain
I want to build a browser that captures everything I saw on the internet, allows me to search it, run graph algorithms (like PageRank). Improves navigation (by showing trails as tree instead of tabs). Heavily offline focused (Backend only for updates, maybe for analytics).
Difference with rewind.ai: linkkraft does not have funding, i'm solo, no apps & image/video/audio recognition. Focus on web, trails, research and using web copies, selections/highlights as part of your notes & whiteboards. Preserving all possible graphs.
My inspirations: https://pages.gseis.ucla.edu/faculty/bates/berrypicking.html, https://beepb00p.xyz/promnesia.html, Jeff Raskin (Global Search, Zoom UI) https://linkkraft.com/notes/backstory
I've built a prototype with trails tree & HTML snapshoting. For each my step even inside SPA linkkraft creates HTML snapshot.
-
Is there a browser extension, which shows suggestions of my vault, when googeling like Evernote's webclipper?
Promnesia works like that: https://github.com/karlicoss/promnesia/
- The coolest Python projects you've ever seen?
- Ask HN: Does anybody still use bookmarking services?
What are some alternatives?
SingleFile - Web Extension for saving a faithful copy of a complete web page in a single HTML file
grasp - A reliable org-capture browser extension for Chrome/Firefox
ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
SingleFileZ - Web Extension to save a faithful copy of an entire web page in a self-extracting ZIP file
archivy - Archivy is a self-hostable knowledge repository that allows you to learn and retain information in your own personal and extensible wiki.
shrface - Extend eww/nov with org-mode features, archive web pages to org files with shr.
PowerDeleteSuite - Power Delete Suite for Reddit
ArchiveBox - 🗃 The open source self-hosted web archive. Takes browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more... [Moved to: https://github.com/ArchiveBox/ArchiveBox]
Wallabag - wallabag is a self hostable application for saving web pages: Save and classify articles. Read them later. Freely.
org-linkz - Managing browser links in org file.