markdownload
ArchiveBox
Our great sponsors
markdownload | ArchiveBox | |
---|---|---|
35 | 248 | |
2,471 | 19,737 | |
- | 3.1% | |
5.2 | 9.7 | |
24 days ago | 12 days ago | |
JavaScript | Python | |
Apache License 2.0 | MIT |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
markdownload
-
Show HN: I made a tool to clean and convert any webpage to Markdown
This fork:
https://github.com/deathau/markdownload
With extension available for Firefox, Google Chrome, Microsoft Edge and Safari.
- Show HN: Zenfetch – Turn your saved browsing content into an AI second brain
-
A structured note-taking app for personal use
> Not really. Obsidian has its shares of problems too, and most of them originate from using Markdown.
Aha. Which problems do you mean?
> Markdown is a freeform text-format, and works very well for writing text, but it really sucks for data and structured content.
Joplin is using md to. And if Joplin does a good job on "data" and "structured content" (whatever you mean by that) by separating that in their DB, it's a big NO for me since it's a closed silo.
This: https://github.com/blacksmithgu/obsidian-dataview works so wonderful for me, and it never breaks anything in my simple md files.
> Most plugins and features in that area are very brittle and overspecialized, working only well enough in their specific use case.
Aha. I don't think so. Which authority says that? And even if It's like that, my markdown files would survive everything, since they are a) in git. https://github.com/denolehov/obsidian-git and b) easy to fix since it's a text file. Gosh!
> And gosh, Obsidian has really a huge amount of plugins for data-handling.
And gosh, this is a good thing!
> At some point, it was so bad that there were multiple competing task-plugins which broke each other just because they had different formatting for dates.
Installing multiple task plugins shows that something is "broke" on the user side. It's not the fault of Markdown or Obsidian.
Just have a look on: https://github.com/ivan-lednev/obsidian-day-planner but you dont need a fancy task plugin like this, if you know your way around https://github.com/blacksmithgu/obsidian-dataview or https://github.com/obsidian-tasks-group/obsidian-tasks
Since the Ecosystem around Obsidian and pure Markdown, most of the time I stay in my browser https://github.com/deathau/markdownload and nvim https://github.com/epwalsh/obsidian.nvim
-
What are your second brain apps like Obsidian?
markdownload - (firefox) - I can use to download entire webpages into markdown - https://github.com/deathau/markdownload - sometimes it's just easier to snippet out a thing I want to keep or reference.
- Ask HN: What are some unpopular technologies you wish people knew more about?
-
Grimoire: Open-Source bookmark manager with extra features
My perfect bookmark manager is Markdownload https://github.com/deathau/markdownload
Just save the complete page, only selected text or only the link to a markdown file or Obsidian. With downloaded, linked or without pictures. My OS and Obsidian can search those files, they have more (automatically added) metadata.
I can even edit them in the browser: add your thoughts, tags or change the name of the file before they are saved.
I can (automatically) do with them what ever I need. They can be used to (automatically) generate an always up to date start page or a data vault on GitHub.
My local AI assistant can parse them.
Local, versatile, permanent, flexible, cost effective, future save. No need for a bookmark manager.
- Copy webpage text, convert to Markdown
-
Ask HN: Should we be saving our favorite information locally?
Yes and no.
Instead of PDF, use Markdownload (on iOS, use a Safari web content to markdown file extension):
https://github.com/deathau/markdownload
And save in a journaled folder like "YYYY-MM-DD - Page Title.md" with a YAML frontmatter of all available metadata.
Have this as a folder in your PKM of choice (Obsidian, Foam, whatever).
These days, point some text embedding at it, and let it generate your own LLM brain.
But you can also static-site-generate that back into your own web knowledge site or base.
-
Los impactos de la nueva normativa que permite a las AFP invertir en ETF activos
Como extraigo texto: MarkDownload - PC y markdownr - Android.
-
Wayback: Self-hosted archiving service integrated with Internet Archive
Looking at the link you gave does not help much in seeing what DiskerNet does and looks like, neither.
Keeping it simple, I download pages in Markdown adding some metadata (some tags). When I want images or more I use singlefile extension. Add Recoll to the mix and that's all I need.
https://github.com/deathau/markdownload
ArchiveBox
-
Ask HN: What Underrated Open Source Project Deserves More Recognition?
Two projects I greatly appreciate, allowing me to easily archive my bandcamp and GOG purchases (after the initial setup anyways):
https://github.com/easlice/bandcamp-downloader
https://github.com/Kalanyr/gogrepoc
And I recently learned about archivebox, which I think is going to be a fast favorite and finally let me clear out my mess of tabs/bookmarks: https://github.com/ArchiveBox/ArchiveBox
- YaCy, a distributed Web Search Engine, based on a peer-to-peer network
-
Vice website is shutting down
If you really want to save the content for yourself, use something like https://archivebox.io/
I've been running a local instance for a few years now and download/save tech articles all time. I can search and find them as needed.
-
An Introduction to the WARC File
API is coming soon (relatively, it's still a one-man project)! Stay tuned https://github.com/ArchiveBox/ArchiveBox/issues/496
I have an event-sourcing refactor in progress now to allow us to pluginize functionality like the API (similar to Home Assistant with a plugin app sotre), it will take a month or two. Next up is the REST API using the new plugin system.
-
Ask HN: How can I back up an old vBulletin forum without admin access?
I guess your best chance is to use something like https://archivebox.io/.
-
ArchiveBox – open-source self-hosted web archiving
Yeah this is a cool project but it was discussed 2 days ago.
As mentioned by the maintainer there, they even maintain a list of alternatives, very classy:
https://github.com/ArchiveBox/ArchiveBox/wiki/Web-Archiving-...
- ArchiveBox: Open-source self-hosted web archiving
- Linkhut: A Social Bookmarking Site
- Show HN: Rem: Remember Everything (open source)
- Bookmark manager with a focus on organization?
What are some alternatives?
logseq - A local-first, non-linear, outliner notebook for organizing and sharing your personal knowledge base. Use it to organize your todo list, to write your journals, or to record your unique life.
Wallabag - wallabag is a self hostable application for saving web pages: Save and classify articles. Read them later. Freely.
obsidian-clipper - A Chrome extension that easily clips selections to Obsidian
paimon-moe - Your best Genshin Impact companion! Help you plan what to farm with ascension calculator and database. Also track your progress with todo and wish counter.
nulis - Mind-mapping software that helps writers collect and organize their knowledge, develop their ideas. Built with React, Redux, Node.js, hosted on Digital Ocean.
SingleFile - Web Extension for saving a faithful copy of a complete web page in a single HTML file
obsidian-mind-map - An Obsidian plugin for displaying markdown notes as mind maps using Markmap.
ArchivesSpace - The ArchivesSpace archives management tool
vscode-memo - Markdown knowledge base with bidirectional [[link]]s built on top of VSCode [Moved to: https://github.com/svsool/memo]
grab-site - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Templater - A template plugin for obsidian
Archivematica - Free and open-source digital preservation system designed to maintain standards-based, long-term access to collections of digital objects.