ArchiveBox
Wallabag
Metric | ArchiveBox | Wallabag
---|---|---
Mentions | 248 | 64
Stars | 19,433 | 9,607
Growth | 3.3% | 1.9%
Activity | 9.7 | 9.8
Last commit | 8 days ago | 3 days ago
Language | Python | PHP
License | MIT | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ArchiveBox
- Ask HN: What Underrated Open Source Project Deserves More Recognition?
Two projects I greatly appreciate, allowing me to easily archive my bandcamp and GOG purchases (after the initial setup anyways):
https://github.com/easlice/bandcamp-downloader
https://github.com/Kalanyr/gogrepoc
And I recently learned about archivebox, which I think is going to be a fast favorite and finally let me clear out my mess of tabs/bookmarks: https://github.com/ArchiveBox/ArchiveBox
- YaCy, a distributed Web Search Engine, based on a peer-to-peer network
- An Introduction to the WARC File
API is coming soon (relatively, it's still a one-man project)! Stay tuned https://github.com/ArchiveBox/ArchiveBox/issues/496
I have an event-sourcing refactor in progress now to allow us to pluginize functionality like the API (similar to Home Assistant with a plugin app store). It will take a month or two. Next up is the REST API using the new plugin system.
The ArchiveBox project (which gets reposted on the regular, e.g. https://news.ycombinator.com/item?id=38954189 ) also saves in WARC https://github.com/ArchiveBox/ArchiveBox#output-formats although I haven't used it personally, so I can't comment further.
- Ask HN: How can I back up an old vBulletin forum without admin access?
I guess your best chance is to use something like https://archivebox.io/.
- ArchiveBox – open-source self-hosted web archiving
Yeah this is a cool project but it was discussed 2 days ago.
As mentioned by the maintainer there, they even maintain a list of alternatives, very classy:
https://github.com/ArchiveBox/ArchiveBox/wiki/Web-Archiving-...
- ArchiveBox: Open-source self-hosted web archiving
Actually closer to 7 years ago :)
You can learn about the origin story / motivation here:
https://github.com/ArchiveBox/ArchiveBox#background--motivat...
https://2020.pycon.co/en/talks/5/ (a conference talk I gave about it)
Direct link: https://3xn.nl/projects/2022/02/17/archivebox-root-issue-in-...
Note that you no longer need to create a user manually, so this shouldn't be an issue anymore. Just set the ADMIN_USERNAME and ADMIN_PASSWORD env vars and it'll autocreate the user and collection on first run.
https://github.com/ArchiveBox/ArchiveBox/wiki/Configuration#...
I may add an opt-in federation option at some point in the far future. It would be great to figure out a way to link willing donors' ArchiveBox instances together for public benefit.
Follow here for progress: https://github.com/ArchiveBox/ArchiveBox/issues/50
Wallabag
- Linkhut: A Social Bookmarking Site
Wallabag[0] is useful too if you want a self-hosted bookmarking solution. I'm with Pinboard too, but regularly export my bookmarks so I have a backed up local copy of recent bookmarks I've added to Pinboard.
- VectorDB: Vector Database Built by Kagi Search
https://github.com/wallabag/wallabag
No one has mentioned wallabag yet, so wanted to. Been working well for me - has apps and extensions. If you’re not excited to self-host - https://www.wallabag.it/en has been flawless with the exorbitant price of… 11 euro a year.
- Free Tech Tools and Resources - WinPE Build, Cheatsheet Tool, PW Recovery & More
wallabag is a versatile self-hosted application for saving and organizing web pages so that online content stays readily accessible. With its intuitive GUI, users can store and categorize articles and retrieve them whenever they're ready to read. Kalc_DK recommends it "for managing links/content that goes into your knowledge base."
- wallabag can't save youtube properly
That's been logged as an issue in Wallabag, from back in 2016. So I wouldn't hold my breath on this being implemented anytime soon. https://github.com/wallabag/wallabag/issues/2149
There are some issues on GitHub, and the latest one is too old.
- Looking for a tool to save links to websites, articles, videos, and other content with auto generated tags
You could try wallabag
- Any URL/Website hoarders?
I use Shaarli for links, and I have an agent network that, among other things, throws links I want to save into a Wallabag install for archival and reference.
- Omnivore is a free and open source read-it-later service that allows you to sync your reading to Obsidian
If you want self-hosting, wallabag has been available for quite some time and does everything I need in a read-it-later app.
- Looking for recommendations (Bookmarks/Links)
- Software that you love and/or makes your job easier
Wallabag or Omnivore for managing links/content that goes into your knowledge base.
What are some alternatives?
Shiori - Simple bookmark manager built with Go
Nunux Keeper
paimon-moe - Your best Genshin Impact companion! Help you plan what to farm with ascension calculator and database. Also track your progress with todo and wish counter.
Readflow - readflow is a news-reading (or read-it-later) solution focused on versatility and simplicity.
ArchiveBox - 🗃 The open source self-hosted web archive. Takes browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more... [Moved to: https://github.com/ArchiveBox/ArchiveBox]
LinkAce - LinkAce is a self-hosted archive to collect links of your favorite websites.
SingleFile - Web Extension for saving a faithful copy of a complete web page in a single HTML file
ArchivesSpace - The ArchivesSpace archives management tool
grab-site - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Tiny-Tiny-RSS - A PHP and Ajax feed reader
Archivematica - Free and open-source digital preservation system designed to maintain standards-based, long-term access to collections of digital objects.
knowledge - Everything I know