stract
ArchiveBox
stract | ArchiveBox | |
---|---|---|
5 | 249 | |
1,741 | 19,959 | |
8.4% | 2.2% | |
9.7 | 9.8 | |
3 days ago | 5 days ago | |
Rust | Python | |
GNU Affero General Public License v3.0 | MIT |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
stract
-
YaCy, a distributed Web Search Engine, based on a peer-to-peer network
There are already many project about search:
- https://www.marginalia.nu/
- https://searchmysite.net/
- https://lucene.apache.org/
- elastic search
- https://presearch.com/
- https://stract.com/
- https://wiby.me/
I think that all project are fun. I would like to see one succeeding at reaching mainstream level of attention.
I have also been gathering links meta data for some time. Maybe I will use them to feed any eventual self hosted search engine, or language model, if I decide to experiment with that.
- domains for seed https://github.com/rumca-js/Internet-Places-Database
- bookmarks seed https://github.com/rumca-js/RSS-Link-Database
- links for year https://github.com/rumca-js/RSS-Link-Database-2024
-
Open Source Search Engine as an Alternative to Google Built in Spare Time
The actual URL is https://stract.com/
The link is to a blog that tries to get your email in order to let you read.
- StractOrg/stract: web search done right
-
The Guy Building an Open-Source Google Search Competitor in His Spare Time
Title was edited for length and synthesized by combining the original 404 Media email newsletter title[0] with the original article title[1].
[0] The Guy Building an Open-Source Google Search Competitor
[1] This Guy Has Built an Open Source Search Engine as an Alternative to Google in His Spare Time
Previously/related on HN:
Stract: Open-souce, non-profit search engine (https://stract.com/)
-
Stract: Open-Souce, Non-Profit Search Engine
Sources: https://github.com/StractOrg/stract
Backend in Rust (axum web framework, rocksdb), frontend with Svelte.
ArchiveBox
-
Ask HN: What Underrated Open Source Project Deserves More Recognition?
Two projects I greatly appreciate, allowing me to easily archive my bandcamp and GOG purchases (after the initial setup anyways):
https://github.com/easlice/bandcamp-downloader
https://github.com/Kalanyr/gogrepoc
And I recently learned about archivebox, which I think is going to be a fast favorite and finally let me clear out my mess of tabs/bookmarks: https://github.com/ArchiveBox/ArchiveBox
- YaCy, a distributed Web Search Engine, based on a peer-to-peer network
-
Vice website is shutting down
If you really want to save the content for yourself, use something like https://archivebox.io/
I've been running a local instance for a few years now and download/save tech articles all time. I can search and find them as needed.
-
An Introduction to the WARC File
API is coming soon (relatively, it's still a one-man project)! Stay tuned https://github.com/ArchiveBox/ArchiveBox/issues/496
I have an event-sourcing refactor in progress now to allow us to pluginize functionality like the API (similar to Home Assistant with a plugin app sotre), it will take a month or two. Next up is the REST API using the new plugin system.
-
Ask HN: How can I back up an old vBulletin forum without admin access?
I guess your best chance is to use something like https://archivebox.io/.
-
ArchiveBox – open-source self-hosted web archiving
Yeah this is a cool project but it was discussed 2 days ago.
As mentioned by the maintainer there, they even maintain a list of alternatives, very classy:
https://github.com/ArchiveBox/ArchiveBox/wiki/Web-Archiving-...
- ArchiveBox: Open-source self-hosted web archiving
- Linkhut: A Social Bookmarking Site
- Show HN: Rem: Remember Everything (open source)
- Bookmark manager with a focus on organization?
What are some alternatives?
Wallabag - wallabag is a self hostable application for saving web pages: Save and classify articles. Read them later. Freely.
paimon-moe - Your best Genshin Impact companion! Help you plan what to farm with ascension calculator and database. Also track your progress with todo and wish counter.
SingleFile - Web Extension for saving a faithful copy of a complete web page in a single HTML file
ArchivesSpace - The ArchivesSpace archives management tool
grab-site - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Archivematica - Free and open-source digital preservation system designed to maintain standards-based, long-term access to collections of digital objects.
knowledge - Everything I know
logseq - A local-first, non-linear, outliner notebook for organizing and sharing your personal knowledge base. Use it to organize your todo list, to write your journals, or to record your unique life.
CKAN - CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data.humdata.org among many other sites.
Access to Memory (AtoM) - Open-source, web application for archival description and public access.
Shiori - Simple bookmark manager built with Go
LinkAce - LinkAce is a self-hosted archive to collect links of your favorite websites.