HPI
ArchiveBox
Our great sponsors
HPI | ArchiveBox | |
---|---|---|
14 | 248 | |
1,391 | 19,737 | |
- | 3.1% | |
8.7 | 9.7 | |
25 days ago | 13 days ago | |
Python | Python | |
MIT License | MIT |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
HPI
-
First Personal Search Engine Prototype
If this is interesting to you, you should check out the interesting work that karlicoss and others have done with "Human Programming Interface" [0] / [1].
I've been kicking this idea around for quite a few years and have gone through multiple iterations before finding HPI and tossing out all my work in favor of building off theirs.
HPI is a great platform to build your own stuff off and benefit from all the work that has already been done because imo building a good foundation is the hardest part. Sean Breckenridge's HPI-API is super interesting and useful, could likely be worked into this search engine concept, quite sure Sean actually has both newsboat and Firefox modules already made.
I wrote modules of my own and made an authentication wrapped HPI-API and a GraphQL instance but currently in the middle of an infra move so nothing super cool to show off.
Lots of interesting stuff in collecting and leveraging your data. If any of this stuff catches your eye, I highly encourage browsing karlicoss' exobrain [2] because there are some interesting things in there.
[0]: https://github.com/karlicoss/HPI
[1]: my own stuff, not trying to step on Karli, just wanted a 3 letter org for my stuff: https://github.com/hpi
[2]: https://beepb00p.xyz/myinfra.html
-
I put my whole life into a single database
My version of this: Human Programming Interface https://github.com/karlicoss/HPI
It's a bit heavier on the automatic data aggregation side, but has some manual inputs as data sources too.
- “Obtaining My Personal Data from Amazon Was a Nightmare”
-
Gains I'm Seeing from My Second Brain Tool
This is my approach!
I'm using HPI [0] as a sort of universal API for almost all of my data (manual notes, bookmarks, instant messages, internet comments, etc)
Then I use it in tools like Orger [1] and Promnesia [2] which function as my second brain
[0] https://github.com/karlicoss/HPI
[1] https://github.com/karlicoss/orger
[2] https://beepb00p.xyz/promnesia.html
-
Electric Tables – an experiment in personal databases
I suppose HPI[0] kind of is that? ;)
A community repository would be super nice for those. Something along the lines of DefinitelyTyped[1], all managed through git, easily integrates with other stuff (like shown on npmjs.org when the @types package exists), allows maintainers to "own" the adapters they contribute. It's really the N adapters * T time per adapter that really makes it hard for one person to do. That plus monitoring API changes/flakiness of each adapter to make sure the data is still solid.
[0] https://github.com/karlicoss/HPI
[1] https://github.com/DefinitelyTyped/DefinitelyTyped
-
Ask HN: Who Wants to Collaborate?
I'm working on tools/projects to unify, access, interact and use my personal data for quantified self, knowledge management, etc.
A couple of examples:
- https://github.com/karlicoss/HPI#readme
- https://github.com/karlicoss/promnesia#readme
Would very much love to discuss it with other people, collaborate etc.
-
Questions about Emacs
Emacs is born as a human-computer interface, not specifically a PIM/PKM systems (Personal Information Management systems, Personal Knowledge Management systems), those are born at Xerox Parc and they never really took off, unfortunately. You can find a small intro like https://doi.org/10.1145/1480506.1480524 you can find many research articles and thesis on the ACM and other places, try https://karl-voit.at/tagstore/downloads/Voit2011.pdf by /u/publicvoit today in Emacs the most popular of such systems in org-roam, a wrapper/accessing tool for org-mode, witch is probably one of the most powerful, Memacs is another classic one that do something more and something less, Dimitri Gerasimov have it's own public HPI https://beepb00p.xyz/hpi.html with Grasp and Promnesia extensions for Emacs and probably many others do exists but they are used/developed by a small community and while in the "old" wiki book before, "personal note/evernote boom", now "zettelkasten boom" interest keep being there documentation especially at newcomer level is nearly zero... There are research papers, few whole books, tons of articles, but nothing like a complete and simple learning path...
-
Need opinions regarding developing a browser extension(firefox) for taking notes from a webpage
Their author have developed a more complex script collection (HPI, https://beepb00p.xyz/hpi.html) witch is a bit confuse, but seems alive and for certain aspects do extra things then memacs (https://github.com/novoid/Memacs).
-
How often do you refresh reddit profile?
Side note: His Promnesia and HPI projects are just mind blowing!
-
One Hundred Ideas for Computing
Some of my favourites:
- "5. Life engine" and "92. Personal Data API"
I'm working on this in "Human Programming Interface" :) https://github.com/karlicoss/HPI#readme It's far from solving these in general, but it works for me very well.
ArchiveBox
-
Ask HN: What Underrated Open Source Project Deserves More Recognition?
Two projects I greatly appreciate, allowing me to easily archive my bandcamp and GOG purchases (after the initial setup anyways):
https://github.com/easlice/bandcamp-downloader
https://github.com/Kalanyr/gogrepoc
And I recently learned about archivebox, which I think is going to be a fast favorite and finally let me clear out my mess of tabs/bookmarks: https://github.com/ArchiveBox/ArchiveBox
- YaCy, a distributed Web Search Engine, based on a peer-to-peer network
-
Vice website is shutting down
If you really want to save the content for yourself, use something like https://archivebox.io/
I've been running a local instance for a few years now and download/save tech articles all time. I can search and find them as needed.
-
An Introduction to the WARC File
API is coming soon (relatively, it's still a one-man project)! Stay tuned https://github.com/ArchiveBox/ArchiveBox/issues/496
I have an event-sourcing refactor in progress now to allow us to pluginize functionality like the API (similar to Home Assistant with a plugin app sotre), it will take a month or two. Next up is the REST API using the new plugin system.
-
Ask HN: How can I back up an old vBulletin forum without admin access?
I guess your best chance is to use something like https://archivebox.io/.
-
ArchiveBox – open-source self-hosted web archiving
Yeah this is a cool project but it was discussed 2 days ago.
As mentioned by the maintainer there, they even maintain a list of alternatives, very classy:
https://github.com/ArchiveBox/ArchiveBox/wiki/Web-Archiving-...
- ArchiveBox: Open-source self-hosted web archiving
- Linkhut: A Social Bookmarking Site
- Show HN: Rem: Remember Everything (open source)
- Bookmark manager with a focus on organization?
What are some alternatives?
deepstream.io - deepstream.io server
Wallabag - wallabag is a self hostable application for saving web pages: Save and classify articles. Read them later. Freely.
wakatime - Command line interface used by all WakaTime text editor plugins.
paimon-moe - Your best Genshin Impact companion! Help you plan what to farm with ascension calculator and database. Also track your progress with todo and wish counter.
org-roam-ui - A graphical frontend for exploring your org-roam Zettelkasten
SingleFile - Web Extension for saving a faithful copy of a complete web page in a single HTML file
megadetector-gui - A desktop application that makes using MegaDetector's model easier
ArchivesSpace - The ArchivesSpace archives management tool
PowerDeleteSuite - Power Delete Suite for Reddit
grab-site - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Memacs - What did I do on February 14th 2007? Visualize your (digital) life in Org-mode
Archivematica - Free and open-source digital preservation system designed to maintain standards-based, long-term access to collections of digital objects.